Senior Site Reliability Engineer

Location

United States

Posted

2 days ago

Salary

$180K - $200K / year

Seniority

Senior

Job Description

Senior Site Reliability Engineer

PayNearMe, Inc.

Role Description As our Site Reliability Engineer, you will design, build, and maintain the systems and infrastructure that power our applications, ensuring their reliability, scalability, and performance. You will bring a software engineering approach to operations, automating processes, and continuously improving the infrastructure and tools to support our business needs. Responsibilities - Infrastructure Management: Design, implement, and maintain scalable and resilient infrastructure using Terraform for infrastructure as code, ensuring high availability and performance. - Kubernetes and Containers: Deploy, manage, and optimize Kubernetes clusters and containerized applications using Docker. Implement best practices for container orchestration and management. - Systems and Application Monitoring/Observability: Develop and maintain comprehensive monitoring and observability solutions using Datadog. Ensure detailed visibility into system performance and application health. - SLOs and SLA Management: Define, monitor, and maintain Service Level Objectives (SLOs) and Service Level Agreements (SLAs) to ensure reliable and consistent service delivery. - Incident Response and Troubleshooting: Respond to incidents, perform root cause analysis, and implement solutions to prevent recurrence. Participate in post-incident reviews and contribute to blameless postmortems. - Reliability and Production Environment Management: Ensure the reliability and stability of our production environments. Continuously assess and improve system reliability, identifying and addressing potential points of failure. - Automation and Scripting: Develop automation scripts and tools to reduce manual intervention and improve system reliability using Python, Bash, or Go. Implement and improve CI/CD pipelines. - CI/CD Pipeline Management: Enhance and maintain continuous integration and continuous deployment pipelines using GitLab CI. Ensure seamless and reliable deployment processes. - Capacity Planning and Scaling: Assist in capacity planning and ensure that systems are scalable to meet future demands. Implement auto-scaling strategies where applicable. - Security and Compliance: Implement security best practices and ensure compliance with industry standards. Regularly review and update security policies and procedures. - Collaboration and Support: Work closely with development teams to ensure reliability and scalability of new features and services. Provide technical support and guidance on infrastructure-related issues. - Software Engineering for Operations: Develop and maintain internal tools and services that enhance the efficiency and reliability of our operations. - On-Call Rotation: Participate in an on-call rotation to address production issues and collaborate in incident response efforts. Qualifications - +3 years of experience in SRE, DevOps, or a related role. - Cloud Platform Experience: Proficient with cloud platforms such as AWS, GCP, or Azure. Experience with EC2, RDS, VPCs, and security groups is essential. - Kubernetes and Containers: Strong experience with Kubernetes and Docker, including deployment, scaling, and management of containerized applications. - Infrastructure as Code: Expert in using Terraform for infrastructure as code. Proficient with configuration management tools such as Ansible, Puppet, or Chef. - Monitoring and Observability: Extensive experience with monitoring and observability tools like Datadog, Prometheus, Grafana, ELK stack, or Splunk. Skilled in setting up detailed monitoring and logging systems. - SLOs and SLA Management: Proven ability to define, monitor, and maintain SLOs and SLAs to ensure reliable service delivery. - Scripting and Automation: Strong skills in scripting languages like Python, Bash, or Go. Experience automating repetitive tasks and processes. - CI/CD Practices: Familiarity with GitLab CI or similar tool for continuous integration and deployment. Experience in setting up and managing pipelines. - Production Environments: Experience supporting production environments running Go or Ruby/Rails applications. - Tool Development: Ability to write and update tools to support infrastructure and application management, demonstrating the principle that “SRE is what happens when you ask a software engineer to design an operations team.” - DevOps Best Practices: Deep understanding of DevOps principles, practices, and tools to drive continuous improvement in the software development lifecycle. - Soft Skills: Strong organizational skills, attention to detail, and the ability to work collaboratively in a team environment. Excellent documentation skills to ensure accurate and detailed records. - Problem-Solving Ability: Excellent analytical and problem-solving skills to diagnose and resolve complex system issues quickly and effectively. Requirements - The annual base salary range for this role represents PayNearMe's good-faith estimate of the base salary it reasonably expects to offer for this position at the time of hire. Actual compensation may vary based on factors including the candidate's experience, qualifications, skills, and work location. - This position will remain posted until filled. - Annual Salary Range: $180,000 — $200,000 USD. Benefits - Competitive salary and benefits with growth-company options grant. - Fast-paced and professional work culture. - Stock options with standard startup vesting - 1 year cliff; 4 years total. - $50 monthly communication expense stipend to go towards your phone/internet bill. - $250 stipend to enhance your WFH setup. - Reimbursement for peripheral equipment: monitor (up to $400), keyboard and mouse (up to $200). - Premium medical benefits including vision and dental (100% coverage for employees). - Company-sponsored life and disability insurance. - Paid parental bonding leave. - Paid sick leave, jury duty, bereavement. - 401k plan. - Flexible Time Off (our team members typically take off ~3-4 weeks per year). - Volunteer Time Off. - 13 scheduled holidays.

Related Categories

Related Job Pages

More Engineer Jobs

Bertoni Solutions logo

Senior Vendavo Configuration Engineer

Bertoni Solutions

Translating technology into your success

Engineer2 days ago
ContractRemoteTeam 11-50Since 2016H1B No Sponsor

• Lead hands-on configuration, implementation, and optimization of Vendavo Profit Analyzer, Deal Manager, and Pricepoint solutions for enterprise clients • Serve as the primary technical resource for Vendavo platform work • Collaborate with pricing analysts, solution architects, and client stakeholders to deliver measurable margin improvement outcomes • Comfortable operating autonomously in a remote delivery model across multiple time zones

Brazil
Full TimeRemoteTeam 10,001+Since 1966H1B Sponsor

• Lead the end-to-end integration and testing of rack electrical solutions, including medium/low voltage systems (48V DC up to +/- 400V or 800V DC), uninterruptible power supplies (UPS), and power distribution units (PDUs) in varying form factors and product lines. • Participate in customer engagements and site visits to understand and document deployment constraints and gather feedback for design improvements. • Build accurate rack power budgets and model electrical capacity for both current and future high‑density server or switch configurations. • Manage relationships with electrical/power component vendors to build out a robust partner ecosystem consisting of PSUs, PDUs, bus bars, power shelves, and high voltage equipment. • Ensure all rack designs meet safety and certification requirements such as NEC/NFPA, IEEE, and UL/IEC. • Design, review, or integrate rack PDUs and power‑distribution components, including breaker configurations, metering, and load‑balancing features. • Perform detailed failure‑mode analysis, including FMEA and root‑cause investigations for electrical defects or field issues. • Define, validate, and improve factory electrical test strategies, including continuity testing, sequencing, load testing, and safety verification. • Be highly proficient in developing, updating, and interpreting electrical one‑line diagrams for rack‑level and row‑level power architectures, ensuring accurate representation of power flow, grounding, redundancy, and protection devices. • Collaborate with Architects/Compliance/Reliability and global vendors to align electrical architectures with total system design and international data center/power standards (e.g., OCP, ASHRAE, ANSI, TIA-942).

Texas
$164.2K - $295.6K / year
Thermo Fisher Scientific logo

Engineer II, Field Service

Thermo Fisher Scientific

The World Leader In Serving Science

Engineer2 days ago
Full TimeRemoteTeam 10,001+Since 1956H1B Sponsor

• Delivering advanced technical service, maintenance, troubleshooting, and customer support for laboratory instrumentation and integrated systems • Performing installations, preventative maintenance, calibrations, validations, training, and repairs while diagnosing complex hardware, software, and application issues • Maintaining accurate service documentation and ensuring compliance with company quality and safety standards • Collaborating with cross-functional teams to resolve escalations and improve service delivery • Building strong customer relationships and serving as the primary field-based customer advocate • Maximizing instrument uptime, operational efficiency, and overall customer satisfaction through proactive communication and responsiveness

Washington
$31 - $50 / hour
Solenis logo

Process Safety Engineer

Solenis

Building a safer & healthier world through sustainable innovation.

Engineer2 days ago
Full TimeRemoteTeam 10,001+H1B Sponsor

• The Process Safety Engineer is responsible for developing, facilitating, implementing, and sustaining process safety programs that support safe, reliable, and compliant manufacturing operations. • This role provides technical leadership and expertise in process hazard analysis, risk management, engineering standards, and capital project execution while supporting the organization's Process Safety Roadmap. • Lead and facilitate process hazard analyses (PHAs), risk assessments, and hazard reviews for capital projects and targeted manufacturing processes. • Identify process safety risks and recommend mitigation measures to reduce operational and project-related hazards. • Support the implementation and closure of action items resulting from hazard evaluations and risk assessments. • Develop, maintain, and continuously improve process safety standards, procedures, and best practices that align with corporate process safety objectives and the Engineering Process Safety Roadmap. • Collaborate with engineering, operations, and EHS teams to integrate process safety principles into business processes. • Provide process safety technical expertise to capital project teams throughout project design, development, and execution phases. • Review engineering designs and project documentation to ensure compliance with process safety requirements and standards. • Partner with Environmental, Health, and Safety (EHS) teams to plan and conduct process safety audits and assessments.

United States
$123.7K - $181.4K / year