Job Closed
This listing is no longer active.
Scratch Financial is the world's simplest patient financing solution.
Staff DevOps Engineer
Location
California
Posted
85 days ago
Salary
$130K - $160K / year
Seniority
Lead
Job Description
Company Description
NBCUniversal is one of the world's leading media and entertainment companies. We create world-class content, which we distribute across our portfolio of film, television, and streaming, and bring to life through our global theme park destinations, consumer products, and experiences. We own and operate leading entertainment and news brands, including NBC, NBC News, NBC Sports, Telemundo, NBC Local Stations, Bravo, and Peacock, our premium ad-supported streaming service. We produce and distribute premier filmed entertainment and programming through our powerhouse film and television studios, including Universal Pictures, DreamWorks Animation, and Focus Features, and the four global television studios under the Universal Studio Group banner, and operate industry-leading theme parks and experiences around the world through Universal Destinations & Experiences, including Universal Orlando Resort, home to Universal Epic Universe, and Universal Studios Hollywood. NBCUniversal is a subsidiary of Comcast Corporation. Visit www.nbcuniversal.com for more information.
Our impact is rooted in improving the communities where our employees, customers, and audiences live and work. We have a rich tradition of giving back and ensuring our employees have the opportunity to serve their communities. We champion an inclusive culture and strive to attract and develop a talented workforce to create and deliver a wide range of content reflecting our world.
Job Description
As the DevOps Lead Engineer, you will be responsible for spearheading our DevOps initiatives. You will foster a culture of automation, continuous integration, observability and delivery. Your efforts will support consumer data driven advertising and marketing products, standardized consumer identity solutions, and machine learning initiatives for NBCUniversal and its brands.
You will collaborate with cross-functional teams to optimize our cloud infrastructure, ensuring high availability, scalability, and security. Your expertise in AWS services, containerization technologies, monitoring tools, and cloud architecture will be pivotal in designing and implementing robust DevOps solutions that streamline our development, testing, and deployment processes.
Responsibilities:
- Develop and lead the implementation of DevOps strategies and best practices to improve the efficiency, reliability, and scalability of our cloud-based applications.
- Design, build, and maintain robust continuous integration and continuous delivery pipelines to automate the software development and deployment lifecycle.
- Utilize your in-depth knowledge of AWS services to architect, deploy, and manage scalable and resilient cloud infrastructure solutions.
- Implement containerization technologies (e.g., Docker, Kubernetes) to orchestrate application deployment and ensure consistent environments across various stages of development.
- Implement effective monitoring and logging solutions to proactively identify performance bottlenecks, security issues, and system anomalies. Develop auto-scaling solutions to meet fluctuating demand.
- Design and optimize cloud architecture to ensure high availability, disaster recovery, and cost-effectiveness.
- Implement security measures and best practices to safeguard our cloud infrastructure and applications against potential threats and vulnerabilities.
- Lead and mentor a team of DevOps engineers, fostering a collaborative and innovative work environment.
- Promote automation in all aspects of DevOps and maintain detailed documentation of infrastructure, processes, and procedures.
Qualifications
- Bachelor's degree in Computer Science, Software Engineering, or a related field.
- Proven experience of 6+ years in DevOps and cloud engineering, with at least 2 years in a leadership or senior role.
- Expertise in building and managing CI/CD pipelines using tools like Jenkins, GitLab CI/CD, or AWS CodePipeline.
- Strong proficiency in AWS services, including EC2, S3, RDS, Lambda, IAM, and VPC.
- Solid understanding of containerization technologies (e.g., Docker, Kubernetes) and container orchestration.
- Experience with infrastructure-as-code tools (e.g., CloudFormation, Terraform).
- Familiarity with monitoring and logging tools such as Prometheus, Grafana, ELK stack, Splunk, Datadog and CloudWatch.
- Knowledge of cloud security best practices and compliance standards (e.g., CIS benchmarks, CCPA, GDPR).
- Strong problem-solving skills and the ability to troubleshoot complex issues in a cloud environment.
- Excellent communication and leadership skills to effectively collaborate with cross-functional teams.
Additional Requirements:
- Fully Remote: This position has been designated as fully remote, meaning that the position is expected to contribute from a non-NBCUniversal worksite, most commonly an employee’s residence.
This position is eligible for company sponsored benefits, including medical, dental and vision insurance, 401(k), paid leave, tuition reimbursement, and a variety of other discounts and perks. Learn more about the benefits offered by NBCUniversal by visiting the Benefits page of the Careers website.
Salary range: $130,000 - $160,000 (bonus eligible)
We are accepting applications for this position on an ongoing basis.
Additional Information
As part of our selection process, external candidates may be required to attend an in-person interview with an NBCUniversal employee at one of our locations prior to a hiring decision.
NBCUniversal's policy is to provide equal employment opportunities to all applicants and employees without regard to race, color, religion, creed, gender, gender identity or expression, age, national origin or ancestry, citizenship, disability, sexual orientation, marital status, pregnancy, veteran status, membership in the uniformed services, genetic information, or any other basis protected by applicable law.
If you are a qualified individual with a disability or a disabled veteran, you have the right to request a reasonable accommodation if you are unable or limited in your ability to use or access nbcunicareers.com as a result of your disability. You can request reasonable accommodations by emailing [email protected].
For LA County and City Residents Only: NBCUniversal will consider for employment qualified applicants with criminal histories, or arrest or conviction records, in a manner consistent with relevant legal requirements, including the City of Los Angeles' Fair Chance Initiative For Hiring Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act, where applicable.
- Business Segment: Operations & Technology
- Compensation: USD 130000 - USD 160000 - yearly
Benefits
- 401(K), 401(K) matching, Adoption Assistance, Childcare benefits, Commuter benefits, Company equity, Company-sponsored outings, Company sponsored family events, Continuing education stipend, Customized development tracks, Dental insurance, Disability insurance, Volunteer in local community, Employee stock purchase plan, Family medical leave, Fitness stipend, Flexible Spending Account (FSA), Generous parental leave, Generous PTO, Health insurance, Job training & conferences, Open door policy, Life insurance, Charitable contribution matching, Mentorship program, Paid volunteer time, Online course subscriptions available, Onsite gym, Open office floor plan, Paid holidays, Paid industry certifications, Pair programming, Paid sick days, Onsite office parking, Partners with nonprofits, Performance bonus, Pet insurance, Promote from within, Recreational clubs, Lunch and learns, Relocation assistance, Return-to-work program post parental leave, Team based strategic planning, OKR operational model, Team workouts, Continuing education available during work hours, Tuition reimbursement, Vision insurance, Wellness programs, Mental health benefits, Fertility benefits, Personal development training
More DevOps Engineer Jobs
Principal DevOps Engineer
Sagent
Sagent powers banks and lenders to make loans and homeownership simpler and safer for millions of consumers.
• Collaborate with senior leadership to develop and refine the company’s cloud strategy, ensuring alignment with business goals.
• Stay abreast of emerging cloud technologies and assess their applicability and potential benefits to our organization.
• Design robust, scalable, and highly available cloud architectures that meet business requirements and align with industry best practices.
• Architect solutions adhering to security, compliance, and performance requirements, incorporating GCP and Azure platforms.
• Provide technical leadership and mentorship to the cloud engineering team.
• Lead architecture discussions and guide development teams in implementing cloud solutions, focusing on Kubernetes and container orchestration with Helm.
• Implement cloud solutions hands-on, including infrastructure setup, configuration, and troubleshooting.
• Develop, troubleshoot, and maintain CI/CD pipelines using Azure DevOps and integrate cloud components and services with cross-functional teams.
• Continuously monitor and optimize cloud infrastructure for performance, cost, and scalability.
• Recommend improvements to existing cloud-based systems for enhanced efficiency and effectiveness.
• Create and maintain comprehensive documentation related to cloud architecture, configurations, and processes.
• Generate regular reports on system performance and usage.
• Effectively collaborate with internal stakeholders, vendors, and partners on cloud-related initiatives.
• Communicate complex technical concepts to non-technical stakeholders clearly and concisely.
• Design, build, and operate scalable ML infrastructure on GCP (GKE), supporting both experimentation and production workloads for LLMs and NLP systems.
• Manage Kubernetes-based environments (GKE): deployment, scaling, upgrades, and reliability of training and inference workloads across GPU/TPU/CPU pools.
• Build and maintain CI/CD pipelines (GitHub Actions, Jenkins) to automate testing, training, and deployment of ML services and infrastructure.
• Implement infrastructure as code (Terraform, Ansible) to provision and manage cloud resources in a reproducible, secure, and cost-efficient way.
• Ensure observability of ML systems: monitoring, logging, and alerting for infrastructure, pipelines, and production inference workloads.
• Collaborate with ML engineers and Data Engineers to design and support reliable training and inference pipelines.
• Optimize resource utilization and cost, improving efficiency of training and serving infrastructure.
• Troubleshoot and resolve issues across the ML platform - from data pipelines to distributed training and production deployments.
• Contribute to engineering best practices: code reviews, automation, and continuous improvement of platform reliability and developer experience.
• Become a member of a highly collaborative engineering team offering a unique blend of Cloud Infrastructure Administration, Site Reliability Engineering, Security Operations, and Vulnerability Management across multiple clients.
• Coordinate with client product teams, engineering team members, and other stakeholders to monitor and maintain a secure and resilient cloud-hosted infrastructure to established SLAs in both production and non-production environments.
• Innovate and implement using automated orchestration and configuration management techniques. Understand the design, deployment, and management of secure and compliant enterprise servers, network infrastructure, boundary protection, and cloud architectures using Infrastructure-as-Code.
• Create, maintain, and peer review automated orchestration and configuration management codebases, as well as Infrastructure-as-Code codebases. Maintain IaC tooling and versioning within Client environments.
• Implement and upgrade client environments with CI/CD infrastructure code and provide internal feedback to development teams for environment requirements and necessary alterations.
• Work across AWS, Azure and GCP, understanding and utilizing their unique native services in client environments.
• Configure, tune, and troubleshoot cloud-based tools, and manage cost, security, and compliance for the Client’s environments.
• Monitor and resolve site stability and performance issues related to functionality and availability.
• Work closely with client DevOps and product teams to provide 24x7x365 support to environments through Client ticketing systems.
• Support definition, testing, and validation of incident response and disaster recovery documentation and exercises.
• Participate in on-call rotations as needed to support Client critical events and operational needs that may lie outside of business hours.
• Support testing and data reviews to collect and report on the effectiveness of current security and operational measures, in addition to remediating deviations from current security and operational measures.
• Maintain detailed diagrams representative of the Client’s cloud architecture.
• Maintain, optimize, and peer review standard operating procedures, operational runbooks, technical documents, and troubleshooting guidelines.
Senior Engineer, FinOps, DevOps
Thinkahead Consultant Psychologist Pty Ltd
We get to the heart of the matter... real people... real solutions
• Develop and maintain cost allocation, budgeting, and chargeback models across cloud accounts (AWS, Azure, GCP).
• Implement and enforce tagging and resource hierarchy standards, ensuring >90% coverage for cost-critical tags (e.g., Application, Environment, Cost Center).
• Build and publish cost visibility dashboards and reports using Power BI, QuickSight, Looker, or other FinOps tooling.
• Support unified multi-cloud cost reporting and forecasting for engineering and finance teams.
• Execute rightsizing, scheduling, and lifecycle management of cloud resources across AWS, Azure, and GCP (EC2, VM, GCE, RDS, S3, Storage, Networking).
• Manage and optimize Reservations, Savings Plans, Committed Use Discounts (CUDs), and licensing benefits (BYOL, AHB).
• Implement policy-as-code and governance using tools like Terraform, AWS Config, Azure Policy, or GCP Organization Policies.
• Participate in anomaly detection, spend forecasting, and automation of remediation workflows.
• Contribute to CI/CD pipeline management, infrastructure automation, and GitOps practices using Azure DevOps, GitHub Actions, AWS CodePipeline, or Google Cloud Build.
• Provide actionable insights in monthly cost and performance reviews with engineering and product stakeholders.
• Partner with Finance and Procurement teams on budgeting, forecasting, and billing validation.
• Collaborate with SRE and Platform teams to balance cost efficiency, performance, and reliability.
• Maintain operational hygiene through scripting, compliance audits, and automation.



