Job Closed
This listing is no longer active.
We are a Y-Combinator-backed startup building your AI-powered Recruiter Agent
DevSecOps Engineer
Location
India
Posted
79 days ago
Salary
0
Seniority
Senior
Job Description
DevSecOps Engineer
Weekday (YC W21)
- Responsible for integrating security practices into the DevOps lifecycle.
- Build and maintain scalable, secure, and reliable cloud infrastructure.
- Collaborate closely with software engineers, security teams, and infrastructure specialists.
- Design and manage cloud environments, automating infrastructure provisioning.
- Strengthen CI/CD pipelines and embed security controls throughout the software development lifecycle.
Job Requirements
- 4–7 years of experience in DevOps, DevSecOps, or cloud infrastructure engineering.
- Strong hands-on experience with AWS services such as EC2, S3, IAM, VPC, CloudWatch, Lambda, and RDS.
- Experience implementing secure CI/CD pipelines using tools like Jenkins, GitHub Actions, GitLab CI, or similar platforms.
- Solid understanding of Infrastructure as Code tools such as Terraform or AWS CloudFormation.
- Experience with containerization and orchestration technologies like Docker and Kubernetes.
- Knowledge of cloud security best practices, identity and access management, encryption, and compliance frameworks.
- Familiarity with security scanning tools for code, containers, and infrastructure.
- Experience implementing monitoring and observability tools for production systems.
- Strong scripting and automation skills using languages such as Python, Bash, or similar.
- Good understanding of networking, system architecture, and distributed systems.
More DevOps Engineer Jobs
DevOps Lead
Resolve Tech Solutions
ERP/SAP Modernization | Managed Cloud Delivery Services | Advanced Tech - AI / ML | Cyber Security | Digital Signature
- Lead the design and implementation of scalable, resilient cloud infrastructure across AWS, Azure, or GCP environments
- Architect, build, and optimize CI/CD pipelines using tools such as Jenkins, GitLab CI, GitHub Actions, or Azure DevOps
- Champion infrastructure-as-code practices using Terraform, Ansible, or similar automation tools
- Design and manage containerized environments using Docker and orchestrate workloads with Kubernetes or managed Kubernetes services
- Establish and enhance monitoring, logging, and observability platforms using tools such as Prometheus, Grafana, Datadog, or cloud-native monitoring solutions
- Lead DevOps team members by providing technical guidance, mentorship, and performance support
- Collaborate cross-functionally with engineering, security, and product teams to streamline release cycles and improve deployment reliability
- Implement and enforce cloud security best practices, governance standards, and compliance requirements
- Drive cloud cost optimization strategies and infrastructure efficiency initiatives
- Promote a culture of automation, reliability, and continuous improvement across platform and engineering teams
- Troubleshoot complex infrastructure and deployment issues, ensuring minimal disruption to business operations
- Contribute to documentation, standards development, and long-term platform architecture strategy
Senior Site Reliability Engineer
ClickHouse
ClickHouse, Inc. is a database management system that allows users to generate analytical reports using real-time SQL queries. The company’s technology works
- Collaborate with various engineering teams in ClickHouse to design and implement scalable, secure, and highly available systems for ClickHouse.
- Establish and manage service level objectives (SLOs) and service level agreements (SLAs) for ClickHouse Cloud.
- Ensure all the infrastructure components in ClickHouse Cloud (including Dataplane, Control Plane, ClickHouse Core, etc.) have monitoring and alerting in place to ensure timely detection and resolution of incidents.
- Enhance and refine incident response processes and post-mortem analysis for any outages in ClickHouse Cloud, including working with the support team to communicate with the impacted customers.
- Continuously improve the reliability and performance of our ClickHouse services.
- Plan, enable, and drive Chaos initiatives across Engineering teams, based upon internal priorities.
- Manage on-call processes to respond to performance and reliability issues, and establish best practices for coordinating escalation to resolve issues and minimize downtime.
Staff Site Reliability Engineer
SmarterDx
Improving clinical and financial outcomes with physician-validated AI for documentation and coding.
- Define and evolve reliability standards for the SmarterDx platform, including SLIs, SLOs, and error budgets that align engineering work with customer impact.
- Implement a “reliability” platform using Terraform and infrastructure-as-code best practices.
- Enhance observability systems (metrics, logs, traces, alerting) to provide actionable insights and reduce mean time to detect (MTTD) and resolve (MTTR).
- Lead incident response, drive blameless postmortems, and implement systemic improvements to prevent recurrence.
- Reduce operational toil through automation, self-healing systems, and improved deployment and rollback mechanisms.
- Provide production support for the SmarterDx platform, applying SRE principles to ensure availability, performance, and data durability.
- Research, prototype, and advocate for new reliability practices, tooling, and architectural improvements across the engineering organization.
Senior DevOps Engineer
8x8
8x8 is a Software-as-a-Service (SaaS) provider that delivers business communication solutions in an effort to empower workforces across the globe to "collaborate faster and work sm
- Evaluate product designs and architectures against operational excellence standards for availability, security, performance, and cost optimization.
- Partner with product development teams to proactively manage cloud capacity planning - anticipating growth and ensuring infrastructure meets performance and scalability requirements.
- Plan, implement, and continuously improve a multi-account, multi-region AWS cloud platform with focus on scalability, reliability, performance, and high availability.
- Design and maintain infrastructure automation using Terraform, CloudFormation, Ansible, and CI/CD pipelines (GitHub Actions, Atlantis) to accelerate deployments and reduce operational toil.
- Develop and manage access control frameworks, implementing IAM policies, roles, and permission boundaries following least-privilege principles.
- Implement and maintain observability for the cloud platform including monitoring, alerting, logging, and self-healing automation where applicable.
- Document platform designs, operational runbooks, and troubleshooting procedures.
- Provide 2nd and 3rd level support, liaise with network teams and third-party vendors for problem resolution, and participate in incident root cause analysis.
- Contribute to cloud cost optimization efforts through right-sizing, reserved capacity planning, and resource lifecycle management.




