Empowering Global Growth at Scale
Site Reliability Engineer – SRE
Location
India
Posted
67 days ago
Salary
0
Seniority
Senior
Job Description
InOrg Global
- Ensure the reliability, availability, and performance of customer platforms and services.
- Bridge the gap between development and operations.
Job Requirements
- At least 3 years of experience in Site Reliability Engineering, DevOps, or a related role.
- Proficiency in the ELK stack (Elasticsearch, Logstash, Kibana) for log monitoring.
- Experience with the TICK stack (Telegraf, InfluxDB, Chronograf, Kapacitor) for metrics monitoring.
- Strong scripting skills in languages such as Python, Bash, or Ruby.
- Understanding of operating systems: Ubuntu (OpenStack) is a must-have; also Debian, Red Hat, etc.
- DevOps platforms: GitLab or similar (good to have).
- Solid understanding of Grafana and Prometheus.
- Experience with ServiceNow or a similar tool.
- Experience with configuration management tools like Ansible, Puppet, or Chef.
- Familiarity with containerization and orchestration tools like Docker and Kubernetes.
- Understanding of cloud platforms (Any of AWS, Azure, or GCP) and their services.
- Bachelor’s degree in Computer Science, Information Technology, or a related field.
- Excellent problem-solving skills and attention to detail.
- Strong communication and collaboration abilities.
More DevOps Engineer Jobs
- Proficiency in scripting, programming, and database management.
- Optimize performance and resource management.
- Document and manage requirements collaboratively.
- Design, implement, and maintain AWS-based infrastructure, ensuring scalability, security, high availability, and compliance with security best practices.
- Implement security policies and practices from development through operations, with a focus on infrastructure hardening, vulnerability analysis, and compliance with OWASP guidelines.
- Administer and optimize Kubernetes (EKS) clusters, using Karpenter for efficient autoscaling and KEDA for event-driven scaling, with particular attention to cluster security.
- Create and maintain continuous integration and delivery (CI/CD) pipelines using Terraform and GitHub Actions for infrastructure provisioning and automated deployments, integrating security and compliance testing.
- Maintain and version infrastructure and services using Helm and ArgoCD, ensuring traceability and integrity of configurations.
- Implement and manage services such as DynamoDB, RDS, Lambda, SQS, and SNS, ensuring efficiency, interoperability, and security across systems.
- Create and optimize Grafana dashboards and alerts, ensuring the collection and analysis of metrics with Prometheus and OpenTelemetry, including monitoring of security aspects.
- Collaborate with development, product, and security teams to optimize performance, reduce costs, and improve the reliability and security of the platform.
DevOps Engineer
Tactibit Technologies
Mission-focused, innovative, and agile cybersecurity and IT operations support for the most demanding missions.
- Build and implement new development tools and infrastructure.
- Develop and implement fully automated CI/CD pipelines in an enterprise cloud environment.
- Design and build serverless applications and infrastructure as code in the cloud following development standards and best practices.
- Test and examine code written by others and analyze results.
- Develop tools, architecture components, and APIs in support of scientific application deployment.
- Collaborate with software developers, scientists, system/network administrators, and other technical experts to automate cloud deployments and manage CI/CD pipelines.
- Perform code reviews, refactoring, and source code repository maintenance.
- Function in a DevOps environment supporting development, testing, operations, and troubleshooting in a mission-critical environment.
- Support security testing, hardening, and assessments to meet strict compliance and operational security requirements.




