Fabric Group logo
Fabric Group

Good Problems. Unlocking value from business challenges

Senior Consultant – Site Reliability Engineering

DevOps EngineerDevOps EngineerContractRemoteSeniorTeam 51-200Since 2006H1B No SponsorCompany SiteLinkedIn

Location

India

Posted

1 day ago

Salary

0

Seniority

Senior

Job Description

Senior Consultant – Site Reliability Engineering

Fabric Group

• Consultative Ownership: Work with autonomy to own problems and deliver solutions, acting as a bridge between development and operations. • Observability Architecture: Design and implement robust monitoring solutions using the LGTM stack to ensure system health and performance. • Reliability Strategy: Advise clients on defining meaningful SLOs/SLIs and managing error budgets to balance innovation with stability. • AI Assistance: Drive use of AI Agents or AI tools for intelligent automation and improving operational efficiency. • Incident Leadership: Lead post-incident reviews (Blameless Post-Mortems) to identify systemic improvements and reduce future toil. • Mentorship: Coach less experienced engineers within Fabric and our client teams on SRE principles and modern infrastructure patterns. • Advising our clients on the right technical decisions and advocating for the right practices to use. • Participate in interviewing and recruitment based on business needs. • Thought Leadership: Contribute to the SRE community through blog posts, meetups, or internal knowledge sharing. • Operational Support & Availability: Rotational Support Coverage: Participate in a sustainable team rotation to provide extended service coverage (including weekends) for business-critical systems. • Incident Response: Act as a primary responder for high-priority (P1/P2) incidents during your rostered shift, focusing on rapid restoration and clear stakeholder communication.

Job Requirements

  • Strong expertise in Observability: Deep comfort with Grafana, including the LGTM stack (Loki, Grafana, Tempo, Mimir) or Grafana Cloud, OpenTelemetry.
  • Container Orchestration: Solid experience with Kubernetes management, configuration, and troubleshooting in production.
  • Good understanding of AI Agent frameworks and tools like Grafana AI Assistant.
  • Cloud Proficiency: Hands-on experience with GCP or AWS, including networking, security, and cloud-native services.
  • Modern Deployment: Proven experience implementing GitOps (ArgoCD) and CI/CD pipelines (GitLab CI, GitHub Actions, etc.).
  • Infrastructure as Code (IaC): Experience with tools like Terraform.
  • Automation & Scripting: Proficiency in at least one language (e.g., Python, Go, or Bash) for building tooling and automating operational tasks.
  • Incident Management: Experience with on-call rotation tools (Grafana on-call, Opsgenie) and a strong commitment to a blameless culture.

Benefits

  • Flexibility to support work-life balance while maintaining professional independence.
  • Contract duration is typically 12 months, with the possibility of renewal based on project needs and performance.
  • Payment is in daily rates in Australian dollars, reflecting your experience and meeting the local Indian market.
  • Contractors are fully integrated into project teams and Fabric’s culture.

Related Categories

Related Job Pages

More DevOps Engineer Jobs

Quzara LLC logo

Site Reliability Engineer – Google Cloud Platform

Quzara LLC

Cybersecurity & Managed Services firm providing Technical Advisory support to Federal and Commercial customers.

Full TimeRemoteTeam 11-50Since 2015H1B No Sponsor

• Design, build, and operate secure GCP cloud foundations and landing zones for federal and regulated environments, including organization hierarchy, policy guardrails, Assured Workloads, and Cloud Foundation Toolkit-based deployment patterns. • Engineer and maintain secure GCP network architectures, including Shared VPC, hub-and-spoke topology, VPC Service Controls, Access Context Manager, Private Google Access, Private Service Connect, Cloud NGFW, Cloud Armor, load balancing, DNS, NAT, VPN, and Interconnect under least-exposure principles. • Implement and administer identity, access, privileged access, and encryption controls, including least-privilege IAM, custom roles, IAM Conditions, deny policies, service-account hygiene, Workload Identity Federation, Privileged Access Manager, Access Approval, Access Transparency, BeyondCorp Enterprise, IAP, Cloud KMS, Cloud HSM, CMEK, and Cloud EKM. • Develop and operate security monitoring, threat detection, and response capabilities using Chronicle/Google Security Operations, Security Command Center, curated detections, YARA-L, threat intelligence, SOAR playbooks, telemetry pipelines, and integration with MDR/SOC workflows. • Build and maintain logging, audit, observability, and reliability capabilities using Cloud Audit Logs, aggregated log sinks, retention policies, BigQuery/Chronicle exports, Cloud Monitoring, Cloud Logging, dashboards, uptime checks, SLIs/SLOs, alerting, on-call operations, incident response, and blameless postmortems. • Secure and operate cloud workloads and platforms, including Sensitive Data Protection/Cloud DLP for CUI discovery and de-identification, hardened GKE environments, Workload Identity, Shielded/Confidential nodes, network policy, GKE Policy Controller, Binary Authorization, and secure Artifact Registry image promotion. • Automate infrastructure, security, compliance, and reliability operations using Terraform, Infrastructure Manager, Cloud Foundation Toolkit, policy-as-code, secure CI/CD pipelines, Cloud Build, Cloud Deploy, and scripting in Python, Go, or Bash to reduce manual work and operational toil. • Translate federal security and compliance requirements into GCP configurations and audit-ready evidence, including NIST SP 800-53, NIST SP 800-171, FedRAMP, CMMC, control inheritance, customer responsibility matrices, RMF/FedRAMP authorization support, and assessor/AO documentation. • Partner directly with customers and internal stakeholders to communicate technical requirements, operational risks, compliance expectations, and implementation status to both technical and non-technical audiences.

United States
Sedona Digital logo

Senior Application DevOps Engineer

Sedona Digital

Experts in software development and cloud technologies.

ContractRemoteTeam 51-200H1B No Sponsor

• Join Sedona Digital, a fast-growing scale-up organization with an ambition to be recognized as one of the leading technology companies servicing high tech, global enterprises across Technology, Finance, and Life Sciences sectors. • You will join a team working with cutting-edge technologies and striving to utilise cloud services to the maximum. • Your role will be to provide the necessary infrastructure using Terraform, develop CI/CD pipelines in GitLab or Jenkins, and create automations to support the development teams. • Additionally, you will ensure that the underlying infrastructure runs smoothly, and the systems and tools work as expected. • You'll be responsible for helping developers with troubleshooting and providing consultation in case of alerts. • As a senior engineer, you will also take ownership of specific domains or projects within the business, leading technical direction and ensuring alignment with strategic goals. • You will play a key role in mentoring and overseeing other engineers, fostering collaboration, sharing best practices, and helping to drive continuous improvement across the team. • Your activities will include collaborating with the development teams to plan, deploy and administer applications running on AWS and Kubernetes, initiating and driving the adoption of technologies, mentoring other engineers, overseeing engineering processes, implementing and overseeing cloud migration projects, and ensuring compliance with company standards and best practices.

Romania
TechBiz Global logo

Senior AI DevOps, LLMOps

TechBiz Global

TechBiz Global is a leading IT recruitment and software development company

Full TimeRemoteTeam 51-200H1B No Sponsor

• Automation of Build-to-Production - Design and implement robust CI/CD pipelines tailored for AI • Develop specialized workflows for PromptOps • Automate the deployment of Agentic workflows • Provision and manage high-performance compute environments (GPU clusters, TPU pods) • Define and enforce Policy-as-Code for AI endpoints • Maintain a consistent environment across Hybrid Infrastructure • Architect Progressive Delivery strategies for AI • Build 'Evaluation-in-the-Loop' gates within the pipeline • Establish deep observability into Inference Endpoints

Poland
Sedona Digital logo

Senior Application DevOps Engineer

Sedona Digital

Experts in software development and cloud technologies.

ContractRemoteTeam 51-200H1B No Sponsor

Role Description Join Sedona Digital, a fast-growing scale-up organization with an ambition to be recognized as one of the leading technology companies servicing high tech, global enterprises across Technology, Finance, and Life Sciences sectors. Our global client base needs builders: engineers and developers who love technology, have deep expertise in software, engineering, and cloud technologies, and importantly, have a passion for culture and customers. - Collaborate with the development teams to plan, deploy and administer applications running on AWS and Kubernetes - Maintain the AWS infrastructure using Infrastructure as Code principles and Terraform - Initiate and drive the adoption of technologies and use of good patterns for development and operations - Mentoring and overseeing other engineers - Lead and manage specific technical domains or projects, ensuring architectural consistency, scalability, and alignment with business objectives - Oversee and improve engineering processes and practices, identifying opportunities for automation, optimization, and enhanced system reliability - Implement and oversee cloud migration projects - Provide guidance for re-platform and re-factor of current cloud infrastructure - Communicate to the senior leadership, the cloud migration projects progress - Preserve business continuity 24/7 with minimum downtime and financial impact - Investigate and perform regular assessments of cloud deployments in compliance with the company’s standards and best practices - Ensure the Company’s deployment standards and pillars are followed in cloud solutions and resources - Stay up to date with the latest tools and trends in the industry - Follow and provide training regarding new and current technologies and services used Qualifications - BSc/MSc in Computer Science, or a similar discipline - Overall knowledge of solution design and deployment projects - Experience with Docker - Experience in or knowledge of CI/CD with GitLab and/or Jenkins - Experience in AWS technologies, services and ecosystem - Experience with Kubernetes - Knowledge of Infrastructure as Code (IaC) concepts and tools, preferably Terraform - Experience with versioning tools such as Git - Distinctive organisation and documentation skills - Excellent time management skills - Essential Linux and Windows admin, networking and scripting skills Benefits - Remote/home working - Opportunity to work in a rapidly growing scale-up organisation - Exposure to complex, global client engagements - Training on market trends and client needs - Ongoing learning and development opportunities - Competitive compensation package

Worldwide