Job Closed

This listing is no longer active.

Mighty Acorn Digital logo
Mighty Acorn Digital

At Mighty Acorn, we make it easier for governments to deliver world-class digital services.

Software Engineer IV – Infra/SRE

DevOps EngineerDevOps EngineerFull TimeRemoteLeadTeam 1-10H1B No SponsorCompany SiteLinkedIn

Location

United States

Posted

24 days ago

Salary

$130K - $150K / year

Seniority

Lead

Bachelor Degree7 yrs expEnglishAWSJavaScriptTerraformTypeScript

Job Description

Software Engineer IV – Infra/SRE

Mighty Acorn Digital

• Designing and implementing a comprehensive monitoring strategy for a high-availability application, balancing immediate operational needs with long-term sustainability. • Building and maintaining observability infrastructure using Terraform, integrating AWS-native monitoring services with New Relic to provide full-stack visibility. • Collaborating closely with application engineers working on TypeScript/JavaScript services running on AWS ECS, with RDS and EventBridge in the stack — understanding the application well enough to instrument it effectively. • Establishing reliability standards, runbooks, alerting thresholds, and incident response practices that the broader team can own and operate. • Leading and mentoring a technical team, setting direction, unblocking others, and coaching engineers through ambiguous and high-pressure situations. • Working directly with government stakeholders to communicate the reliability posture of the application, surface risk, and build confidence in the systems you're responsible for.

Job Requirements

  • 7+ years of engineering experience, with significant time spent in SRE, platform, or infrastructure-focused roles.
  • Hands-on experience building and managing infrastructure with Terraform in AWS environments.
  • Deep familiarity with AWS observability tooling and services, including hands-on experience with ECS, RDS, and EventBridge.
  • Experience implementing and operating APM and monitoring platforms such as New Relic.
  • Ability to read, understand, and work alongside TypeScript/JavaScript application codebases — enough to instrument effectively and debug across the stack.
  • Experience operating systems that process personally identifiable information (PII), with sound judgment about the operational and security practices that entails.
  • Demonstrated experience leading a technical team in a high-trust, high-velocity environment — setting direction, maintaining standards, and developing the people around you.
  • Experience working in or alongside government agencies, with an understanding of the organizational dynamics and constraints involved.
  • Strong communication skills across technical and non-technical audiences, including the ability to translate complex reliability concepts for stakeholders without an engineering background.
  • Curiosity, patience, and resilience when navigating ambiguous or rapidly changing environments.
  • A Bachelor's degree (or equivalent experience) is contractually required for this role.

Benefits

  • Profit sharing bonus available after 90 days

Related Categories

Related Job Pages

More DevOps Engineer Jobs

Senior DevOps Engineer

STR

STR makes the world a safer place by developing technology and applying it to solve emerging national security challenges.

DevOps Engineer24 days ago
Full TimeRemoteTeam 800Since 2010

About the Team: The Real-time Architectures, Integration, and Demonstration (RAID) Group focuses on transition of algorithms from concept to real-time software, providing open architecture expertise, and facilitating integration of capabilities for experimentation, test, and deployment. The Role: As a Senior DevOps Engineer you will establish and maintain a continuous integration / continuous deployment (CI/CD) infrastructure for real-time embedded systems software. You will work with software development teams to implement build automation, testing frameworks, containerization, and deployment pipelines for complex embedded and distributed systems. You will be responsible for creating and maintaining development environments, build systems, and tooling that enable rapid prototyping and transition of algorithms from research to real-time implementation. What you will do: - Design, implement, and maintain CI/CD pipelines for multi-language, multi-platform software projects - Establish and maintain containerized development and deployment environments (Docker, Kubernetes) - Implement automated build systems for C/C++ and Python codebases using CMake, Conan, and similar tools - Create and maintain automated testing frameworks (unit tests, integration tests, system tests) - Implement security scanning and vulnerability assessment tools in CI/CD pipelines - Manage version control workflows and branching strategies for collaborative development - Configure and maintain GitLab runners, build agents, and testing infrastructure across multiple sites - Develop scripts and tools to automate software deployment to embedded and distributed systems - Monitor build health, test coverage, code quality metrics, and system performance - Support developers across multiple teams in resolving build, dependency, and environment issues - Document DevOps processes, tools, best practices, and technical documentation - Collaborate with software engineers, systems engineers, security, IT, and integration teams - Support approximately 20% travel for system integration events and customer demonstrations Who you are: - Active Secret security clearance with ability to obtain and maintain a Top Secret clearance, for which U.S. citizenship is a government requirement - BS, MS, or PhD in Computer Science, Computer Engineering, Electrical Engineering, or a related field with relevant experience depending on degree (BS +5 years, MS +3 years, PhD +1 year) - Strong proficiency with Linux system administration and embedded Linux environments - Experience with: - CI/CD platforms (GitLab CI, Jenkins, or similar) - Scripting languages (Bash, Python) - Containerization technologies (Docker, Kubernetes) - Build systems for C/C++ projects (CMake, Make, or similar) - Version control systems (Git) and collaborative development workflows Even Better: - Active Top Secret clearance (TS/SCI preferred) - DevOps or software infrastructure engineering experience - Experience with: - Real-time embedded systems software development - Package management systems (Conan, vcpkg, Artifactory) - Static analysis tools, code coverage tools, and performance profiling - Security scanning and vulnerability assessment tools (SonarQube, Trivy, etc.) Join us and be part of a team that's making an impact at the forefront of technology and innovation. Pay Information Full-Time Salary Range: $134,000 - $184,000 The salary range listed is based on external market data. Offers are based on factors, such as but not limited to, the candidate’s experience, education, training, key skills/critical skills, security clearances, and prevailing market and business conditions. STR is a growing technology company with locations near Boston, MA, Arlington, VA, near Dayton, OH, Melbourne, FL, and Carlsbad, CA. We specialize in advanced research and development for defense, intelligence, and national security in: cyber; next generation sensors, radar, sonar, communications, and electronic warfare; and artificial intelligence algorithms and analytics to make sense of the complexity that is exploding around us. STR is committed to creating a collaborative learning environment that supports deep technical understanding and recognizes the contributions and achievements of all team members. Our work is challenging, and we go home at night knowing that we pushed the envelope of technology and made the world safer. STR is not just any company. Our people, culture, and attitude along with their unique set of skills, experiences, and perspectives put us on a trajectory to change the world. We can't do it alone, though - we need fellow trailblazers. If you are one, join our team and help to keep our society safe! Visit us at www.str.us for more info. STR is an equal opportunity employer. We are fully dedicated to hiring the most qualified candidate regardless of race, color, religion, sex (including gender identity, sexual orientation and pregnancy), marital status, national origin, age, veteran status, disability, genetic information or any other characteristic protected by federal, state or local laws. If you need a reasonable accommodation for any portion of the employment process, email us at appassist@str.us and provide your contact info. Pursuant to applicable federal law and regulations, positions at STR require employees to obtain national security clearances and satisfy the requirements for compliance with export control and other applicable laws.

Georgia
$134K - $184K / year
Rain Technologies Inc. logo

Senior DevOps Engineer

Rain Technologies Inc.

Rain is the world's first AI Financial Health Platform, serving 3.5 million employees at leading organizations like McDonald's, Marriott, and T-Mobile. Rain works in the background to optimize every employee's financial life to prevent shortfalls and build long-term stability. Backed by top investors including QED and Prosus, Rain has raised $150M in venture funding to fuel our next stage of hyper growth.

DevOps Engineer24 days ago
Full TimeRemoteTeam 51-200

Role Description As a Senior DevOps Engineer at Rain, you will play a central role in designing, building, and operating our cloud infrastructure as we continue to scale to millions of users globally. You will work alongside a small, high-performing cloud team to drive automation, improve observability, and ensure the reliability and security of our platform. This role goes beyond keeping the lights on — you will actively shape how we build and operate infrastructure and influence architectural decisions. - Design, build, and maintain scalable, secure cloud infrastructure on AWS using Terraform and Terragrunt (IaC, Infrastructure as Code) - Manage and evolve our Kubernetes (EKS) clusters — including node group management, autoscaling with Karpenter, and workload reliability - Own and improve our CI/CD pipelines (GitLab CI), ensuring fast, reliable, and secure delivery - Drive observability initiatives: metrics, logging, alerting, and dashboards using Prometheus, Grafana, and related tooling - Support and evolve our Kafka infrastructure in collaboration with backend engineering teams - Champion infrastructure-as-code practices, ensuring consistent, reviewed, and well-documented changes - Respond to production incidents, lead post-mortems, and drive improvements in incident response processes - Collaborate with backend, security, and product engineering teams to support their infrastructure needs - Leverage AI-assisted tooling (e.g., GitHub Copilot, AI-powered incident analysis, LLM-based automation) to increase productivity and quality Qualifications - You bring 5+ years of experience managing large-scale production environments and aren't afraid of architectural complexity - You have a "code everything" mindset, replacing manual tasks with scalable, DRY Infrastructure as Code (IaC) - You understand the "why" behind Kubernetes internals and cloud networking, not just the "how" of deployment - You communicate complex infrastructure concepts clearly to both engineering peers and business stakeholders - You treat security, secrets management, and observability as core features, not afterthoughts Requirements - Advanced AWS & EKS: Deep proficiency in EC2, RDS, S3, IAM, and VPC networking, specifically within multi-account EKS environments - Kubernetes Internals: Hands-on experience with CNI, RBAC, Affinity/Taints, and managing complex workloads (StatefulSets/DaemonSets) - IaC Mastery: Proven ability to scale infrastructure using Terraform and Terragrunt with modular, reusable patterns - CI/CD & Helm: Expertise in designing secure GitLab CI pipelines and managing versioned Helm charts across environments - Observability: Proficiency in building dashboards and alerting logic using Prometheus and Grafana - Linux & Scripting: Strong Bash skills for environment management (Python proficiency is a significant plus) Company Description Rain is the world's first AI Financial Health Platform, serving 3.5 million employees at leading organizations like McDonald's, Marriott, and T-Mobile. Rain works in the background to optimize every employee's financial life to prevent shortfalls and build long-term stability. Backed by top investors including QED and Prosus, Rain has raised $150M in venture funding to fuel our next stage of hyper growth.

Portugal
CAI logo

DevOps Engineer

CAI

WHEN YOU NEED TO MEET A HIGHER STANDARD® in US | ASIA | EUROPE | AUSTRALIA

DevOps Engineer24 days ago
Full TimeRemoteTeam 501-1,000H1B Sponsor

Role Description We are looking for a DevOps Engineer to manage and maintain AWS cloud infrastructure and design, build, and manage Databricks infrastructure on AWS. This position will be full-time and remote - India. - Manage and maintain AWS cloud infrastructure, ensuring scalability, security, and cost efficiency - Design, build, and manage Databricks infrastructure on AWS - Support deployment and operationalization of Databricks-based data platforms, including cluster management and job orchestration - Implement Infrastructure as Code (IaC) practices using tools such as Terraform or AWS CloudFormation - Develop automation scripts and tools using Python and Shell Scripting to streamline operational processes - Design, implement, and optimize CI/CD pipelines using CloudBees (Jenkins) to support application and data platform deployments - Collaborate with software engineering, data engineering, and architecture teams to enable reliable and efficient delivery pipelines - Monitor system performance, availability, and reliability using enterprise monitoring solutions - Troubleshoot production issues and implement root cause analysis with preventive actions - Ensure compliance with TE Connectivity security, governance, and quality standards - Drive continuous improvement initiatives aligned with DevOps best practices and TE operational excellence Qualifications - Bachelor’s degree in Computer Science, Information Technology, Engineering, or related discipline - Shift Timings – 6 PM to 3 AM (IST) Monday - Friday - 4+ years of experience in DevOps, cloud engineering, or related roles - Exposure to Databricks platform (cluster setup, job execution, notebooks) - Hands-on experience with AWS services (EC2, S3, IAM, VPC, Lambda, etc.) - Experience with CloudBees/Jenkins for CI/CD pipeline implementation - Experience with Git-based version control systems - Experience with monitoring and logging tools such as CloudWatch, DataDog - Understanding of cloud security, networking, and system architecture principles - Strong scripting/programming skills in Python and Groovy - Experience working in Agile/Scrum environments Physical Demands - This role involves mostly sedentary work, with occasional movement around the office to attend meetings, etc. - Ability to perform repetitive tasks on a computer, using a mouse, keyboard, and monitor Reasonable Accommodation Statement If you require a reasonable accommodation in completing this application, interviewing, completing any pre-employment testing, or otherwise participating in the employment selection process, please direct your inquiries to application.accommodations@cai.io or (888) 824 – 8111.

India
CAI logo

Senior Fabric Administrator

CAI

WHEN YOU NEED TO MEET A HIGHER STANDARD® in US | ASIA | EUROPE | AUSTRALIA

DevOps Engineer24 days ago
Full TimeRemoteTeam 501-1,000H1B Sponsor

Role Description We are looking for a motivated Senior Fabric Administrator who ensures enterprise-level stability, scalability, and compliance across Fabric capacities, workspaces, and connected services. If you have experience with data architects, data engineering, analytics engineering, BI architects, and governance teams to sustain a governed self-service analytics ecosystem and deliver operational excellence through proactive monitoring, auditing, and performance optimization and are looking for your next career move, apply now. This position will be full-time and remote – India. What You’ll Do - Administer and maintain Microsoft Fabric capacities, tenants, and workspaces to support secure and reliable analytics operations. - Oversee platform governance, workspace provisioning, naming conventions, and capacity allocation in alignment with enterprise standards. - Manage and enhance Fabric audit logs, usage telemetry, and admin monitoring dashboards for visibility, compliance, and operational insights. - Develop and maintain Power BI Pro license utilization reports to identify inactive users and coordinate with governance teams to reclaim unused licenses for cost efficiency. - Implement proactive alerting and automated health checks for capacity utilization, gateway performance, and workspace compliance. - Lead the maintenance, upgrade, and configuration of the on-premises data gateway to sustain seamless on-premises and cloud data connectivity. - Administer RBAC, object-level, and workspace security through Microsoft Entra ID, ensuring governance compliance across Fabric and Power BI environments. - Manage and resolve user community tickets and service requests in alignment with defined SLAs, escalating or collaborating with Microsoft as needed. - Monitor capacity consumption and optimize workloads across multiple Fabric SKUs to maintain cost and performance balance. - Collaborate with other teams to integrate Fabric telemetry into enterprise monitoring and analytics dashboards (e.g., Log Analytics, Sentinel, Power BI). - Support CI/CD pipelines, deployment pipelines, and version control integration for Fabric artifacts. - Develop and maintain operational documentation, standard operating procedures, and runbooks for platform maintenance and incident response. - Mentor and provide technical guidance to associate administrators and analysts, promoting operational excellence and professional development. - Develop and maintain internal knowledge base articles, how-to guides, and technical enablement materials for Power BI users and the analytics community. - Demonstrate deep technical knowledge of Power BI Desktop, Power BI Report Builder, Power Query (M), DAX, and semantic modeling to support and guide report developers and analytics engineers. - Support integration of Power BI with Microsoft Teams, PowerPoint, and the Power BI mobile app to drive collaboration and cross-platform analytics adoption. - Participate in roadmap discussions, tenant feature reviews, and platform improvement initiatives with the BI governance and architecture teams. Qualifications - Bachelor’s degree in Information Systems, Computer Science, or related field with 8–12 years of relevant experience in analytics or platform administration. - 5+ years of hands-on experience administering Microsoft Power BI Service or Fabric in enterprise-scale deployments. - Strong technical expertise in Power BI Desktop, Power BI Report Builder, DAX, Power Query (M), and semantic model design. - In-depth understanding of Fabric architecture, capacity management, and Lakehouse integration. - Experience managing Fabric audit logs, activity monitoring, and Power BI Admin APIs for operational and compliance reporting. - Experience with PowerShell scripting, REST APIs, and automation of administrative tasks in Fabric and Power BI Service. - Familiarity with Azure Monitor, Log Analytics, and Power BI Admin APIs for audit and capacity monitoring. - Proven ability to configure and maintain the on-premises data gateway, including updates, troubleshooting, and performance optimization. - Proficiency with Microsoft Entra ID, RBAC, and workspace access control across Fabric and Power BI. - Experience managing license utilization, capacity consumption, and SLA-driven support operations. - Familiarity with CI/CD pipelines, deployment automation, and Git integration for Power BI and Fabric artifacts. - Strong understanding of data governance, sensitivity labeling, and compliance in Microsoft environments. - Ability to create and maintain enablement documentation, internal community articles, and user training materials. - Demonstrated leadership in mentoring junior administrators, coordinating platform improvements, and promoting user enablement. - Microsoft Certified Fabric Analytics Engineer Associate DP-600 and Microsoft Certified Power BI Data Analyst Associate PL-300 is required. Physical Demands - Ability to safely and successfully perform the essential job functions. - Sedentary work that involves sitting or remaining stationary most of the time with occasional need to move around the office to attend meetings, etc. - Ability to conduct repetitive tasks on a computer, utilizing a mouse, keyboard, and monitor. Reasonable Accommodation Statement If you require a reasonable accommodation in completing this application, interviewing, completing any pre-employment testing, or otherwise participating in the employment selection process, please direct your inquiries to application.accommodations@cai.io or (888) 824 – 8111.

India