Job Closed
This listing is no longer active.
Custom Wall Murals for your home and office.
DevOps Engineer
Location
United States
Posted
153 days ago
Salary
0
Seniority
Senior
Job Description
DevOps Engineer
MagicDecor®
• Own and operate infrastructure in AWS/Azure across environments (Dev, Staging, Production, DR) • Lead cloud-to-cloud migrations (e.g., AWS to Azure / GCP) • Manage CI/CD pipelines, versioned deployments, and environment isolation • Monitor and tune database performance (PostgreSQL, MySQL, Mongo) • Manage Azure Application Gateway for layer 7 load balancing, SSL termination, and WAF policies • Implement and manage caching layers (e.g., Redis) and data indexing with Elasticsearch for performance and observability • Deep experience with GitHub Actions • Proficient in managing infrastructure-as-code (IaC) • Scripting knowledge in Python, Go, or Shell for custom tooling and glue code. • Basic understanding of network fundamentals like Networks, DNS, PORTS, ROUTES,NAT GATEWAYS and VPN. • Configure CPU, memory, and disk partitions as required • Experience designing and maintaining backup strategies and DR plans in cloud and on-premise setups
Job Requirements
- 3+ years of experience in a DevOps/Site Reliability Engineering (SRE) role or similar infrastructure engineering function.
- Expertise with Kubernetes (K8s), including production deployments, cluster configuration, and performance tuning.
- Advanced experience with AWS services, such as: EC2, S3, IAM, VPC, EKS, CloudWatch, CloudFormation, Route53
- Linux systems administration (Debian, Ubuntu or RedHat) with Shell scripting
- Network configuration and security hardening
- Knowledge of 3rd party monitoring tools like New Relic etc
Benefits
- Remote working
- Open Door Policy
- Work in a pleasant environment with little hierarchy
- Freedom to innovate and learn new things
- Mon to Fri 10:00AM TO 7:00PM working
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
• Design, implement, and maintain GitLab CI / CD pipelines for M&S applications and services • Integrate automated testing, code quality checks, SAST or DAST tools, vulnerability scanning, and deployment controls • Build and manage Docker and Kubernetes clusters supporting simulation platforms on AWS and hybrid environments • Operate secure cloud environments using IaC, configuration as code, and automated security controls • Produce SBOMs, scan findings, and pipeline documentation to support RMF and ATO compliance • Implement observability frameworks for logging, metrics, and tracing and resolve performance bottlenecks • Provide remediation guidance for findings triggered by automated code and container scanning • Support platform deployment and readiness events that may require TDY travel • Work closely with developers, security engineers, and PMO to standardize and modernize legacy delivery workflows
• Provide hands-on operational support for Azure cloud environments • Manage subscriptions, resource groups, networking, storage, Key Vaults, virtual machines, private endpoints, and security configurations • Support Databricks platform operations including users, groups, service principals, clusters, jobs, notebooks, serverless components, models, and Unity Catalog • Execute routine operational tasks to maintain platform stability and reliability • Support secure connectivity and basic cloud networking patterns • Apply Infrastructure as Code using Terraform or equivalent tools • Work closely with the internal platform team, following established standards, tooling, and governance • Contribute to long-term platform maturity through consistent, reliable execution
• Innovate and Implement: Design, implement, and maintain large-scale HPC/AI clusters with state-of-the-art monitoring, logging, and alerting systems. • Infrastructure as Code (IaC): Utilize and develop tools to manage infrastructure as code, ensuring scalable and repeatable deployments. • Streamline CI/CD Pipelines: Develop and maintain continuous integration and continuous delivery (CI/CD) pipelines to automate and streamline deployment processes. • Automate Everything: Develop automation scripts and tools to automate deployment, configuration management, and operational monitoring. • Develop complex Networking automations. • Troubleshoot Complex Issues: Perform comprehensive troubleshooting from bare metal to application level, ensuring system reliability and efficiency. • Lead and Educate: Serve as a technical resource, developing and sharing best practices with internal teams. • Drive Innovation: Support R&D activities and engage in proof of concepts (POCs) and proof of values (POVs) for future improvements.
• Design, build, and operate the core infrastructure that powers Owner’s engineering organization, with an emphasis on reliability, security, and ease of use. • Own and evolve our Kubernetes-based platform on AWS, improving how services are deployed, scaled, monitored, and secured in production. • Build and maintain CI/CD pipelines that are fast, reliable, and easy to reason about by reducing deploy risk while increasing developer confidence and velocity. • Focus deeply on developer experience: identifying pain points in local development, testing, deployment, and observability, then replacing manual or error-prone workflows with self-service tooling and automation. • Partner closely with application engineers to set clear patterns and golden paths for how services are built and run at Owner, balancing flexibility with strong, opinionated defaults. • Strengthen our approach to operational excellence by improving monitoring, alerting, incident response, and post-incident learning. • Help embed security into our infrastructure and delivery pipelines, ensuring best practices are automated and invisible.




