Job Closed
This listing is no longer active.
AI-Powered Retail Pricing & Trade Fund Optimization
Cloud Platform Architect – DevOps
Location
United States
Posted
103 days ago
Salary
0
Seniority
Lead
Job Description
Cloud Platform Architect – DevOps
DemandTec
• Design and implement scalable and secure cloud-based solutions on Azure, ensuring best practices in architecture and security are followed. • Create and maintain continuous integration and deployment (CI/CD) pipelines to streamline development efforts while facilitating rapid and reliable software delivery. • Architect and oversee the implementation of infrastructure automation using Infrastructure as Code (IaC) tools such as Terraform. • Collaborate across cross-functional teams to define cloud architecture standards and improve operational processes and methodologies. • Lead the evaluation and integration of new tools and technologies to enhance productivity, performance, and security in the cloud. • Perform capacity planning, monitoring, and optimization of cloud resources to manage costs effectively without compromising performance. • Establish and promote best practices for cloud security, compliance, and governance within the organization's cloud infrastructure. • Serve as a mentor and technical lead to other engineers, guiding them through cloud architecture and DevOps practices.
Job Requirements
- Extensive experience (7+ years) in cloud architecture, DevOps practices, and related technologies
- Proven ability to design and implement complex cloud solutions with hands-on experience in infrastructure automation (Terraform).
- Demonstrated expertise in CI/CD pipeline development and maintenance using Jenkins, GitLab CI, or similar tools.
- Strong programming skills with experience in languages such as Python, Go, or Bash for automation and tooling.
- Deep understanding of cloud security best practices, compliance frameworks, and governance methodologies.
- Experience with containerization technologies (Docker, Kubernetes) and orchestration tools.
- Exceptional problem-solving skills, with the ability to troubleshoot complex system interactions and performance issues.
- Strong communication and collaboration skills, with experience working in agile teams and leading cross-functional initiatives.
- Preferred Qualifications:
- AWS or Azure Certified Solutions Architect or DevOps Engineer certification.
- Experience with monitoring and logging solutions ( ELK Stack, Prometheus, Grafana).
- Familiarity with configuration management tools (Ansible, Chef, Puppet).
- Experience in a startup or fast-paced environment with a focus on cloud-native and microservices architecture.
Benefits
- Health Care Plan (Medical, Dental & Vision)
- Retirement Plan (401k, IRA)
- Paid Time Off (Vacation, Sick & Public Holidays)
- Work From Home
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
Senior Site Reliability Engineer
Omilia - Conversational IntelligenceOmilia is the leading provider of Natural Language Understanding enabled IVR & natural dialogue interaction solutions.
About the Role We're looking for a Senior Site Reliability Engineer who approaches operational problems as engineering challenges. You won't just monitor dashboards and respond to pages — you'll help define and drive service level objectives, identify reliability risks, and work alongside engineering teams to ensure reliability and performance are first-class concerns from design through to production. Your mission is not only to keep the platform running but also to make the platform more reliable by default — through better practices, smarter automation, and a culture where every engineer thinks about failure modes. What You'll Do Drive Incident Excellence - Act as a first responder during incidents; lead root cause analysis and blameless post-mortems. - Turn incident learnings into systemic improvements — better tooling, better runbooks, better architecture. - Provide input and guidance to squads on troubleshooting documentation and operational runbooks, ensuring they are practical and effective for production support. Engineer Reliability - Define, implement, and iterate on SLIs, SLOs, and error budgets to drive data-informed reliability decisions. - Identify and measure operational toil; build software and automation to systematically reduce it. - Conduct capacity planning and performance analysis to stay ahead of scaling challenges. Build Observability - Design and evolve observability platforms (metrics, logs, traces, dashboards) that give engineering teams genuine insight into system behaviour — not just noise. - Continuously improve alert quality: reduce false positives, increase signal, and ensure every alert is actionable. Shape Reliability Culture - Partner with development teams to embed reliability thinking into the software delivery lifecycle — from design reviews to deployment strategies. - Champion practices like chaos engineering, progressive rollouts, and failure injection testing. - Mentor engineers across teams on reliability principles and operational best practices. Participate in On-Call - Join on-call rotations and continuously improve the on-call experience for yourself and others.
DevOps SRE
Southwest Power PoolSouthwest Power Pool (SPP) is about more than power. We’re about the power of relationships. Our employees have the opportunity to work together to ensure electricity is delivered reliably and affordably to the millions of people living in our service territory. We have been voted one of Arkansas’ Best Places to work by Arkansas Business and we are looking for a member of our team who is passionate about our mission to keep the lights on! We have a core ideology here at SPP that we stand by: Do the right thing, for the right reason, in the right way. PLEASE NOTE: SPP is not able to sponsor employment visas or student-work authorizations (STEM OPT) for this position. Please ensure you are eligible to work in the U.S. without sponsorship prior to applying. COMPENSATION INFORMATION: The salary range(s) represents our good faith estimate for the role at this time. While we strive to provide competitive and transparent compensation, there may be circumstances where an offer is above or outside of the listed range. We are open to discussing salary expectations with qualified candidates considering factors such as the candidate's qualifications, skills, competencies, experience and geographic location will all be considered during the hiring process. Lead DevOps SRE | Pay Range: $112,240.00 - $145,810.00 Senior DevOps SRE | Pay Range: $87,950.00 - $112,190.00
Join a mission-driven technology team powering the reliability of the electric grid for millions across the central United States. As a DevOps SRE, you’ll play a pivotal role in ensuring the performance, resilience, and long-term scalability of SPP’s production systems and user-facing services. If you're passionate about high-availability architecture, automation at scale, and shaping the future of reliability engineering, this is your opportunity to make a meaningful impact. Design and lead implementation of large-scale, highly available infrastructure solutions across multiple platforms. Serve as the primary technical escalation point for complex incidents, outages, and performance issues—ensuring rapid, effective resolution. Establish SRE best practices and operational frameworks that raise reliability standards across the entire organization. Mentor and develop senior and junior SRE team members through coaching, code reviews, and knowledge-sharing. Partner with leadership, architects, and technical teams to align reliability initiatives with long-term architectural strategy. Drive continuous improvement, automation, and reduction of operational toil across environments. Lead major enterprise projects involving new technologies, cross-team automation, and reliability enhancements. Make architectural decisions that influence scalability, resiliency, and infrastructure management standards. You'll also contribute hands-on technical expertise through testing, coding, scripting, designing integrated systems, reviewing new and existing projects for architectural compliance, and researching emerging technologies to keep SPP’s ecosystem future-ready. Developing and executing test plans to evaluate system performance. Planning and scheduling deployment activities and resolving release pipeline issues. Maintaining deep knowledge of SPP's architecture and technology portfolio. Creating and communicating technology standards and policies. Designing and overseeing implementation of integrated systems and databases. Ensuring system and software integrations meet functional and compliance requirements. Conducting application testing (unit, system, regression, performance, acceptance). Writing and translating code to enhance application performance. Providing architectural consulting across engineering and development teams. Serving as the organizational subject matter expert in SRE.
• Design and implement cloud architecture solutions. • Centralize and automate workflows through developer tooling. • Develop and maintain Helm charts for Kubernetes deployments. • Optimize Docker images for performance, security, and maintainability. • Provision and manage infrastructure using Terraform in Google Cloud Platform (GCP). • Evaluate and improve the organization’s security posture, ensuring best practices are followed. • Collaborate closely with cross-functional teams to implement scalable, reliable, and secure systems
• Focuses on the operations and stability of a global IVR system, including deployments to AWS and on-premises environments • Assist with deployments, configurations, and release activities • Support IVR systems deployed across AWS and on-premises environments • Manage day-to-day operations of global IVR platforms • Monitor system performance, availability, and reliability • Respond to incidents and operational issues in a timely manner • Collaborate with engineers, architects, QA, and partners during releases and incidents • Troubleshoot infrastructure, application, and voice-related issues • Assist in debugging IVR, Genesys, SIP, and call flow–related problems • Analyze logs and metrics to identify root causes and improvement opportunities • Participate in on-call, evening, or weekend support as required


