The modern IT documentation platform.
DevOps Engineer
Location
United States
Posted
52 days ago
Salary
$80K - $120K / year
Seniority
Senior
Job Description
DevOps Engineer
Hudu
• Deploy and manage Ruby on Rails applications in AWS and Kubernetes environments, ensuring high availability, scalability, and resilience across all production and staging systems. • Implement and maintain security best practices across the infrastructure, including identity and access management (IAM), encryption at rest and in transit, container security scanning, and patch management. • Monitor and analyze application and infrastructure logs (Rails logs, Kubernetes logs, AWS CloudWatch, S3 access logs, Nginx, PostgreSQL, etc.) to proactively identify, investigate, and resolve issues. • Debug system performance bottlenecks across the stack, including slow database queries, S3 object storage latency, misconfigured Nginx or load balancers, or Rails application-level issues. • Design and maintain CI/CD pipelines that automate build, test, and deployment processes with minimal downtime. • Collaborate with developers to improve observability and instrumentation, ensuring that metrics, tracing, and logging are in place to diagnose issues quickly. • Conduct infrastructure capacity planning to ensure resources are optimized for cost and performance as customer usage grows. • Respond to incidents and outages, participate in root cause analysis, and implement corrective actions to prevent recurrence. • Maintain and optimize Kubernetes clusters, ensuring proper resource allocation, autoscaling, and workload distribution. • Work with databases (PostgreSQL) to tune queries, configure backups, manage replication, and ensure reliability. • Manage and monitor cloud storage systems (S3, EBS, etc.), ensuring secure, performant, and cost-effective use. • Implement disaster recovery strategies, including regular testing of backups and failover processes. • Stay current on DevOps, Rails, AWS, and Kubernetes practices and technologies, applying them to continuously improve system reliability, security, and performance.
Job Requirements
- Associates degree required
- Minimum of 5 years’ experience in a DevOps engineering role
- Advanced expertise in AWS EC2, Aurora, Postgres, Puma, NGINX, and Kubernetes
- Experience with Ruby programming language
- Extensive knowledge of Ubuntu
- Experience with Git or Mercurial, GitHub Actions/Gitlab Pipelines, and CI/CD tools
- Excellent time managing skills with the ability to multi-task, prioritize, and meet deadlines
- Must possess fluent ability to communicate in English in oral and written format
Benefits
- Health Insurance
- 401k plan with company matching
- Paid time-off
- Flexible work hours
- Work Life Balance
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
• Responsible for the organizational and operational management as well as the strategic development of the DevOps team • Consideration of infrastructure projects, structured planning and sensible allocation of capacity • Reliable operation of technical platforms • Continuous development of internal platforms and infrastructure processes • Supporting the team in technical decisions
Senior SRE
CisionCision is the global leader in consumer and media intelligence, engagement, and communication solutions. We equip PR and corporate communications, marketing, and social media professionals with the tools they need to excel in today's data-driven world. Our deep expertise, exclusive data partnerships, and award-winning products, including CisionOne, Brandwatch, and PR Newswire, enable over 75,000 companies and organizations, including 84% of the Fortune 500, to see and be seen, understand and be understood by the audiences that matter most to them. Cision is committed to fostering an inclusive environment where all employees can be their authentic selves and perform at their best. We believe diversity, equity, and inclusion is vital to driving our culture, sparking innovation and achieving long-term success.
• Design and implement reliability solutions for data ingestion, processing, and delivery pipelines • Define and maintain SLIs/SLOs for data licensing services and manage error budgets • Build automation for deployment, monitoring, and incident response • Enhance system observability through metrics, logging, and tracing • Develop and maintain dashboards and alerts to proactively detect and resolve issues • Participate in on-call rotations and lead incident response efforts • Conduct root cause analysis and drive post-incident improvements • Maintain runbooks and operational documentation • Partner with software and data engineers to embed reliability into system design • Contribute to blameless postmortems and reliability reviews • Share knowledge and mentor junior team members
Join Triumph! At Triumph, our vision is a world where freight transactions are accurate and seamless on the most modern and secure freight transaction network. That’s why we’re looking for passionate, innovative, solutions-oriented people to join our team. We thrive on providing exceptional customer service and we look for team members with an entrepreneurial spirit and a passion to build successful partnerships with our clients. Because at the end of the day our goal is to help our partners businesses run better. We are a fast-growing FinTech company that is looking for a highly skilled Senior DevOps Engineer to join our team. In this role you will be responsible for designing, implementing, and maintaining our organization's DevOps infrastructure. You will work closely with development, security, and operations teams to ensure that our systems are secure, reliable, and scalable. We expect you to be quick to grasp new concepts, thoroughly explore the depths of an issue, and be persistent in understanding the root cause of issues. We hop you have strong interpersonal skills due to continual interaction with managers and users with varying technical backgrounds in a fast-paced work environment. What You’ll Be Doing: - Design, implement, and maintain our organization's cloud infrastructure, including CI/CD pipelines, automation tools, and monitoring systems in AWS. - Work closely with development teams to ensure that our applications are reliable and scalable in a secure and compliant manner. - Own the deployment and maintenance of Kubernetes clusters, ensuring efficient resource utilization, scalability, and high availability. - Develop, maintain, and optimize Helm charts for deploying and managing containerized applications. - Collaborate with security teams to ensure that our systems meet security standards and regulations such as SOX, SOC2, and FFIEC. - Monitor and optimize systems and applications performance. - Automate the deployment and configuration of systems and applications. - Recommend impactful design and process changes to ensure the success of platforms and infrastructure services. - Design and deploy technical implementations to achieve SLOs and SLIs to ensure we are meeting our customer’s expectations. - Continuously evaluate and implement new technologies and tools to improve our DevOps processes. What Makes You a Great Fit: We hope you’ll possess business operations experience and skills, leadership expertise, analytical and critical thinking skills, and attention to detail. Additionally, you should possess the following: - Technical certifications in AWS or related technologies highly desirable - Minimum of 5 years of experience in an IT support role with strong troubleshooting skills. - Comprehensive experience with AWS services including a solid understanding of S3, EBS, EC2, ECS, EKS, ELB/ALB, IAM, VPC, RDS, CloudTrail, and CloudWatch, KMS, Secrets Manager, Route53, SSM, Redshift, Cloudfront, Elastic Beanstalk. - Must have Experience with Kubernetes, Helm Charts. - Experience working with Redis and Postgres. - Experience with CICD platforms like Argo CD, Github Actions. - Experience with Machine Learning, Snowflake, and Looker are a big plus. - Experience running with migration projects and technology consolidation efforts. - Recent experience with Terraform and CI/CD processes in a production environment. - Knowledge of networking fundamentals and network requirements for a hybrid cloud environment. - Experience working within Agile software development methodology. - Experience with Linux. - Ability to write technical documentation and present materials to mixed audiences is required. Skills and Competencies We Value: - Knowledge and understanding of IT concepts, best practices and procedures. - Strong troubleshooting experience. - Self motivated with the ability to work individually or in a team. - Ability to leverage tools to perform day-to-day administration tasks, root-cause analysis and service restoration (such as backup, restore, failover, log interpretation, and performance monitoring). - Ability to multitask and manages work effectively by prioritizing own assignments, schedules, and meetings resulting in timely completion of work. - High degree of personal integrity. - The applicant should eager to learn and obtain technical certification. - Must be able to receive and follow instructions given by management. - Must have the ability develop solutions to unique problems. Work Environment The work environment characteristics described here are representative of those an employee may encounter while performing the essential functions of this job. Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions. - Moderate noise level typical of a business office environment. - Ability to work in a confined office or workstation area. - Ability to sit at a computer terminal for extended periods of time. - Occasional stooping, kneeling, or light physical activity may be required. - Regular use of hands, fingers, and vision for computer and phone work. - Light to moderate lifting may be required. - Regular, predictable attendance is required. - Travel or additional physical requirements may be added as needed. #LI-JC1 Compensation Range Annual Salary: $151,038.00 - $234,109.00 ***Location: Dallas, TX or Remote U.S. excluding the following states: AK, DE, ID, ND, RI, VT, WY *** We offer Medical, Dental, Vision, Paid Time Off, 401k and much more. Go on. Do it. Apply Today!
• Lead current-state assessments of cloud infrastructure practices, team capabilities, delivery bottlenecks, and governance gaps across a globally distributed organization • Design a Cloud Center of Excellence (CoE) operating model that defines ownership boundaries, responsibilities, and escalation paths between the central platform team and product engineering teams • Define a cross-cutting standards framework for AWS account structure, multi-region strategy, security baselines, resilience, and scalability • Design exception-handling and design-checkpoint processes that enable innovation beyond standard patterns without creating bottlenecks • Create a transition roadmap and enablement strategies to reduce product team dependency on the central Cloud team • Partner with security architects to embed security guardrails and IAM governance into the CoE framework • Provide strategic guidance and manage stakeholders across technical and executive audiences




