Tier 1 Automation Engineer
Location
United States
Posted
73 days ago
Salary
$55K - $65K / year
Seniority
Mid Level
Job Description
Tier 1 Automation Engineer
Rightworks LLC
Rightworks offers the only intelligent cloud purpose-built for accounting firms and professionals. Backed by award-winning support, our fully managed IT and applications ensure customers have secure, reliable, on-demand access to their technology. We provide a curated software ecosystem that simplifies the complexity of running an accounting firm or small business, supported by a community of thought leaders, peer networks, and educational resources. Our success is made possible by leveraging decades of specialized experience in leading accounting firms, SMBs and technology companies. Thousands of Firms and SMBs count on us to run their business every day. We have a great team, we’re growing fast and have a winning culture based on innovation, teamwork, and mutual respect. Job Overview: We are looking for an experienced and proactive Tier 1 Automation Engineer to join our IT operations team. In this role, you will be responsible for maintaining, optimizing, and securing our organization’s customer facing server infrastructure, both on-premises and in the cloud. You will work closely with various teams to ensure the availability, performance, and security of critical systems and services. Your expertise will help guide IT strategy, troubleshoot issues, and ensure operational excellence in managing enterprise systems. This is a remote based position. Responsibilities: - Administer and maintain servers, networks, and systems across on-premises and cloud environments - Perform system upgrades, patches, and troubleshooting to ensure system reliability and security. - Finishes work as assigned by Lead and AE Manager, adhering to established code standards and delivery processes. - Creates new automation for applications that need update or install automation, using existing Tier 2 tooling and libraries. - Maintains annual application update automation, ensuring timely and successful rollout of new versions. - Creates and maintains custom automation for RMM (e.g., custom scripts, monitors, and administrative tasks). - Documents all new and updated automation scripts and processes in the central repository (Gitlab/Atlassian). - Troubleshoots and resolves failed application deployments and automation tasks in the client environment. - Ensure compliance with industry regulations and best practices related to system security, data privacy, and IT governance. - Stay up to date with the latest trends and technologies in system administration, recommending improvements when necessary. Requirements: - Bachelor’s degree in Computer Science, Information Technology, or a related field (or equivalent work experience). - 5+ years of experience in system administration, devops, or other automation, with a solid track record of managing enterprise-level IT infrastructures. - Strong experience with operating systems (Linux, Windows Server, etc.) and server management. - In-depth knowledge of virtualization technologies (VMware, Hyper-V, KVM, Containers). - Expertise in network configuration, routing, and security (DNS, DHCP, VPN, firewalls). - Experience with automation and scripting (e.g., PowerShell, Bash, Python, Ansible). - Strong troubleshooting and problem-solving skills, particularly with complex infrastructure issues. - Familiarity with containerization technologies (Docker, Kubernetes) and continuous integration/continuous deployment (CI/CD) practices is a plus. - Experience with monitoring tools (e.g., Nagios, Zabbix, SolarWinds) and performance tuning. - Understanding of IT security best practices and experience implementing security measures. - Certifications such as CompTIA Server+, Microsoft Certified: Azure Administrator Associate, AWS Certified SysOps Administrator, or similar are a plus. - Excellent communication and teamwork skills, with the ability to interact effectively with various technical and non-technical teams. - Ability to manage multiple tasks and prioritize effectively in a fast-paced environment. Eligibility Requirements - This role is open to US Citizens or permanent residents authorized to work in the United States. Rightworks LLC is unable to offer visa sponsorship. - Due to specific state regulations, we are unable to accept applications from residents of California, Hawaii, or Alaska. - Relocation will not be offered for this position. Compensation Our Compensation range for this role ranges from $55,000 to $65,000 annually, and is determined based on factors such as relevant experience, skills, and internal equity. Benefits To provide best-in-class solutions, we need a best-in-class team. We offer competitive salaries to recruit the best talent. We provide company-paid short and long-term disability insurance, life insurance and a generous 401K match. We offer highly affordable medical, dental, vision coverage, and many other valuable benefits. We offer flexible PTO, and numerous paid holidays, affording you the time to be there for what is important in your life. We encourage giving back to our communities by providing paid volunteer time off. We are proud to be an Equal Opportunity Employer! This job description may not be inclusive of all assigned duties, responsibilities, or aspects of the job described, and may be amended at any time at the sole discretion of the employer.
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
- Location: Remote (India preferred) - Department: Product, Engineering & Data Science - Report to: Senior Director of Engineering About Us ELSA is a global leader in AI-powered English communication training, dedicated to transforming how people learn and speak English with confidence. Founded in 2016 and headquartered in San Francisco, we operate across the U.S., Vietnam, Portugal, Indonesia, Brazil and Japan. Powered by proprietary speech-recognition technology and generative AI, ELSA delivers real-time, hyper-personalized feedback to help learners improve pronunciation, fluency, and overall communication effectiveness. With over 50 million learners and 1 billion hours of anonymized speech data, ELSAs depth of language training intelligence is unmatched in the industry. Our B2B flagship platforms ELSA Enterprise and ELSA Schools empower organizations and educational institutions to elevate communication capabilities and unlock personal and professional opportunities for their people. We design engaging, bite-sized learning experiences that adapt to each learner's goals and context, ensuring measurable improvement and lasting confidence. Our vision is to become the global standard for real-time English communication training, enabling 1.5 billion language learners worldwide to speak clearly, be understood, and share their stories with the world. Backed by world-class investors including Googles Gradient Ventures, Monks Hill Ventures, and SOSV, ELSA has been recognized among the top global AI innovators: - Forbes Top 4 Companies Using AI to Transform the World - Research Sniper Top 5 Best AI Apps - ASU+GSV EdTech 150 - CB Insights Top 100 AI Companies Join us in shaping the future of language learning and empowering millions to unlock opportunity through confident communication. Role Summary We are looking for a Principal DevOps / SRE engineer to build and own our reliability practice end-to-end. This is not a firefighting role — our team already responds well to incidents. This person will formalize what works, automate what repeats, and build the foundation for enterprise-grade SRE as ELSA scales its B2B footprint. Key Responsibilities - Own the SRE practice: define severity tiers (P1–P4), formalize on-call rotation, build SLA tracking dashboards, and establish incident management workflows across a team of 4 DevOps engineers. - Build runbooks for the top recurring operational issues — pod scaling, deploy rollbacks, access management, EKS upgrades, CI/CD pipeline failures — and automate L1/L2 responses using tools like Shoreline.io, Rundeck, or PagerDuty automation. - Introduce and operationalize AI-assisted DevOps tooling: AIOps for alert correlation, CastAI/Kubecost for cost optimization, GitHub Copilot for IaC acceleration. Train the existing team on these tools. - Drive infrastructure modernization: EKS upgrades, Karpenter migration, observability (SigNoz/Prometheus), secrets management (ArgoCD/SOPS), and Terraform-based IaC maturity. - Collaborate with AI Engineering, Mobile, and B2B teams to ensure infrastructure supports real-time speech processing, GPU workloads, and multi-region enterprise deployments. - Design and plan round-the-clock SRE coverage model as B2B enterprise SLA commitments grow — evaluate vendor partnerships or strategic hires for Americas timezone coverage. What You Will Have - 2+ years in DevOps/SRE, with at least 2 years in a principal or staff-level role owning reliability practices for a production SaaS product. - Deep hands-on experience with AWS (EKS, EC2, DynamoDB, S3, IAM, Secrets Manager), Kubernetes (HPA, KEDA, Karpenter, pod scheduling, GPU workloads), and IaC (Terraform, Helm, ArgoCD). - Track record of building runbooks, on-call rotations, and incident management frameworks — not just participating in them. - Experience with observability stacks (Prometheus, Grafana, SigNoz or Datadog), CI/CD (GitLab CI, GitHub Actions), and alerting (PagerDuty, Opsgenie). - Comfort working across timezones with distributed teams (India, Vietnam, Portugal). Strong written communication — you'll be writing runbooks, RCAs, and proposals as much as Terraform. Nice to Have - Experience with AI/ML infrastructure (GPU scheduling, model serving, real-time audio/speech workloads). - Familiarity with compliance frameworks (ISO 27001, SOC 2, Vanta) in a DevOps context. - Hands-on experience with AIOps tooling, automated remediation platforms (Shoreline, Rundeck), or FinOps tools (CastAI, Kubecost). What We Offer - Flexible work setup: Remote-first for Singapore, India, Indonesia, Malaysia; hybrid model for Vietnam. - Comprehensive employee well-being benefits. - Free ELSA Premium courses to polish your language skills - Collaborative, international team culture. - Opportunity to contribute to a fast-growing, well-funded Silicon Valley startup with global impact.
L2 Cloud Operations Engineer
ScalableOSScalableOS is a premium offshoring solutions provider based in the Philippines.
• Provide second-line technical support to hedge fund and financial services clients across the US and UK • Take ownership of complex issues and drive them through to resolution • Monitor client cloud infrastructure and trading systems using enterprise monitoring tools • Manage support tickets end-to-end using Jira Service Management • Deliver expert-level desktop troubleshooting across Windows 10/11 environments • Operate and support client Microsoft Azure environments • Configure and troubleshoot SSL VPN and IPsec VPN connections for remote client access • Collaborate with L3 engineers and senior operations staff on escalated issues
L3 Cloud Operations Engineer
ScalableOSScalableOS is a premium offshoring solutions provider based in the Philippines.
• Serve as the senior technical leader for the L1 and L2 Cloud Operations Engineers, providing day-to-day guidance, coaching, and knowledge transfer • Lead by example on complex incidents, walking junior engineers through advanced troubleshooting methodologies and resolution strategies • Define and maintain standard operating procedures, runbooks, and escalation workflows to ensure consistent and high-quality service delivery • Act as the technical point of contact during operational hours, overseeing ticket queues, prioritization, and SLA adherence across the team • Conduct internal training sessions and knowledge sharing on new technologies, processes, and client-specific environments • Identify skill gaps within the team and recommend training paths and certification goals to the Head of Managed Services • Serve as the highest-level technical escalation point within the service desk, resolving the most complex incidents spanning cloud, networking, security, and back-end infrastructure • Own and lead major incident processes end-to-end, including triage, communication, escalation to third parties, root cause analysis, and post-incident reviews • Perform advanced troubleshooting across multi-tenant Azure environments, hybrid infrastructure, and complex networking topologies • Document and track all escalations, major incidents, and problem records in Jira Service Management with thorough root cause analysis • Architect and manage complex Azure environments including Azure AD, Conditional Access, Azure Virtual Desktop (AVD), Azure Networking (VNets, NSGs, ExpressRoute), and Azure Automation • Administer and optimize Office 365 tenants at an advanced level, including Exchange Online mail flow, hybrid configurations, security and compliance policies, and tenant-to-tenant migrations • Manage and troubleshoot Windows Server infrastructure (2016/2019/2022) at an advanced level, including Active Directory design, Group Policy architecture, DNS/DHCP, DFS, and Certificate Services • Oversee VMware ESXi and virtualization environments including capacity planning, performance optimization, host management, and migration strategies • Lead VDI environment management including Azure Virtual Desktop, Citrix, and thin client deployments at scale • Perform intermediate to advanced network troubleshooting and configuration across TCP/IP, DNS, DHCP, VLANs, routing protocols, and WAN connectivity • Configure and manage Fortinet FortiGate firewalls including advanced policy management, SD-WAN, IPS/IDS, web filtering, and high-availability configurations • Manage Cisco Meraki environments on a scale, including complex wireless deployments, SD-WAN, switch stacking, and security appliance policies • Design, configure, and troubleshoot SSL VPN and IPsec VPN solutions across multiple client environments • Serve as the subject matter expert for advanced desktop and endpoint issues that cannot be resolved at L1/L2, including complex OS corruption, driver conflicts, and application compatibility • Design and optimize Intune/Endpoint Manager deployment strategies including Autopilot, compliance policies, and application packaging • Liaise directly with client stakeholders on escalated issues, service reviews, and change management activities • Collaborate with the account management teams on client escalations, service improvement plans, and quarterly business reviews • Design and implement automation solutions using PowerShell, Azure Automation, and other scripting tools to eliminate manual overhead and improve operational efficiency
• Providing technical leadership and architectural direction across all DevOps initiatives. • Establishing engineering standards for CI/CD, infrastructure-as-code, container orchestration, observability, and DevSecOps practices. • Allocating DevOps resources across concurrent product initiatives based on priorities set by the Director of Product Development. • Conducting performance evaluations, career development planning, and technical mentorship. • Ensuring consistent operational excellence, reliability, security compliance, and automation maturity across environments. • Building scalable DevOps processes that enable autonomy within full-stack teams while maintaining governance and architectural alignment.



