AlpacaDB, Inc., also known as Alpaca and Alpaca Securities, is an API stock and crypto brokerage platform that enables services to embed investing and developer

Operations Reliability Engineer - Automations

DevOps EngineerDevOps EngineerFull Time Remote Mid Level Company Site

Location

Worldwide

Posted

32 days ago

Salary

Seniority

Mid Level

No structured requirement data.

Job Description

Role Description As an Operations Reliability Engineer , you will embed directly within brokerage operations functions to systematically eliminate manual work and replace it with durable, auditable software systems. You start by immersing yourself in operational workflows: observing, documenting, and deeply understanding processes end-to-end before designing solutions. Every recurring manual process is treated as a system defect, and every fix you ship is measured by its real-world impact on efficiency and reliability. You will work closely with licensed brokerage staff, domain experts, and platform engineers to build automations and tooling that allow Alpaca's operations to scale globally without scaling headcount linearly. The ideal candidate is equally comfortable shadowing an operational process and architecting the backend service that replaces it. Things You Get To Do - Design, build, test, deploy, and monitor production automations and UIs that remove manual steps and reduce operation time. - Partner with frontend engineers to productize ops tooling so global teams can run functions with predictable staffing. - Execute operational procedures to surface painful manual processes prior to automation. - Instrument and report baseline and outcome metrics (MTTC, manual-steps removed, queue sizes, ops satisfaction) and iterate based on measured impact. - Produce Platform Opportunity Briefs / RFCs for higher-level platform tooling and automations. - Collaborate with licensed BD leadership, Compliance, and Security to build auditable, safe automations with role-based access and clear runbooks. - Own the full lifecycle of the systems you build, including automated deployment (CI/CD with tools like ArgoCD and Terraform), proactive monitoring, On-call support rotations and incident response, following a "you build it, you run it" philosophy. - Build systems with auditability, traceability, and data lineage as a first-class concern to ensure transparency for our auditors and regulators. Qualifications - 5+ years of professional software engineering experience, with a proven track record of shipping and operating complex, large-scale systems in production. - Strong business sense and understanding of operations. - Deep, hands-on expertise in Golang, including a strong command of its concurrency models (goroutines, channels), memory management, and standard library. - Proven track record of building user-facing features end-to-end with Typescript/React. - Proficient with SQL and relational databases, preferably PostgreSQL. - Demonstrated ability to reason about human workflows as systems, not just software services. - Experience with observability, tracing, continuous profiling. - Exceptional analytical and problem-solving skills, with the ability to deconstruct complex requirements into clear technical components and excellent communication skills for working in a cross-functional environment. - High ownership mindset with bias toward durable, structural fixes over tactical patches. Requirements - Knowledge of service oriented architectures. - Experience with major cloud platforms (we primarily use GCP). - Financial market (exchange, broker-dealers, clearing, etc.) knowledge. - Experience with Docker and Kubernetes. - A passion for financial markets or the desire to learn. - Knowledge of Agile/Scrum methodologies. - Demonstrable experience in designing, building, and reasoning about distributed systems, including a strong understanding of microservices architecture and API design patterns (e.g., REST, gRPC). - Experience with capacity planning and benchmarking. Benefits - Competitive Salary & Stock Options. - Health Benefits. - New Hire Home-Office Setup: One-time USD $500. - Monthly Stipend: USD $150 per month via a Brex Card.

Related Categories

DevOps Engineer

Related Job Pages

Remote Full-time Jobs (US)More Remote Jobs

More DevOps Engineer Jobs

DevOps Engineer, GCP

Intermedia Cloud Communications

DevOps Engineer32 days ago

Full Time RemoteTeam 1,001-5,000H1B No Sponsor

Company Site LinkedIn

• Collaborate closely with the development team to deploy and maintain application infrastructure. • Assist in the development and support of tooling to streamline the deployment and maintenance of our products. • Work with Kubernetes, Docker, Helm and ArgoCD to deploy applications from development through to production environments. • Support both in-house and third-party applications, including handling deployments, upgrades, and troubleshooting. • Write and manage automation pipelines for application deployment and maintenance. • Provision and manage infrastructure using Terraform. • Document processes and best practices clearly and concisely.

Ansible Cloud Docker ElasticSearch Google Cloud Platform Jenkins Kubernetes Linux PostgreSQL Python RabbitMQ Redis Terraform Go

View details: DevOps Engineer, GCP

United Kingdom

Apply

Senior Site Reliability Engineer

PlayOn! Sports

The nation's leading high school media company providing live streaming and digital ticketing services.

DevOps Engineer32 days ago

Full Time RemoteTeam 201-500H1B No Sponsor

Company Site LinkedIn

• Contribute to system observability i.e implementing, improving metrics, alerting, and dashboards for better insight and faster recovery. • Develop automation, tooling, and monitoring solutions to support high service availability. • Partner with application and quality engineering teams to implement best practices in reliability, release automation, and testing. • Drive operational excellence through proactive incident prevention, blameless postmortems, and capacity planning. • Participate in on-call rotations to support critical services and ensure rapid response to incidents.

AWS Azure Cloud Distributed Systems Docker Google Cloud Platform Grafana Java Kubernetes Linux Prometheus Python Terraform Go

View details: Senior Site Reliability Engineer

United States

Apply

Site Reliability Engineer

Orion Health

Revolutionising global healthcare so every individual receives the perfect care for them.

DevOps Engineer32 days ago

Contract RemoteTeam 501-1,000Since 1993H1B Sponsor

Company Site LinkedIn

• Ensure the reliability, availability and performance of cloud infrastructure and operating systems. • Design, manage and execute upgrade and maintenance schedules for clients. • Automate infrastructure processes and implement best practices. • Introduce new tools that enhance software delivery pipeline. • Produce upgrade and maintenance plans for all clients under responsibility. • Implement and review infrastructure monitoring and observability tools.

Ansible AWS Cloud DNS Grafana Linux Prometheus VMware

View details: Site Reliability Engineer

Texas

$60 - $75 / hour

Apply

Job Closed

Senior DevOps

CI&T

Navigate Change

DevOps Engineer32 days ago

Full Time RemoteTeam 5,001-10,000Since 1995H1B No Sponsor

Company Site LinkedIn

•Manage and optimize CI/CD pipelines and Devops. •Configure and maintain infrastructure (OS, webservers, VMs, Docker, Bitbucket, Networking). •Monitor systems, resolve incidents and help to keep systems up and running, especially during US EST time-zone. •Handle security updates and hardening for servers and applications. •Mentor and provide technical leadership to other teams.

AWS Docker Grafana Jenkins Prometheus Python Shell Scripting

View details: Senior DevOps

Brazil

Apply

Operations Reliability Engineer - Automations

Job Description

Related Guides

Related Categories

Related Job Pages

More DevOps Engineer Jobs

DevOps Engineer, GCP

Senior Site Reliability Engineer

Site Reliability Engineer

Senior DevOps