AlpacaDB, Inc., also known as Alpaca and Alpaca Securities, is an API stock and crypto brokerage platform that enables services to embed investing and developer
Operations Reliability Engineer - Automations
Location
Worldwide
Posted
32 days ago
Salary
0
Seniority
Mid Level
No structured requirement data.
Job Description
Operations Reliability Engineer - Automations
AlpacaDB
Role Description As an Operations Reliability Engineer , you will embed directly within brokerage operations functions to systematically eliminate manual work and replace it with durable, auditable software systems. You start by immersing yourself in operational workflows: observing, documenting, and deeply understanding processes end-to-end before designing solutions. Every recurring manual process is treated as a system defect, and every fix you ship is measured by its real-world impact on efficiency and reliability. You will work closely with licensed brokerage staff, domain experts, and platform engineers to build automations and tooling that allow Alpaca's operations to scale globally without scaling headcount linearly. The ideal candidate is equally comfortable shadowing an operational process and architecting the backend service that replaces it. Things You Get To Do - Design, build, test, deploy, and monitor production automations and UIs that remove manual steps and reduce operation time. - Partner with frontend engineers to productize ops tooling so global teams can run functions with predictable staffing. - Execute operational procedures to surface painful manual processes prior to automation. - Instrument and report baseline and outcome metrics (MTTC, manual-steps removed, queue sizes, ops satisfaction) and iterate based on measured impact. - Produce Platform Opportunity Briefs / RFCs for higher-level platform tooling and automations. - Collaborate with licensed BD leadership, Compliance, and Security to build auditable, safe automations with role-based access and clear runbooks. - Own the full lifecycle of the systems you build, including automated deployment (CI/CD with tools like ArgoCD and Terraform), proactive monitoring, On-call support rotations and incident response, following a "you build it, you run it" philosophy. - Build systems with auditability, traceability, and data lineage as a first-class concern to ensure transparency for our auditors and regulators. Qualifications - 5+ years of professional software engineering experience, with a proven track record of shipping and operating complex, large-scale systems in production. - Strong business sense and understanding of operations. - Deep, hands-on expertise in Golang, including a strong command of its concurrency models (goroutines, channels), memory management, and standard library. - Proven track record of building user-facing features end-to-end with Typescript/React. - Proficient with SQL and relational databases, preferably PostgreSQL. - Demonstrated ability to reason about human workflows as systems, not just software services. - Experience with observability, tracing, continuous profiling. - Exceptional analytical and problem-solving skills, with the ability to deconstruct complex requirements into clear technical components and excellent communication skills for working in a cross-functional environment. - High ownership mindset with bias toward durable, structural fixes over tactical patches. Requirements - Knowledge of service oriented architectures. - Experience with major cloud platforms (we primarily use GCP). - Financial market (exchange, broker-dealers, clearing, etc.) knowledge. - Experience with Docker and Kubernetes. - A passion for financial markets or the desire to learn. - Knowledge of Agile/Scrum methodologies. - Demonstrable experience in designing, building, and reasoning about distributed systems, including a strong understanding of microservices architecture and API design patterns (e.g., REST, gRPC). - Experience with capacity planning and benchmarking. Benefits - Competitive Salary & Stock Options. - Health Benefits. - New Hire Home-Office Setup: One-time USD $500. - Monthly Stipend: USD $150 per month via a Brex Card.
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
• Collaborate closely with the development team to deploy and maintain application infrastructure. • Assist in the development and support of tooling to streamline the deployment and maintenance of our products. • Work with Kubernetes, Docker, Helm and ArgoCD to deploy applications from development through to production environments. • Support both in-house and third-party applications, including handling deployments, upgrades, and troubleshooting. • Write and manage automation pipelines for application deployment and maintenance. • Provision and manage infrastructure using Terraform. • Document processes and best practices clearly and concisely.
Senior Site Reliability Engineer
PlayOn! SportsThe nation's leading high school media company providing live streaming and digital ticketing services.
• Contribute to system observability i.e implementing, improving metrics, alerting, and dashboards for better insight and faster recovery. • Develop automation, tooling, and monitoring solutions to support high service availability. • Partner with application and quality engineering teams to implement best practices in reliability, release automation, and testing. • Drive operational excellence through proactive incident prevention, blameless postmortems, and capacity planning. • Participate in on-call rotations to support critical services and ensure rapid response to incidents.
Site Reliability Engineer
Orion HealthRevolutionising global healthcare so every individual receives the perfect care for them.
• Ensure the reliability, availability and performance of cloud infrastructure and operating systems. • Design, manage and execute upgrade and maintenance schedules for clients. • Automate infrastructure processes and implement best practices. • Introduce new tools that enhance software delivery pipeline. • Produce upgrade and maintenance plans for all clients under responsibility. • Implement and review infrastructure monitoring and observability tools.
•Manage and optimize CI/CD pipelines and Devops. •Configure and maintain infrastructure (OS, webservers, VMs, Docker, Bitbucket, Networking). •Monitor systems, resolve incidents and help to keep systems up and running, especially during US EST time-zone. •Handle security updates and hardening for servers and applications. •Mentor and provide technical leadership to other teams.




