Banking for startups: mercury.com
Senior Infrastructure Engineer
Location
United States + 1 moreAll locations: United States | Canada
Posted
83 days ago
Salary
$200K - $250K / year
Seniority
Senior
Job Description
Senior Infrastructure Engineer
Mercury
Imagine being a pioneer, venturing through the uncharted territories of the cloud. You're not just navigating; you're shaping the landscape, constructing robust architectures that withstand the tests of time and scale. At Mercury, your mission, should you choose to accept it, is to help steer our cloud infrastructure into the future. With projects as dynamic as migrating our entire fleet to ECS and building out our golden paths for service deployment, your role is pivotal. This isn't just a job; it's an epic tale of transformation and triumph. As a senior member of our infrastructure team, you will be equipped with essential tools and technologies designed for scaling and enhancing Mercury's infrastructure: - AWS Services: Proficiently utilize EC2, RDS, IAM, Networking, Opensearch, and ECS to build and manage robust cloud environments. - Terraform: Leverage Terraform for infrastructure as code to efficiently manage and provision our cloud resources. - Agentic Infrastructure: Build the frameworks around using AI safely in our infrastructure, both for the agents and the users that kick off those agents. - Monitoring and Observability Tools: Employ Prometheus, Grafana, Opensearch, and OpenTelemetry to maintain high availability and monitor system health. - Version Control and CI/CD: Manage code and automate deployments using GitHub & GitHub Actions. As we gear up for the next stages of Mercury's growth, you will: - Build our “Infrastructure Platform” to support the growing needs of the Engineering Organization. - Focus on building a platform that is AI friendly while still usable for engineers. We want our users to be humans and Agents. - Lead key infrastructure projects, break-down complex initiatives, and define our infrastructure strategy through detailed RFCs and technical specifications. Must haves: - You have 5+ years of experience with AWS - You have extensive experience, ideally 3 years or more, with observability and monitoring tools like Prometheus, Grafana, and OpenTelemetry, optimizing system performance and reliability. - You have demonstrated ability in technical writing, with at least 3 years of experience creating detailed technical documentation, RFCs, and tech specs that clearly communicate complex ideas. The ideal candidate should: - You bring at least 2 years of experience leading infrastructure projects in regulated environments such as HITRUST or SOC2, ensuring compliance and security. - You have 3+ years of experience managing large-scale Terraform implementations, including the setup and maintenance of Terraform CI/CD pipelines. - You have 2+ years of experience writing code. We are building an Infrastructure Platform from scratch and there is plenty of code to write to support that. - Experience mentoring and elevating those around you, we are force multipliers for the engineering org. If this role interests you, we invite you to explore our public demo at demo.mercury.com. The total rewards package at Mercury includes base salary, equity, and benefits. Our salary and equity ranges are highly competitive within the SaaS and fintech industry and are updated regularly using the most reliable compensation survey data for our industry. New hire offers are made based on a candidate’s experience, expertise, geographic location, and internal pay equity relative to peers. Our target new hire base salary ranges for this role are the following: - US employees: $200,700 - $250,900 - Canadian employees: CAD $189,700 - $237,100 Mercury values diversity & belonging and is proud to be an Equal Employment Opportunity employer. All individuals seeking employment at Mercury are considered without regard to race, color, religion, national origin, age, sex, marital status, ancestry, physical or mental disability, veteran status, gender identity, sexual orientation, or any other legally protected characteristic. We are committed to providing reasonable accommodations throughout the recruitment process for applicants with disabilities or special needs. If you need assistance, or an accommodation, please let your recruiter know once you are contacted about a role. We use Covey as part of our hiring and / or promotional process for jobs in NYC and certain features may qualify it as an AEDT. As part of the evaluation process we provide Covey with job requirements and candidate submitted applications. We began using Covey Scout for Inbound on January 22, 2024. [Please see the independent bias audit report covering our use of Covey for more information.] #LI-ME1
Related Guides
Related Categories
Related Job Pages
More Infrastructure Engineer Jobs
Senior eDiscovery Analyst & Infrastructure Engineer
Array.comArray is a financial services company that is on a mission to use meaningful information-sharing to help businesses form deeper bonds with their customers. As a
Job Description Summary The Senior eDiscovery Analyst & Infrastructure Engineer is a senior-level, hands-on role responsible for the administration, reliability, and evolution of Relativity and other eDiscovery platforms, along with the supporting infrastructure and applications they depend on. This position serves as a technical escalation point for complex platform and infrastructure issues and partners closely with IT Operations, Application Development, Project Management, and Security teams. This role emphasizes eDiscovery platform expertise, infrastructure engineering, and operational reliability, with sufficient SQL Server knowledge to support Relativity and other platform troubleshooting while collaborating with dedicated database administrators. Core Responsibilities eDiscovery Platforms & Infrastructure Engineering - Administer and support Relativity and other eDiscovery platforms and dependent applications - Own platform health and operations (upgrades, configuration, integrations, dependencies) - Engineer and support the Windows Server infrastructure underpinning eDiscovery platforms - Maintain required components for availability/resilience (Windows, SQL, analytics, etc.) - Troubleshoot complex cross-stack issues (application, network, storage, clustering, OS) - Partner with virtualization, storage, network, and security teams on performance and scalability - Provide guidance on architecture, capacity planning, and infrastructure best practices Data Migration Planning & Execution - Lead or contribute to large-scale migration initiatives for Relativity, including: - Planning and executing workspace, data, or instance migrations - Coordinating technical cutovers and minimizing client and operational impact - Validating post‑migration performance, functionality, and data integrity - Collaborate with PMs, eDiscovery teams, and infrastructure staff to define migration scope, risks, dependencies, and success criteria - Document migration strategies, lessons learned, and repeatable approaches for future initiatives Database & SQL Support (Platform‑Focused) - Provide SQL Server support as it relates to Relativity and other eDiscovery platforms, including: - Troubleshooting platform‑related database performance or connectivity issues - Assisting with restores, validations, and environment troubleshooting as needed - Reviewing queries, indexing behavior, and execution plans in support of platform issues - Collaborate closely with the primary SQL/database administrator on database‑level changes, tuning, and improvements - Maintain a working knowledge of SQL Server high‑availability concepts sufficient to support platform reliability and incident response Operational Support & Escalation - Serve as a senior escalation resource for platform, infrastructure, and application issues impacting eDiscovery operations - Respond to high‑priority incidents affecting Relativity, other eDiscovery tools, and supporting systems - Act as a technical bridge between IT, PMs, and eDiscovery teams during incidents and critical workflows - Provide clear incident communications (status, root cause, remediation plan) - Participate in on‑call or after‑hours support, as needed Documentation & Process Improvement - Create and maintain technical documentation, runbooks, and operational procedures for eDiscovery platforms and infrastructure - Drive continuous improvement of platform operations, infrastructure standards, and support workflows - Participate in change management, release planning, and post‑incident reviews Security & Compliance Support - Support security and compliance requirements related to eDiscovery platforms and supporting infrastructure - Assist with audit evidence collection, technical control implementation, and platform hardening efforts - Support investigation and documentation of infrastructure or platform‑related security incidents Required Experience & Qualifications Required - 7+ years in eDiscovery, infrastructure engineering, or related technical roles - Hands-on production administration of Relativity - Experience supporting other eDiscovery platforms and/or complex business applications - Working SQL Server knowledge for platform troubleshooting and partnering with DBAs - Strong Windows Server administration; Failover Clustering/high availability experience - Proven ability to troubleshoot complex, multi-layer issues independently as a senior technical resource Preferred / Highly Desired - Relativity Certified Infrastructure Specialist or equivalent hands‑on experience - Experience with Relativity migrations, upgrades, or environment consolidations - Experience in virtualized environments (Hyper‑V, VMware, or similar) - Familiarity with enterprise storage and performance troubleshooting - Experience supporting regulated or high‑security environments - Strong technical documentation skills Key Competencies - Deep technical ownership and accountability - Strong troubleshooting and analytical skills - Clear communication with technical and non‑technical stakeholders - Ability to mentor junior engineers and analysts - Focus on stability, reliability, and operational excellence The base Compensation range for this role will $125,000 - $145,000 and will be dependent upon the individual's location, skills, experience and qualifications. In addition to the base pay, this role will be eligible for a discretionary bonus program. What We Offer • People-Focused Culture • Competitive Pay & Quarterly Incentives • Comprehensive Benefits, 401k & Wellbeing Programs (link for details) • Flexible Time Off & Remote Work Options • Professional Development & Career Growth Opportunities • Exposure to cutting-edge technology in the legal services industry Array is committed to providing equal employment opportunities to all individuals. We ensure that all hiring decisions are made without unlawful consideration of any person's race, color, religion, national origin, age, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender identity or expression, veteran status, disability, genetic information, marital status, citizenship, ancestry, or any other basis protected by applicable local, state, provincial, or federal law. We are dedicated to making our application process accessible. If you require an accommodation or assistance due to a disability, please notify us. Your request will be handled confidentially, and your application status will not be negatively affected. We strive to maintain a diverse, inclusive, and fair workplace where all team members are valued and respected. All persons hired will be required to complete a comprehensive background check and provide proof of eligibility to work in the country of the job location.
• Administer and support Relativity and other eDiscovery platforms and dependent applications • Own platform health and operations (upgrades, configuration, integrations, dependencies) • Engineer and support the Windows Server infrastructure underpinning eDiscovery platforms • Maintain required components for availability/resilience (Windows, SQL, analytics, etc.) • Troubleshoot complex cross-stack issues (application, network, storage, clustering, OS) • Partner with virtualization, storage, network, and security teams on performance and scalability • Provide guidance on architecture, capacity planning, and infrastructure best practices • Lead or contribute to large-scale migration initiatives for Relativity • Document migration strategies, lessons learned, and repeatable approaches for future initiatives • Provide SQL Server support as it relates to Relativity and other eDiscovery platforms • Serve as a senior escalation resource for platform, infrastructure, and application issues impacting eDiscovery operations • Create and maintain technical documentation, runbooks, and operational procedures for eDiscovery platforms and infrastructure • Support security and compliance requirements related to eDiscovery platforms and supporting infrastructure
Infrastructure Engineer II
Emergent HoldingsWe are an Equal Opportunity Employer. We will not tolerate discrimination or harassment in any form. Candidates for the position stated above are hired on an "at will" basis. Nothing herein is intended to create a contract.
The Infrastructure Engineer II role is responsible for the operations of secure and highly available computing platforms, servers, and networks. This role is responsible for installing, maintaining, upgrading, and continuously improving the Company’s operating environment while maintaining the ongoing reliability, performance, and support of the infrastructure. This includes monitoring the operating environments, responding to incidents, problems and planning for growth. Infrastructure Engineers deploy the release of new technologies as well as design, install, configure, maintain, and perform testing of PC/server operating systems, networks, and related utilities and hardware. Additionally, this role’s responsibilities include troubleshooting problems as reported by users, supporting Web access and electronic messaging services and maintaining a secure systems environment. - May act as a technical project leader or provide work leadership for lower-level employees. - Participates in and may lead groups/committees related to processes, standards, and best practices. - Mentors peers and IT personnel less senior. - Acts as oversight for the network administrator duties to ensure established standards and policies are consistently being followed. - Mentors and accepts trouble reports from operations and less experienced technical support personnel. - Investigates and analyzes resource utilization and prepares reports. - Optimizes the network infrastructure to maintain the highest possible level of performance and security. - Plans for the replacement of obsolete resources that make up the enterprise network infrastructure. - Recommends new software and hardware that provide new features/functions and prepares documentation to support recommendation of new software/hardware. - Conducts appropriate, routine tests to ensure the proper working condition and security of developed and purchased software/hardware. - Coordinates and schedules initial installation of new equipment or reinstallation of relocated equipment. - Maintains an up-to-date technical and practical knowledge and understanding of system testing and analysis. - Ensures security among data management and associated processes, including adhering to data retention policies, file systems and data transfers. - Ensures maintenance and support agreements are in place with the vendor and licensing is up to date. - Build, support and maintain the protection of company data and systems using security solutions, backups, redundancy, and disaster recovery solutions. - Perform installation, maintenance and support of system software, hardware, and infrastructure for moderately complex projects. - Management of asset lifecycle process through planning and forecasting, including recommendation for replacement of obsolete or end-of-life resources as needed. - Interacts with external vendors to evaluate technology changes, including licensing and contracts, and their impact on the business. - Investigates and analyzes resource utilization and prepares reports and metrics, making appropriate changes to optimize the infrastructure and provide the highest possible level of performance and security. - Conducts appropriate, routine tests to ensure the proper working condition and security of developed and purchased software/hardware. - Coordinates and schedules initial installation of new equipment or reinstallation of relocated equipment. - Recommends improvements and changes to methods and procedures. - Maintains an up-to-date technical and practical knowledge and understanding of system testing and analysis. - Troubleshoots and resolves complex issues that involve the core operating system or desktop components, performing root cause analysis for service interruption and implementing preventative measures. - Analyze complex local and wide area network systems, including planning, designing, evaluating, selecting operating systems and protocol suites and configuring communication media with concentrators, bridges and other devices. - Resolves difficult interoperability problems to obtain operation across all platforms including e-mail, files transfer, multimedia, teleconferencing and the like. - Configures systems to user environments. Supports acquisition of hardware and software as well as subcontractor services. - Acts as a focal point and subject matter expert for all network administrator responsibilities. - Acts as oversight for the network administrator duties to ensure established standards and policies are consistently being followed. - Performs administrative language (shell) programming. - Build, support and maintain the VDI environment, including standardized templates, application delivery methods, enterprise antivirus, persona management infrastructure, host servers and zero clients. - Build, support and maintain the configuration management suite, including multiple Windows Enterprise desktop images (WIMs), automated task sequences, an automated backup infrastructure, central imaging environment, automated software deployment and patch management updates. - Build, support and maintain Active Directory Group Policy Objects that are used to secure and configure the enterprise client-computing infrastructure. - Build, support and maintain role-based access control for enterprise computing solutions, including VDI, endpoint management suite and Print/Scan solutions. - Build, support and maintain support documents, standard operating procedures and policies related to the network, security systems, server and storage infrastructure, and the client-computing infrastructure. - Work with application developers and third-party vendors to develop software or virtualized installation packages for network deployment via the Enterprise configuration management suite. - Build, support and maintain the enterprise network, security systems, servers and storage infrastructure, both virtual and physical, through the use of routine commands, administration techniques, tools, and utilities. - Mentor Network Infrastructure engineers in maintaining multi cloud ecosystem. - Understanding of network security-based tools available within in Muli-cloud infrastructure. - Follow industry best practices for maintaining Muli-cloud workloads. - Leverage the right tools to script configuration changes that rescale, resize and reform the workload ecosystems through automation. - Partnering with internal business client(s), the Middleware Advisor will have a clear understanding of the context of the application and its usage to ensure successful implementation. - Train members of the technical staff and interface with user groups to resolve project and production issues. - Research and plan implementation of middleware upgrades, security patches and bug fixes. - Assist the development staff as the technical lead for development, integration, test, and production environments. - Responsible for technical recovery plans and lead disaster recovery planning and exercises. - Research and analyze systems, design, implement and conduct thorough Performance Test (PT) reviews in accordance with Application Middleware Infrastructure to accommodate new systems to the shared infrastructure and provide system capacity and enhancements. - Produce periodic reports for management and staff. - Serve as a point of contact for middleware applications and manage vendor relationships. - Leverage middleware technologies to provide robust, cutting-edge integration solutions to achieve new goals and meet new challenges rapidly and cost-effectively fully and successfully. - Mentor Middleware Infrastructure engineers in maintaining multi cloud ecosystem. - Understanding of middleware infrastructure security-based tools available within in multi-cloud infrastructure. - Follow industry best practices for maintaining Muli- cloud workloads. - Leverage the right tools to script configuration changes that rescale, resize and reform the workload ecosystems through automation. - Mentor Middleware Infrastructure engineers in maintaining multi cloud ecosystem. - Understanding of network security-based tools available within in multi-cloud infrastructure. - Follow industry best practices for maintaining multi-cloud workloads. - Leverage the right tools to script configuration changes that rescale, resize and reform the workload ecosystems through automation. - Serves as a technical leader in the installation and configuration of off the shelf enterprise-level applications. - Assists development with analysis and design for new or existing systems. - Provides some scripting (Windows / Unix) as needed. - Participates in and may lead groups/committees related to processes, standards and best practices. - Ensures documentation is up to date. - Supports a wide number and type of applications, including middleware and batch activity through operational (maintenance) and/or new project implementation(s). - Installs and configures off the shelf enterprise-level applications. - Ensures security among data management and associated processes, including adhering to data retention policies, file systems and data transfers. - Identifies, tracks, resolves, responds, and raises awareness of complex application and data issues. - Develops and maintains technical documentation for supported applications (implementation design documents, support activities, etc.). - Provides 24x7 support as needed. - Acts as a focal point and subject matter expert for all telecommunications analyst responsibilities. - Participates in and may lead groups/committees related to processes, standards, and best practices. - Mentors peers and IT personnel with respect to the telecommunications infrastructure. - Act as a proxy for management as needed. - Ensures established standards and policies are consistently being followed as it relates to Telecommunications. - Maintain up-to-date technical knowledge and understanding of applications. EDUCATION - Bachelor's degree in computer science, information technology, or related field required. - Certification or progress toward certification of, industry-recognized professional designation preferred and encouraged. - Combinations of relevant education and work experience may be considered in lieu of a degree. - Continuous learning, as defined by Company’s learning philosophy, is required. EXPERIENCE - 5 years’ experience within an IT environment which provides the necessary skills, knowledge and abilities. - One-year relevant experience supporting personal computers in a multi-site, multi-platform environment as well as telephone support of remote staff preferred. - Experience within the insurance industry highly preferred. QUALIFICATIONS - Exceptional customer support with proven track record of positive outcomes. - Consistent and proficient demonstration of required job SKA which exceed standard job expectations. - Consistent and proficient demonstration of troubleshooting that demonstrates a comprehensive and holistic understanding of systems integration. - Recognized as a technically credible role model. - Demonstrated ability to mentor and coach others with less experience. - Knowledge of applications and platforms including Microsoft Exchange, Microsoft Active Directory and Group Policy, Microsoft Office and Office 365, DNS servers, DHCP servers and Lightweight Directory Access Protocol. - Considerable knowledge of, and the ability to practically apply, necessary testing, practices and procedures. - Knowledge of IT system installation, configuration, and maintenance. - Strong software Development Life Cycle principles, processes, tools, and techniques. - Knowledge of performance measuring and monitoring of IT systems. - Ability to apply the principles of independent logical thinking to define problems, collect data, establish facts, and draw valid conclusions. - Ability to comprehend the consequences of various problem situations and to refer them for appropriate decision-making. - Ability to handle multiple priorities, establish workflows, and meet necessary deadlines. - Ability to work with minimum supervision. - Excellent oral and written communication skills. - Ability to communicate factual and technical information with customers clearly and concisely. - Ability to effectively exchange information clearly and concisely and to present ideas, report facts and other information and respond to questions as appropriate. - Ability to maintain confidentiality. - Ability to perform other assignments at locations outside the office. - Ability to work varying hours, including evenings, weekends and holidays as required. - Ability and proficiency in the use of computers and company standard software. - Must be able to work collaboratively as well as independently. Networking Qualifications - Advanced knowledge of IT systems, including networking, server, storage, and applications. - Demonstrated leadership ability with proven results as a team facilitator/leader within multi-functional teams. - Advanced knowledge of applications and platforms including Microsoft Exchange, Microsoft Active Directory and Group Policy, Microsoft Office and Office 365, DNS servers, DHCP servers and Lightweight Directory Access Protocol. - Advanced technical knowledge of Microsoft System Center Configuration Manager, VMware vSphere 5.x or later, VMware View 5.x or later. - Advanced knowledge of, and the ability to practically apply, necessary testing, practices, and procedures. Middleware Qualifications - Ability to work effectively with all levels of management and different business partner organizations. - Ability to document and develop standard operating procedures for day-to-day activities. - Demonstrated ability to go above and beyond to support the business needs and aid in resolution. - Working knowledge implementing, migrating, and upgrading several middleware application platforms. - Strong skills in mentoring staff on their technical challenges and help them become successful. - Ability to set roadmaps for sunsetting or upgrading obsolete software. Platform Qualifications - Strong leadership, negotiation, conflict management and facilitation skills. - Ability to set priorities and manage workload to meet those priorities. - Ability to work effectively with all levels of management and different business partner organizations. - Strong technical writing and documentation skills. - Advanced knowledge of performance measuring and monitoring of IT systems. Telecommunications Qualifications - Consistent and proficient demonstration of troubleshooting that demonstrates a comprehensive and holistic understanding of telecommunications systems. - Consistent and proficient demonstration of required job SKA which exceed standard job expectations. - Advanced knowledge of telecommunications systems and applications. - Demonstrated ability to resolve and collaborate on complex, multifaceted issues. - Demonstrated leadership ability with proven results as a team facilitator/leader within multi-functional teams. PAY RANGE: “Actual compensation decision relies on the consideration of internal equity, candidate’s skills and professional experience, geographic location, market and other potential factors. It is not standard practice for an offer to be at or near the top of the range, and therefore a reasonable estimate for this role is between $64,900 and $147,250.” We are an Equal Opportunity Employer. We will not tolerate discrimination or harassment in any form. Candidates for the position stated above are hired on an "at will" basis. Nothing herein is intended to create a contract. #LI-CH1 #AFG Accident Fund Insurance Company of America
Remote Site Services Data Center Manager, Infrastructure Services
AppleWell-known for creating the Mac, iPhone, iPad, and Apple Watch, as well as its App Store, Apple Music, Apple Pay, and iTunes services, Apple's goal is to leave the world better tha
Role Description The RDC Site Service Manager will be responsible for overseeing and managing the deployment, maintenance, and operational excellence of our data center infrastructure across the AMR region. This is a critical role that requires a strong leader with a proven track record in managing Remote Data Center teams, vendors, and infrastructure projects. You will ensure the reliable and efficient operation of our data centers, supporting the company's critical services and growth. This role involves frequent travel across America and other global regions. Qualifications - Bachelor's or Master's degree in Electrical Engineering, Mechanical Engineering, Computer Science, or a related field, or equivalent work experience. - 2+ years of experience managing or leading data center operations, with practical experience in areas like rack sizing, construction, equipment installation, and/or decommissioning. - Hands-on experience with copper and fibre cabling structures, complemented by proficiency in areas such as cable infrastructure design and/or labeling methodologies. - Experience with DCIM tools (e.g., Nlyte, Sunbird, Proprietary DCIM tools) and enterprise-level management tools (e.g., Service Now, APM, Splunk, Wrike, Quip, radar, Ansible) for securing and monitoring equipment. - Ability to travel domestically and internationally. Requirements - Certification in data center operations or a related field (e.g., CDOM, CDFOM, CDCE, CDCS). - Strong understanding of data center infrastructure, including power, cooling, networking, and security systems. - Proven ability to manage data center projects, ensuring delivery on time and within budget. - Proven track record in remote vendor management for infrastructure expansion and compliance. - Excellent written and verbal communication skills, with the ability to effectively communicate with a variety of audiences, including technical and non-technical stakeholders. - Ability to work effectively in a global, multi-cultural, multi-location organization. - In-depth experience with a specific DCIM platform (e.g., Nlyte, Sunbird, Schneider Electric StruxureWare) and a proven ability to customize and optimize the platform for specific business needs. - Experience in data center design and construction, including knowledge of industry standards and best practices (e.g., TIA-942, Uptime Institute Tier Standards).



