Senior Site Reliability & Platform Engineer
Location
United States
Posted
51 days ago
Salary
$130K - $170K / year
Seniority
Senior
Job Description
Inktavo
Engineering @ MergeCo
At MergeCo, we aren't just merging two companies; we are building a unified, scalable foundation for the future of our industry. Our IT philosophy is rooted in Governance over Administration. We don't just "fix tickets"; we build the secure, automated "rails" that allow our Engineering and Business teams to run their "trains" at high velocity.
We are seeking a Senior Site Reliability & Platform Engineer who views infrastructure as code and security as a baseline requirement. You will be a key architect in defining our shared responsibility model, ensuring that while we provide the platform, the platform provides the guardrails. In this role, you will be a systems thinker who understands that IT is an enabler, focusing on building robust platforms rather than performing arbitrary third-party integrations.
Day in the Life
- Platform Engineering: Design and maintain our core Kubernetes and Cloud Native environments within GCP, AWS, and Azure, ensuring high availability, scalability, security, and seamless deployment patterns.
- Observability & Reliability: Implement a comprehensive observability stack to provide deep insights into system health, performance, and security posture.
- Cross-Cloud Strategy: While GCP is our primary home, you will provide expertise in integrating and bridging legacy or specialized workloads in Azure and AWS.
- Automation & Lifecycle: Build automated, repeatable processes for provisioning and deprovisioning infrastructure, reducing manual toil to near zero.
- The "Rails" Philosophy: Develop self-service tools that empower DevOps and Engineering teams to manage their own tool configurations while remaining compliant with MergeCo security standards.
Who You Are
- A Systems Thinker: You understand that IT is an enabler. You focus on building robust platforms rather than performing arbitrary third-party integrations.
- Kubernetes Expert: You have deep experience managing production-grade clusters (GKE preferred) and understand the intricacies of service meshes, networking, and container security.
- Cloud Polyglot: GCP is your native tongue, but you are fluent enough in Azure and AWS to navigate complex multi-cloud environments.
- Security-First Mindset: You treat security as a core feature of reliability, not an afterthought.
- Collaborative Partner: You prefer "Partnership" over "Gatekeeping," working with business units to define where the platform ends and their application
Must Haves
- 5+ years in SRE, DevOps, or Platform Engineering roles.
- Expert-level Kubernetes orchestration and containerization (Docker/Containerd).
- GCP Professional Cloud Architect or equivalent experience (IAM, VPCs, GKE, Cloud Operations).
- IaC Mastery: Deep proficiency in Terraform, CDK, or Pulumi.
- Observability: Experience with Prometheus, Grafana, ELK, or Datadog to drive SLIs/SLOs.
- Familiarity with Azure/AWS for hybrid-cloud connectivity and migrations.
- Scripting/Coding: Proficiency in Go, Python, or similar for tooling and automation.
Nice to Haves
- Cloud Polyglot: Familiarity with Azure/AWS for hybrid-cloud connectivity and migrations.
- Observability Tooling: Experience with Prometheus, Grafana, ELK, or Datadog to drive SLIs/SLOs.
- Experience navigating complex multi-cloud environments.
A Few of the Perks
- Competitive benefits
- Unlimited PTO
- Remote work available for U.S.-based candidates
- 401(k) with employer match
- Paid parental leave
- In-office benefits for those local to Dallas, TX:
  - Catered lunches
  - Casual office atmosphere & located in the Design District
  - Fully stocked kitchen
More Platform Engineer Jobs
Staff AI Platform Engineer, Corporate AI Systems
Cribl
Cribl, the Data Engine for IT and Security, empowers organizations to transform their data strategy.
• Define and own the architecture for Cribl’s internal AI platform, LLM deployments, MCP gateway design, orchestration patterns, and the shared services required to run AI use cases safely at scale
• Establish the identity and access model for AI systems, including distinct non-human identities, scoped credentials, audit logging, cost controls, and token governance infrastructure that supports least-privilege access
• Build safe, reusable sandbox environments and self-service patterns that allow business and technical teams to experiment with AI inside a governed framework rather than through ad hoc or unapproved tooling
• Design the connective tissue between AI tooling and Cribl’s enterprise systems, helping define secure patterns for integrating with platforms such as Salesforce, NetSuite, Workday, Jira, Confluence, Slack, Google Drive, Glean, and other business-critical tools
• Work hand in hand with the AI Security team to ensure secrets management, MCP governance, prompt-injection defenses, AI telemetry, and compliance-ready controls are built into the platform
• Stand up the platform capabilities needed for AI-accelerated development, including AI coding infrastructure and guardrails, DevOps pipeline integration, and secure workflows that help builders move faster without compromising quality or security
• Define and track the metrics that matter most for a shared AI platform, including platform availability, reliability, usage, adoption, guardrail effectiveness, cost efficiency, and time to enable new use cases
• You may be required to occasionally perform duties outside your standard working hours
Platform Engr Sr
CSG
CSG delivers innovative customer engagement solutions that help you acquire, monetize, engage and retain customers.
Hi, I'm Bharanidharan, your Recruiter and guide to joining CSG! We are excited to learn more about you and your unique background.
We are looking for an experienced and creative Platform / Systems Engineer to join our agile and distributed Solution Systems team. This team’s key responsibilities involve ensuring the deployment and operational integrity of our next generation product suite, from initial customization through to in-life deployment for any given Customer. This includes solution verification, versioning, staged deployment and operations through to the customer. The work brings together many release trains and products in development and ensures we have a quality, consistent solution for every release and for every customer.
We are looking for a Platform / Systems Engineer who will be responsible for:
- Automating deployment and environment workflows end-to-end using Terraform and other Infrastructure-as-Code tools.
- Designing, building, and maintaining cloud-based development and test environments to support reliable and efficient delivery.
- Analyzing existing infrastructure, pipelines, and processes and improving them for reliability, consistency, and scalability.
- Integrating and “weaving together” various systems, services, and tools to deliver more efficient and cohesive platform solutions.
- Supporting internal teams by acting as an SME on platform tooling (for example, CI/CD pipelines, DevOps tooling, and development environments).
- Identifying areas of improvement in the platform ecosystem and proposing practical, secure, and cost-efficient solutions.
- Writing strong, maintainable, and creative automation scripts in Bash, Python, PowerShell, or similar languages to provide both quick fixes and longer-term solutions.
- Implementing and operating monitoring, alerting, and enhanced cost controls so environments remain healthy, secure, and well-governed.
- Being productive and communicating effectively within a distributed team and with stakeholders across engineering and business functions.
Is this opportunity right for you? We are looking for candidates who have:
- Bachelor’s degree in computer science or related field, or equivalent practical experience.
- Minimum of 5 years of relevant experience, especially in Platform Engineering, DevOps, or Infrastructure Engineering.
- Strong experience using Git for version control, including branching strategies, pull requests, and collaborative workflows.
- Hands-on experience with AWS (or another major cloud provider) and an understanding of foundational cloud services.
- Strong scripting background across languages such as Bash, Python, and PowerShell.
- Prior use of Infrastructure-as-Code technologies such as Terraform (or tools like CloudFormation, Chef, Puppet, or Ansible).
- Excellent understanding of containerization technologies such as Docker, Kubernetes, and OpenShift.
- Familiarity with GitOps and related tools such as Flux and Argo.
- Comfortable working on both Windows and Linux platforms.
- Enjoys automation, problem-solving, and improving the engineering experience for other teams.
- Proactive, curious, and able to partner well across teams and disciplines.
Location(s): India Remote
Accommodation: If you would like to be considered for employment opportunities with CSG and need special assistance due to a disability or accommodation for a disability throughout any aspect of the application process, please call us at +1 (402) 431-7440 or email us at accommodations@csgi.com. CSG provides accommodations for persons with disabilities in employment, including during the hiring process and any interview and/or testing processes.
Our Guiding Principles:
- Impact: Always help and empower others, whether they’re colleagues or customers. When our employees set their minds to something, great things happen.
- Integrity: Do what’s right for our customers and our people while being authentic. We treat everyone with trust and respect. That’s just who we are.
- Inspiration: Be bold in the way you think and passionate about the work you do. Test out innovative ideas without the fear of failure.
Our Story: CSG empowers companies to build unforgettable experiences, making it easier for people and businesses to connect with, use and pay for the services they value most. For over 40 years, CSG's technologies and people have helped some of the world's most recognizable brands solve their toughest business challenges and evolve to meet the demands of today's digital economy. By channeling the power of all, we make ordinary customer and employee experiences extraordinary. Our people [CSGers] are fearlessly committed and connected, high on integrity and low on ego, making us the easiest company to do business with and the best place to work. We power a culture of integrity, innovation, and impact across our locations, representing the most authentic version of ourselves to build a better future together. That's just who we are.
Platform Engineer
SingleStore
The cloud-native database built with speed and scale to power real-time applications.
• Develop, enhance, and maintain self-hosted CI/CD infrastructure using modern DevOps tooling (GitHub Actions Runners stack, ArgoCD, FluxCD, etc.).
• Automate infrastructure provisioning, configuration management, monitoring, and operational workflows using IaC and scripting languages.
• Own the deployment, maintenance, and lifecycle management of systems supporting engineering (Kubernetes clusters, container registries, artifact systems, and internal developer platforms), leveraging deep expertise in Kubernetes, container runtimes, and the broader cloud-native ecosystem (Helm, Kustomize, etc.).
• Troubleshoot complex infrastructure and application issues, driving root-cause analysis and developing long-term remediation solutions.
• Design, build, and maintain cloud infrastructure across major cloud providers (AWS, GCP, Azure), and develop/support deployments of applications, services, and monitoring with a strong focus on scalability, reliability, and cost optimization.
• Develop internal tooling and automation using Terraform, Python, Go, or similar languages to streamline operational tasks and improve developer productivity.
• Implement and manage security best practices across cloud environments, including identity management, secrets handling, audit logging, and network controls.
• Leverage AI/ML tools to automate repetitive DevOps tasks and operational workflows.
Specialist/Tech & Functional SME - Power Platform & Sharepoint focus
Control Risks
The global specialist risk consultancy - Helping organisations succeed in a volatile world
Role Description
The location preference for the role is New Delhi, but it can also be Mumbai/Remote, with working hours of 12:30 to 21:00 IST.
The role provides both technical and functional expertise in the Power Platform, SharePoint, and D365 CE areas, with a focus on enhancing the performance, efficiency, and capability of the Control Risks business. This is a hands-on role and the holder will equally lead the development, design, implementation, and support of Power Platform & SharePoint solutions as well as optionally provide configuration and enhancements to Dynamics 365 CE. The role requires strong communication to a non-technical audience, and the holder needs to be able to work at a strategic level as well as have hands-on tactical and operational skills to define work and troubleshoot and resolve issues effectively.
What You'll Do:
- Work closely with the Enterprise Applications Solution Architect and Technical Delivery Manager to design and deliver Power Platform and SharePoint solutions including architecture and design decisions.
- Design and implement solutions using PowerApps (Canvas and Model Driven), Power Automate, Power BI, Microsoft Dataverse, and SharePoint.
- Establish and enforce Power Platform environment policies, Data Loss Prevention (DLP) rules (in conjunction with the Infosec team), and license optimization using the Center of Excellence (CoE) toolkit.
- Build complex, efficient Power Apps and automated workflows to streamline operations.
- Integrate Power Platform solutions with external data sources, including SharePoint, Azure SQL, REST APIs, and the Microsoft Dynamics 365 suite.
- Provide expert-level guidance, documentation, and best practices to developers (including citizen developers) and stakeholders.
- Manage Application Lifecycle Management (ALM) by configuring CI/CD pipelines for solution deployment across environments.
- Work with the Business Teams to understand opportunities/requirements, develop user stories in DevOps (or ITSM), and define product backlog.
- Inspire the business through demonstrations of the possibilities of Power Platform, SharePoint, and their wider capabilities/integration to meet Control Risks needs and strategic ambitions.
- Explore the functionality of AI tools including Copilot, identify use cases within Control Risks, and advise on the approach and implementation.
- Advise customers on best practices for Power Platform & SharePoint processes, user interface, and architecture.
- Develop functional designs, test plans, and scripts where applicable.
- Develop and deliver the configuration/customization/integration/enhancements to the solutions as necessary.
- Prepare and represent Power Platform and SharePoint changes at CAB.
- Work with other teams on ad hoc projects as an SME utilization resource from the Enterprise Apps team to complete project tasks on time and within budget.
- Provide technical support for D365 CE solutions focusing on Sales, Project Operations, Customer Service, and Omnichannel.
- Drive innovation and develop solutions that improve the efficiency and capability of Power Platform and D365 CE users.
- Assist in the platform releases (upgrades) with regression testing and work with the business to arrange functional tests.
- Provide Go-live/Hypercare support where relevant.
- Resolve tickets and issues raised by Business Teams in accordance with internal service levels, processes, and procedures (L3/L4 Support).
- Implement product best practices based on Microsoft Dynamics standards and supported configurations.
- Help and provide guidance with training SMEs in areas specific to their BAU activities.
- Work with the business SMEs to share knowledge, answer questions, and drive a knowledge base.
- Data migration: Assist with the facilitation of data sets for ETL activities and work with the business teams.
Qualifications
- 5+ years of development and extensions using the Power Platform: Canvas-driven Apps, Model-driven Apps, Flow, Power Virtual Agents, Dataverse & SharePoint based solutions.
- Desirable: 3+ years of experience configuring and implementing Dynamics 365 CE, including technical development skills (customization, extension, and integration).
- Bachelor’s degree and 5+ years relevant work experience.
- Proven experience in business analysis with organizations of various sizes and complexity is an advantage.
- Excellent problem-solving and analytical skills.
- Business process mapping and optimization: experience leading ongoing reviews of business processes and developing optimization strategies.
- Good interpersonal skills, possessing the confidence to build relationships with all levels of stakeholders.
- Must have in-depth knowledge of Microsoft Power Platform and SharePoint, including deep knowledge of Power Apps (Canvas & Model-driven), Power Automate, Dataverse, Power BI.
- Good understanding of Dataverse security models, ALM, and governance practices.
- Good grasp of new/emerging technology trends; experience of Copilot for Power Platform, Power Virtual Agents (Copilot Studio), or AI Builder will be a significant advantage.
- Background in .NET or C# development is desirable.
- Any experience with D365 CE is an advantage.
Requirements
- Problem Solving: Owns problems, identifies and works with the right people to solve problems quickly within own remit and wider team(s).
- Innovation & Creativity: Reviews and looks for efficiencies in ways of working; constantly seeks innovative ways to improve services offered to clients.
- Applied Thinking/Decision Making: Is prepared to make decisions and effectively implement those decisions.
- Results Oriented: Delivers on personal objectives that contribute to strategic and department plans; focuses on delivery and strives to exceed expectations.
- Driving profit/margin improvement: Suggests and makes improvements and efficiencies to manage costs and improve margins.
- Communication, planning work, and influencing: Communicates clearly and concisely using language appropriate to the audience.



