Build software faster. The One DevOps Platform enables your entire org to collaborate around your code. We're hiring.
Engineering Manager – Observability, Monitoring, Integrations
Location
India
Posted
4 days ago
Salary
0
Seniority
Senior
Job Description
Engineering Manager – Observability, Monitoring, Integrations
GitLab
• Architect end-to-end telemetry and observability across CustomersDot, Salesforce, and Zuora to enable real-time monitoring and faster issue resolution across purchasing and provisioning flows. • Implement automated billing anomaly detection to identify event drops, abuse spikes, and transactional discrepancies that affect reconciliation and financial integrity. • Define the integration strategy between customer-facing monetization services and enterprise applications, including Salesforce and Workato, with a focus on durable API contracts and reliable financial workflows. • Drive an AI-native engineering approach that uses machine learning and AI-enabled tooling to surface anomalies, improve infrastructure operations, and inform delivery planning. • Hire and grow a high-performing team, setting expectations for ownership, quality, and collaboration from the ground up. • Guide architecture and implementation discussions, helping the team make sound tradeoffs across reliability, availability, data model quality, and developer experience. • Partner with cross-functional stakeholders across engineering, product, finance, customer service, and go-to-market teams to coordinate changes in revenue-impacting systems with care and clarity. • Establish incident management practices, rollout safeguards, and operational standards for systems handling customer-facing and financially critical workflows.
Job Requirements
- Experience managing product, platform, or development teams and building values-aligned environments where team members can grow and deliver meaningful results.
- Strong technical judgment across architecture, system design, and pragmatic delivery for complex integrations and business-critical workflows.
- Experience working on purchasing, billing, provisioning, subscription, commerce, or other revenue-impacting systems.
- Knowledge of observability-first architectures, including tools such as Prometheus, Grafana, and modern incident management tooling.
- Experience defining service level indicators and targets, writing runbooks, and coordinating incident management in operationally rigorous environments.
- Strong collaboration and written communication skills in asynchronous, handbook-first teams that work across engineering, product, finance, customer service, and go-to-market functions.
- Ability to translate complex technical and product issues into clear, actionable language for both technical and non-technical audiences.
- Sound judgment when handling confidential or financial information, customer-facing incidents, and changes that can affect revenue, with openness to transferable experience from related domains.
Benefits
- Benefits to support your health, finances, and well-being
- Flexible Paid Time Off
- Team Member Resource Groups
- Equity Compensation & Employee Stock Purchase Plan
- Growth and Development Fund
- Parental Leave
Related Guides
Related Categories
Related Job Pages
More Engineering Manager Jobs
Engineering Project Manager
Core4ceCore4ce is a data-driven national security partner based in Arlington, Virginia, focused on advancing research and development, delivering innovative technology solutions, and prot
Role Description Core4ce is looking to hire a Project Manager to support a mission critical project for the Military Health Systems under the Defense Health Administration. - Communicate daily with Engineers, Engineering Leadership and Senior Leadership. - Thoroughly understand and communicate day-to-day status of current projects to appropriate stakeholders. - Create a plan, set goals and milestones; communicate timelines and progress updates to all stakeholders. - Track progress and review project tasks to make certain deadlines are met appropriately. - Record and distribute meeting minutes. - Set-up meetings, and coordinate calendar requests. - Conduct regular status meetings with all stakeholders, keeping the stakeholder’s needs and requirements continuously in view. - Maintain project schedule updates. - Maintain and update data in the Program Management Tool (Redmine) via imports/exports and direct entry. - Manage cost analysis, budgets, and engineering bill of materials (EBOM). - Analyze data and prepare reports and dashboards for management use. - Review and summarize data in Microsoft Excel and PowerPoint. - This position is designed to be flexible, with responsibilities evolving to meet business needs and enable individual growth. Qualifications - Secret Clearance Preferred or ability to obtain. - Experience with Microsoft Excel and PowerPoint. - Experience collecting, summarizing and analyzing data. - Must be able to obtain and maintain government eligibility requirements. - Strong customer service and interpersonal skills. - Excellent presentation & communication skills in both oral and written form. Requirements - Ability to communicate complexity in simple and effective ways, including appropriate context based on audience. - Forge strong working relationships. - Desire to keep learning and developing, and always seek ways to improve the outcomes you’re achieving. - Adaptable to new technology and software. - Naturally detail oriented and require organization to thrive. - Capable of managing multiple projects simultaneously. - Problem solver who is comfortable tackling any challenge thrown your way. - Self-driven, requiring minimal guidance and direction to achieve outcomes. - Provides open and honest feedback and are willing to speak your mind. - Make dynamic decisions to assist resources at client sites, mediate internal conflicts, and manage client escalations. - Put the customer first and always willing to do what is right, for your customer, for the business and for your team. - Experience using software tools for functions like Documents (Google Docs, MS Office), Project Management (Redmine), and Expenses (Spreadsheets). Benefits - 401(k) with 100% company match on the first 6% deferred, with immediate vesting. - Comprehensive medical, dental, and vision coverage—employee portion paid 100% by Core4ce. - Unlimited access to training and certifications, with no pre-set cap on eligible professional development. - Tuition assistance for job-related degrees and courses. - Paid parental leave, PTO that grows with tenure, and generous holiday schedules. - Got a big idea? At Core4ce, The Forge gives every employee the chance to propose bold innovations and help bring them to life with internal backing. - Join us to build a career that matters—supported by a company that invests in you.
Role Description We are seeking an expert technical leader to drive the Ford Container-as-a-Service (CaaS) platform, hosting OpenShift Virtualization (OSV), responsible for building, operating, and evolving secure, scalable, and automated Kubernetes infrastructure primarily on-premise. In this role, you will lead the design and delivery of a unified, self-service Kubernetes platform that empowers Ford development teams to efficiently deploy and manage production-grade container clusters hosting a combination of containers and virtualized machines. You will shape the future of Ford’s cloud-native infrastructure strategy, fostering innovation, optimizing operational excellence, and enabling seamless developer experience through industry-leading container orchestration and automation practices. You will work hand-in-hand with the GCP focused Container as a Service group. - Leadership and Team Management: - Lead and inspire the Kubernetes Platform Services team focused on delivering secure, scalable, and automated container orchestration at Ford. - Develop and execute strategic initiatives to optimize Kubernetes cluster deployment and operational efficiency. - Drive the team to consistently achieve performance goals and high-quality service delivery. - Technical Architecture and Oversight: - Own the end-to-end design, deployment, and lifecycle management of Kubernetes clusters, ensuring seamless integration across Ford’s on-premise environments. - Guarantee the clusters meet stringent requirements for availability, scalability, security, and resource efficiency. - Cross-Functional Collaboration and Enablement: - Partner deeply with application developers, platform engineers, and infrastructure teams to understand needs and provide expert Kubernetes guidance and troubleshooting support. - Foster a culture of collaboration, knowledge sharing, and continuous learning within the team and across Ford’s technical organization. - Innovation, Standardization, and Continuous Improvement: - Stay abreast of cutting-edge Kubernetes developments and cloud-native best practices. - Lead the adoption of innovations that improve deployment automation, cluster lifecycle management, security compliance, and observability. - Establish and enforce standardized processes to ensure consistent, reliable, and secure Kubernetes operations organization-wide. - Developer Experience: - Focus on developer/consumption experience as a first-class citizen through simplification of environment provisioning and application deployment. Qualifications - 8+ years of overall Software Engineering / IT Experience - Bachelor's Degree in Computer Science, Software Engineering, Information Technology, or a closely related technical discipline. - Extensive Kubernetes Expertise: Proven experience managing, deploying, and scaling Kubernetes clusters in production environments, ideally across hybrid cloud and on-premises data centers. - Cloud Platform Proficiency: Strong working knowledge of one or more public cloud platforms (preferably Google Cloud Platform, AWS, or Azure) including container native services, networking, IAM, and storage integrations. - Experience with Service Mesh Technologies: Hands-on experience with Istio, Linkerd, or other service mesh platforms to enhance security, observability, and traffic management. - Strong Background in Monitoring & Observability Tools: Experience with Prometheus, Grafana, Jaeger, Dynatrace/Datadog, ELK stack, or Google Stackdriver for end-to-end observability and proactive incident detection. - Experience Leading Large-Scale Automation Initiatives: Past success automating infrastructure provisioning, deployments, and operational tasks using advanced CI/CD tooling integrated with Kubernetes platforms. - Infrastructure Automation and CI/CD: Hands-on experience with Infrastructure as Code tools (e.g., Terraform, Kustomize), Kubernetes operators, and continuous integration/deployment pipelines for containerized applications. - Security and Compliance: Solid understanding of cloud-native security best practices, container security, and compliance frameworks relevant to enterprise IT environments. - Strong Stakeholder Engagement: Demonstrated ability influencing at senior management levels and driving enterprise-wide adoption of platform standards and best practices. Benefits - Immediate medical, dental, and prescription drug coverage - Flexible family care, parental leave, new parent ramp-up programs, subsidized back-up child care and more - Vehicle discount program for employees and family members, and management leases - Tuition assistance - Established and active employee resource groups - Paid time off for individual and team community service - A generous schedule of paid holidays, including the week between Christmas and New Year’s Day - Paid time off and the option to purchase additional vacation time.
Young Apprentice in Software Engineering
AppmaxO time da Appmax é feito por pessoas que encantam os parceiros e valorizam a entrega de resultados. Trabalham lado a lado para ampliar o potencial dos negócios digitais e desta forma, maximizam o seu desenvolvimento profissional. Em um ambiente inovador e colaborativo, impulsionado pelos nossos valores e atitudes, estamos revolucionando o mercado digital por meio da nossa plataforma de pagamentos online.
Role Description Vem ser um Jovem Aprendiz em Engenharia de Software! - Aprendizado constante em um ambiente inovador; - Experiência prática que vai turbinar o seu currículo; - Integração com uma equipe incrível; Qualifications - Idade entre 16 e 24 anos; - Cursando ou concluído o Ensino Médio; Benefits - Bolsa auxílio no valor de R$ 786,95; - Vale Alimentação; - Gympass; - Vale Transporte; - Auxilio home office no valor de R$ 130,00; Company Description O time da Appmax é feito por pessoas que encantam os parceiros e valorizam a entrega de resultados. Trabalham lado a lado para ampliar o potencial dos negócios digitais e desta forma, maximizam o seu desenvolvimento profissional. Em um ambiente inovador e colaborativo, impulsionado pelos nossos valores e atitudes, estamos revolucionando o mercado digital por meio da nossa plataforma de pagamentos online.
• Manage an engineering team of 3-6 infrastructure and software engineers responsible for the technical ownership of our Government Cloud offering. • Support the growth and development of engineers across all levels of experience through regular feedback, coaching and mentorship. • Work with colleagues across GTM, CS, and Security to plan and prioritize the projects that get us to authorization and keep us there, helping break them down into manageable tasks and milestones. • Facilitate useful communication between your team, other Product & Engineering teams, and other departments at Tines, acting as a key stakeholder in product features beneficial to our Public Sector customers. • Enable your team to act as the ultimate point of escalation for technical issues in our government environments, including directly engaging with Public Sector prospects and customers where necessary. • Contribute to technical discussions, while preserving autonomy for engineers. • Help make our engineering processes effective and efficient, including the change control and review practices a compliance-regulated environment depends on. • Work on attracting and recruiting great engineers to our team. • Foster a culture of inclusion, ambition, and collaboration. • Maintain technical familiarity through direct work on smaller, lower-priority engineering tasks.


