Nadia Bakhurji Architectural & Engineering Consultants (NBA)
Senior Site Reliability Engineer, Messaging Services
Location
New Jersey
Posted
41 days ago
Salary
$140K - $150K / year
Seniority
Senior
Job Description
Senior Site Reliability Engineer, Messaging Services
NBA
• Own reliability and operational excellence for messaging and collaboration platforms. • Serve as a senior escalation point for complex issues across Proofpoint, Exchange, Outlook, SMTP, and various collaboration services. • Lead incident response, service restoration, and root cause analysis in high-pressure, real-time scenarios. • Design and operate highly available, secure, and compliant messaging architectures. • Automate operations using PowerShell and Microsoft Graph. • Administer and support Slack and Microsoft Teams, including tenant/workspace configuration and escalations. • Provide white-glove, high-touch support for the NBA Executive Leadership Team, requiring professionalism, discretion, and urgency. • Partner with Security, Legal, and Audit teams on compliance, retention, and eDiscovery. • Participate in a formal rotating on-call schedule. Provide after-hours, weekend, and holiday support.
Job Requirements
- A bachelor's degree in computer science or related technical field
- 10+ years of experience in enterprise messaging, infrastructure, or reliability engineering
- Deep expertise in Microsoft Exchange Online / M365 and hybrid messaging environments
- Strong experience with email security (DMARC, DKIM, SPF, ARC; Proofpoint a plus)
- Advanced PowerShell automation skills
- Experience supporting Slack enterprise environments
- Proven ability to support executives and business-critical operations with urgency and precision
- Calm, decisive, and effective during live incidents and high-visibility NBA events
Benefits
- medical
- dental
- vision
- life/AD&D insurance
- short- and long-term disability
- fertility and family-forming assistance
- wellbeing allowance
- educational assistance
- mental health coaching/therapy
- tax advantaged accounts such as HSA and healthcare/dependent care FSAs
- a 401(k) retirement plan
- time off benefits that include vacation, sick time, and personal days
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
Role Description We are hiring a DevOps Engineer to own the deployment, infrastructure, and operational reliability of OpenFn across our implementation portfolio. - The primary focus of this role is on-premise and local infrastructure deployments for government clients. - Support and optimize cloud-hosted OpenFn deployments, primarily on GCP and AWS, and occasionally Azure. - Work directly with government IT counterparts and guide implementation engineers on infrastructure decisions. - Build internal standards and tooling for consistent deployment across various environments. What you’ll be doing - Build World-Class Deployment, Monitoring, and Instance-Maintenance Tooling: - Develop and maintain DevOps tooling (Ansible, Terraform, custom CLI programs), deployment runbooks, configuration templates, and documentation. - Deliver On-Premise and Local Deployments: - Lead and execute OpenFn deployments on government and ministry-managed infrastructure. - Configure and maintain containerized deployments using Docker, Docker Compose, and Docker Swarm. - Work directly with government IT teams to navigate local infrastructure constraints. - Troubleshoot infrastructure and runtime issues in the field. - Cloud Infrastructure: - Maintain and optimize OpenFn deployments on GCP, AWS, and occasionally Azure. - Implement and maintain CI/CD pipelines for services team deployments. - Monitor system performance, set up alerting, and respond to infrastructure incidents. - Advise implementation teams on cloud architecture decisions and cost optimization. - Internal Standards and Enablement: - Build and maintain internal DevOps standards, deployment guides, and infrastructure-as-code templates. - Contribute to pre-sales and scoping conversations by advising on infrastructure feasibility. - Work closely with the Principal Solutions Architect and the CTO to ensure deployment strategy alignment. Qualifications - 7+ years of experience in DevOps, infrastructure, or SRE roles. - Strong hands-on experience with Docker and containerization; experience with Docker Compose in production is essential. - Solid experience with at least one major cloud provider, GCP & AWS strongly preferred, with working knowledge of Azure. - Experience deploying and maintaining applications on on-premise or sovereign-hosting infrastructure. - Proficiency with CI/CD tools and pipelines (GitHub Actions, GitLab CI, or similar). - Experience with infrastructure-as-code tools such as Terraform or Ansible. - Strong Linux systems administration skills, including networking, storage, and security hardening. - Excellent written and verbal English communication skills. - Ability to work from home or a shared office space with fast, stable internet. Requirements - Strong advantage for experience with Kubernetes in production environments. - Familiarity with system monitoring and observability tooling (Prometheus, Grafana, or similar). - Background working with NGO technology teams, government digital transformation programs, or digital public goods deployments in Africa or Asia. - Working proficiency in French or Spanish. Benefits - Financial compensation is commensurate with experience. - Flexible working schedule. - Fully remote work with team meetups across Europe and Africa. - Exposure to industry and technology trends and leading health and humanitarian interventions. - Opportunity for leadership and advancement. - Opportunity to shape strategy and impact millions of lives through open source software.
• Own DevOps architecture, CI/CD pipelines, and IaC (CDK/CloudFormation) • Lead AWS services setup (Amazon Connect, IAM, DynamoDB, Lex bots, Lambda integrations) • Manage multi-environment deployments and cross-account pipelines • Lead and mentor DevOps team across sprints and deliveries • Troubleshoot infrastructure, application, voice, SIP, and platform‑related issues • Collaborate with engineering, architecture, QA, and analytics teams during incidents and releases • Integrate security best practices ("secure-by-default") into the pipeline, including access control and vulnerability scanning • Implement monitoring (CloudWatch, X-Ray) and incident management • Drive automation, testing, and environment stability
DevOps & Test Automation Engineer
MWDNMWDN connects exceptional tech talent with leading companies across Israel, the USA, Great Britain, and Western Europe. We aim to ensure our employees enjoy a rewarding and secure experience while collaborating with prestigious international clients. MWDN is ranked among the top 5 IT employers in our region by DOU, and we pride ourselves on our transparency and commitment to our team.
Role Description Join a forward-thinking technology company building intelligent systems that transform how people interact with digital experiences. Focused on the next generation of AI-driven applications, the company develops advanced solutions powered by large language models and real-time decision-making to create responsive, adaptive platforms. If you’re passionate about building cutting-edge AI systems and want to help shape technology that can reason, adapt, and communicate instantly, this is an opportunity to contribute to a platform redefining intelligent interaction. Qualifications - Strong DevOps background with hands-on experience in production systems - Deep understanding of automated testing strategies (unit, integration, E2E, performance) - Experience building testing infrastructure, not just writing tests - Strong experience with GCP (Cloud Run, GKE, networking, IAM, load balancing) - Experience with CI/CD systems (GitHub Actions, Cloud Build, etc.) - Solid backend engineering skills (Python preferred) - Experience with database management (MongoDB, Postgres), including cloning and seeding strategies - Experience with containerization (Docker) and environment orchestration - Strong understanding of system reliability, observability, and debugging - Experience working in fast-paced environments with high ownership - Upper-intermediate+ level of English Requirements - Experience with frontend testing frameworks (Playwright, Cypress) - Experience with performance/load testing tools - Experience with test data generation and synthetic datasets - Familiarity with AI/LLM-based systems and testing challenges - Experience with security and compliance testing (SOC2, etc.) Benefits - People-oriented management without bureaucracy - The friendly climate inside the company is confirmed by the frequent comeback of previous employees - Flexible working schedule - Full financial and legal support for private entrepreneurs - Free English classes with native speakers or with Ukrainian teachers (for your choice) - Dedicated HR
Role Description Buscamos uma pessoa Analista DevOps Sênior para atuar em ambientes críticos, liderando iniciativas de automação, modernização e evolução da nossa plataforma. Se você gosta de construir soluções resilientes, escalar sistemas e impactar diretamente a eficiência do negócio, essa oportunidade é pra você! 🚀 - Desenhar, implementar e operar plataformas escaláveis e resilientes em cloud (AWS); - Liderar iniciativas de automação e modernização de infraestrutura; - Gerenciar e evoluir ambientes Kubernetes (EKS); - Construir e otimizar pipelines de CI/CD e práticas de GitOps; - Garantir observabilidade (métricas, logs e tracing) e confiabilidade dos sistemas; - Atuar com segurança em cloud (DevSecOps, IAM, secrets, hardening); - Monitorar performance, disponibilidade e custos (FinOps); - Atuar na resolução de incidentes e melhoria contínua da plataforma. Qualifications - Experiência sólida em DevOps/SRE em ambientes críticos de produção; - Domínio de AWS (EC2, RDS, VPC, IAM, EKS, ALB, CloudFront, API Gateway); - Experiência com Infrastructure as Code (Terraform); - Vivência com CI/CD (GitHub Actions, GitLab CI, CodeBuild ou similares); - Experiência com GitOps (ArgoCD, Flux ou similares); - Conhecimento em observabilidade (Datadog, Prometheus, Splunk ou similares); - Práticas de segurança em cloud (DevSecOps, IAM, least privilege, secrets); - Forte capacidade analítica, organização e visão de arquitetura. Requirements - Experiência com Backstage ou developer portals; - Conhecimento em FinOps (otimização de custos em cloud); - Experiência com gestão de incidentes (on-call, postmortem, runbooks); - Conhecimento em redes e segurança em cloud (VPC design, WAF, policies); - Certificações AWS ou Kubernetes (CKA). Company Description

