Job Closed
This listing is no longer active.
Together, delivering life changing therapies. Let's talk future.
Observability Engineer
Location
Pennsylvania
Posted
150 days ago
Salary
0
Seniority
Senior
Job Description
Observability Engineer
PCI Pharma Services
• Lead the enterprise-wide migration from SolarWinds to Dynatrace, including architecture design, agent deployment, and dashboard development • Design and implement comprehensive monitoring coverage for servers, network devices, applications, databases, and cloud resources across 16 global sites • Develop custom dashboards, alerts, and automated remediation workflows aligned with operational KPIs • Establish baseline metrics and anomaly detection rules for proactive incident identification • Integrate observability platform with ServiceNow for automated incident creation and enrichment • Configure monitoring for IT/OT environments including manufacturing systems, SCADA, and industrial control systems • Implement synthetic monitoring for critical business applications and user experience tracking • Design log aggregation and correlation strategies for security event monitoring in coordination with SECURE team • Create runbooks and standard operating procedures for alert response and escalation • Provide 24x7 monitoring strategy and coordinate with global follow-the-sun operations team • Integrate backup monitoring via Veeam reporting and alerting for RPO/RTO compliance visibility • Optimize monitoring costs through efficient data retention policies and license management • Train operations staff on platform usage, dashboard interpretation, and alert response procedures
Job Requirements
- Bachelor's degree in Computer Science, Information Technology, or related field
- 5+ years of experience in infrastructure monitoring and observability
- Hands-on experience with Dynatrace including OneAgent deployment, Davis AI, and dashboard development
- Strong experience with SolarWinds (NPM, SAM, VMAN) for migration planning
- Proficiency in monitoring network infrastructure (Cisco switches, routers, firewalls)
- Experience monitoring VMware vSphere environments
- Knowledge of cloud monitoring for Azure and AWS workloads
- Strong scripting skills (PowerShell, Python, Bash) for automation
- Understanding of SNMP, WMI, API-based monitoring approaches
- Experience with log management and SIEM integration
Benefits
- Equal Opportunity/Affirmative Action Employer
- Quality and operational excellence
- Industry-leading customer experience
- Fair and competitive rewards program
- Professional development opportunities
Related Guides
Related Categories
Related Job Pages
More Engineer Jobs
Senior Cloud Data Engineer
EYBuilding a #BetterWorkingWorld by providing trust through assurance and helping organizations grow, transform & operate.
• Develop and maintain robust, scalable data pipelines in cloud environments (Azure). • Design and maintain relational and dimensional data models, with schemas optimized for analytical consumption. • Collaborate with analysts and business stakeholders to ensure the delivery of reliable, well-structured data. • Ensure data governance, quality, and security throughout the entire process.
Release Train Engineer – RTE
CRODUCRODU is a full-stack JavaScript software house. We deliver fresh, creative, and results-driven solutions to our clients
• Orkiestracja „pociągu” (ART): Facylitacja kluczowych wydarzeń (PI Planning, System Demos, Inspect & Adapt) • Wsparcie organizacji w przejściu na model SAFe (współpraca z wewnętrznym SAFe Agentem) • Budowa struktur od zera: współtworzenie nowego zespołu (ART) i wdrażanie standardów SAFe w organizacji, która przechodzi transformację • Zarządzanie zależnościami: Identyfikacja i rozwiązywanie krytycznych punktów styku między modułami SAP a systemami zewnętrznymi • Aktywne wspieranie zespołów w eliminowaniu eskalowanych przeszkód/ blokad projektowych, zarządzanie ryzykiem i dbanie o to, by zespoły mogły dowozić bez zbytecznych problemów • Współpraca z biznesem: ścisły kontakt z Dyrektorem Wykonawczym oraz Business Ownerami w celu zapewnienia przewidywalności dostarczania wartości
Telecom Engineer II
Cleveland ClinicYour source for health news, tips and information from one of the nation’s top hospitals.
• Provide engineering of enterprise-wide telecommunication systems including evaluation, design, provisioning, and implementation of technology solutions and services • Serve as an architect for new telephony systems and buildout constructs • Ensure system performance, availability, and reliability of deployed technology/services • Act as an escalation point on complex technical problems and support issues providing problem diagnostic and consulting services • Provide project budgeting estimates to include scope of work and contract reviews • Communicate production requirements for on-going maintenance and support • Provide design requirements and configuration of new and/or upgraded telecommunications hardware and software • Coordinate the installation of hardware, Operating System, applications software, and peripherals as required • Develop and communicate technical environmental requirements related to physical space, system security, power, HVAC, etc. • Participate in project and change control meetings concerning the physical placement and installation of telecommunication solutions • Document all related activities using a standard Project Book • Provide for the design, planning and scheduling of integrating additional hardware, OS upgrades, or enhancements into production systems
Sustaining Engineer
VeevaHeadquartered in Pleasanton, California, Veeva is a leading provider of cloud-based software and services for the life sciences industry. As an employer, Veeva
• Troubleshoot critical customer reported issues in the Vault CRM products • Fix bugs in the Vault CRM code • Learn everything about the Vault CRM modules and use that knowledge to ensure our customers are successful • Build tools to help with fixing and troubleshooting product issues • Collaborate with the Development, Support, Product, and QA teams to diagnose, troubleshoot and resolve complex customer issues • Identify problems that may impact multiple customers




