Mid Cloud Observability Engineer
Location
Brazil
Posted
1 day ago
Salary
0
Seniority
Senior
Job Description
Mid Cloud Observability Engineer
Experian
• A cloud observability engineer’s day is about making complex systems understandable, improving signal quality, and enabling faster, smarter debugging across teams. • Check system health, review alerts/incidents. • Triage alerts • Investigate issues • Improve observability instrumentation • Build and improve dashboards • Alert optimization • Work with development teams and other engineering partners. • Continuous improvements. • Release support. • Design and implement observability frameworks using metrics, logs, and distributed tracing • Develop dashboards, alerts, and visualizations to monitor system health • Standardize observability practices across engineering teams (logging, telemetry, tracing) • Implement and manage native monitoring tools. • Build alerting systems (avoid alert fatigue) • Participate in on-call rotations • Build intelligent alerting using Improve System Reliability/ proactively reduce risk • Identify reliability risks to help harden systems against failure • Reduction in alert noise / false positives • Increased observability coverage (% of services instrumented) • Improved SLO compliance • Embed observability into applications • Add tracing/metrics into code • Standardize logging formats • Ensure all services are observable end-to-end
Job Requirements
- SRE, DevOps, or Cloud Engineering
- Cloud platforms AWS
- Experience in AWS services including CloudWatch, X-Ray, Lambda ECS/EKS, API Gateway RDS, DynamoDB, S3
- Hands-on with experience with an observability tool Dynatrace Splunk Datadog OpenTelemetry Prometheus, Grafana AWS Distro for OpenTelemetry
- Strong understanding of: Containers & orchestration (Docker, Kubernetes)
- CI/CD pipelines
- Infrastructure as Code (Terraform, CloudFormation)
- Nice to have Experience building observability platforms at scale in AWS
- Familiarity with multi-account AWS environments
- Experience with cost optimization for observability (logging/metrics ingestion)
- Experience in high-scale distributed system
Benefits
- Health insurance
- Professional development opportunities
Related Guides
Related Categories
Related Job Pages
More Cloud Engineer Jobs
• Lead development of data solutions, pipelines, and API-driven database components • Drive high‑volume data processing and enterprise data architecture efforts
• You will be a part of the team accountable for design, model and development of whole AWS data ecosystem for one of our Client’s. • Involvement throughout the whole process starting with the gathering, analyzing, modelling, and documenting business/technical requirements will be needed. The role will include direct contact with clients. • Modelling the data from various sources and technologies. Troubleshooting and supporting the most complex and high impact problems, to deliver new features and functionalities. • Designing and optimizing data storage architectures, including data lakes, data warehouses, or distributed file systems. Implementing techniques like partitioning, compression, or indexing to optimize data storage and retrieval. Identifying and resolving bottlenecks, tuning queries, and implementing caching strategies to enhance data retrieval speed and overall system efficiency. • Identifying and resolving issues related to data processing, storage, or infrastructure. Monitoring system performance, identifying anomalies, and conducting root cause analysis to ensure smooth and uninterrupted data operations. • Train and mentor less experienced data engineers, providing guidance and knowledge transfer.
Role Description Looking for a Senior Azure & Data Factory Developer for a permanent role by way of 6 month contract-to-hire. This requires a powerhouse SQL/ADF expert who brings sharp thinking, high energy, and enterprise‑grade data engineering to every project. - Lead development of data solutions, pipelines, and API-driven database components - Drive high‑volume data processing and enterprise data architecture efforts Qualifications - 5+ years hands-on SQL development with stored procedures and large‑scale batch processing - Proven experience designing and delivering enterprise data platforms - Building and maintaining Azure Data Factory pipelines - Creating and deploying database‑hosted APIs - Data analysis, transformation, and ETL expertise Requirements - Broader experience across cloud data ecosystems - Additional exposure to advanced data modeling or automation tooling - Familiarity with modern data integration patterns Benefits - Duration: 6 month contract to hire - Pay rate: $55-60/hour on W2 - Salary upon conversion is $115,000 - 100% remote, EST or CST hours
Role Description This position is in the Department of the Chief Information Office (DCIO), Infrastructure Services Division (ISD). The incumbent serves as a Senior Hybrid Cloud Engineer and Subject Matter Expert (SME) for the engineering, implementation, and Tier 2/3 support of the organization's integrated hybrid infrastructure. The incumbent is the technical authority for bridging these environments, ensuring seamless interoperability, high availability, and the extension of Zero Trust Architecture (ZTA) from the local data center to the public cloud. The incumbent plays a pivotal role in managing the total lifecycle of virtualized assets, from on-premises hypervisors to cloud-native instances, ensuring a unified security and operational posture. - Engineering and managing enterprise on-premises virtualization platforms (e.g., VMware vSphere, Microsoft Hyper-V, or Nutanix AHV) to support mission-critical legacy and modern workloads. - Architecting and maintaining hybrid cloud connectivity using technologies such as Azure Arc, AWS Outposts, or Google Anthos to create a unified management plane across local and public data centers. - Leading the migration and refactoring of workloads from on-premises virtual environments to Azure, AWS, GCP, or OCI using "lift-and-shift," "re-platforming," or "cloud-native" strategies. - Managing software-defined data center (SDDC) components, including virtualized networking (NSX/AVS) and storage (vSAN), to ensure high performance and hardware abstraction. - Automating hybrid infrastructure delivery through Infrastructure as Code (IaC) tools like Terraform and Ansible to provision resources consistently across on-premises and multi-cloud providers. - Designing cross-environment Disaster Recovery (DR) and Backup solutions that utilize cloud storage (e.g., S3, Azure Blob) as targets for on-premises virtual machine snapshots and stateful data. - Extending Zero Trust security models to the on-premises perimeter by implementing micro-segmentation, identity-based access, and consistent firewall policies across the hybrid fabric. - Optimizing compute resource allocation by monitoring hypervisor contention on-premises and right-sizing elastic instances in the public cloud to balance cost and performance. - Performing Tier 3 diagnostic analysis on complex "gray failure" issues, such as latency between local virtual switches and cloud VPCs, or synchronization errors in hybrid storage arrays. - Managing the integration of hyper-converged infrastructure (HCI) with cloud-native APIs to facilitate seamless resource scaling during peak demand periods. - Mentoring technical staff on the convergence of traditional systems administration and cloud-native engineering, developing guides for hybrid operational excellence. - Ensuring federal compliance (NIST SP 800-53) across the entire stack, maintaining security controls for both physical host hardware on-premises and ephemeral resources in the cloud. - Performing other duties as assigned. Qualifications - Attention to Detail: Is thorough when performing work and conscientious about attending to detail. - Customer Service: Works with clients and customers to assess their needs, provide information or assistance, resolve their problems, or satisfy their expectations. - Oral Communication: Expresses information effectively, considering the audience and nature of the information; makes clear and convincing oral presentations. - Problem Solving: Identifies problems; determines accuracy and relevance of information; uses sound judgment to generate and evaluate alternatives, and to make recommendations. Requirements - Applicants must have at least one full year (52 weeks) of specialized experience which is in or directly related to the line of work of this position. - Experience as a Subject Matter Expert in on-premises virtualization (VMware/Hyper-V) and multi-cloud engineering (Azure/AWS/GCP/OCI). - Managing highly technical projects as directed supporting the Cloud and VDI capability reporting status and compliance as required. - Coordinating and maintaining relationships with all necessary internal and external groups supporting related services. - Expertise in automating infrastructure across local and public data centers. Benefits - Familiarity with the Federal Judiciary and AO policies is a plus. Certifications - VMware Certified Professional (VCP) - Azure solutions architect - AWS solutions architect - GCP Professional cloud architect - OCI solutions architect

