We are a global, diverse team of cloud experts building the next generation of cloud solutions.
Future Openings - SRE Support Engineer - Observability
Location
United States
Posted
94 days ago
Salary
0
Job Description
Future Openings - SRE Support Engineer - Observability
Virtasant
SRE Support Engineer - Observability While this position is not currently open, we are interviewing strong candidates for upcoming opportunities on this team. Location: Remote | Time Zone: (US, Canada, Brazil, Chile, Colombia, Mexico) (8AM–5PM Pacific) Freedom to grow. Power to deliver. Virtasant is a global technology services company delivering large-scale cloud, data, and engineering solutions across 130+ countries. We partner with some of the world’s largest organizations to help them build, operate, and scale internal platforms used by tens of thousands of engineers. For this role, you will be supporting one of the most advanced internal developer platforms in the world, powering products used by hundreds of millions of people. The problems you will solve are deep, complex, and essential to keeping a global-scale organization moving. Role Overview The Observability & Tools Support Engineer provides high-impact technical support for customers of a large technology company’s internal IaaS platform, with a focus on monitoring, alerting, telemetry, and operational tooling. This role spans a wide range of support—from white-glove onboarding and end-to-end customer enablement, to deep technical troubleshooting across Linux, networking, and observability systems (especially Prometheus and AlertManager). You will also contribute to improving the support function itself: strengthening tooling, documentation, workflows, and feedback loops so the service scales. Success depends on excellent troubleshooting, strong written communication, comfort working with highly technical customers, and the maturity to identify patterns and drive operational improvements beyond individual ticket resolution. Business Outcome Become a trusted frontline expert for the customer’s observability ecosystem and operational tooling - delivering fast, accurate support across Slack and tickets, improving monitoring reliability, and reducing incident impact through better triage, troubleshooting, onboarding, and knowledge capture. Success Measures - Healthy volume of threads and tickets handled with high-quality outcomes - Consistent achievement of time-based SLAs - High customer satisfaction through surveys - Accurate classification of issue type, severity, and recurring patterns - Reduced repeat issues through better docs, tooling, and scalable onboarding What Will Be True When You Succeed - Customers can onboard smoothly to monitoring/alerting with minimal friction - Monitoring and alerting issues are resolved quickly, with fewer escalations - Linux and networking-related incidents reach resolution faster due to strong troubleshooting and clean handoffs - Engineering and SRE teams receive clear, actionable feedback based on real customer trends - Knowledge base content prevents tickets and accelerates self-service Core Work Units 1) Frontline Support for Observability & Tooling - Manage Slack threads and tickets (roughly 50/50) - Handle a broad range of customer support: simple issue resolution through end-to-end onboarding - Provide clear, structured guidance to highly technical customers - Maintain strong attention to detail while managing multiple interactions in parallel 2) Deep-Dive Troubleshooting & Incident Support - Troubleshoot, isolate, and resolve monitoring and alerting issues (especially Prometheus + AlertManager) - Troubleshoot complex Linux and networking issues (TCP/IP fundamentals required) - Support OpenTelemetry, tracing, and telemetry pipelines, including investigation of gaps in signals and instrumentation - Drive incidents to resolution in partnership with Engineering/SRE teams 3) Documentation & Knowledge Development - Build and maintain customer-facing and internal knowledge base articles - Create informational posts for the community support platform - Turn repeated issues into reusable guides, checklists, and onboarding playbooks 4) Trend Analysis & Feedback to Engineering - Analyze and categorize customer interaction trends - Provide accurate, meaningful feedback to Engineering and SRE orgs to improve product/tooling - Identify “top offenders” and propose practical fixes (tooling, docs, process, product) 5) Operational Excellence & Continuous Improvement - Participate in post-mortem reviews and drive follow-through on improvements - Contribute meaningfully to team objectives and goals (process, tooling, and service scaling) - Bring creativity and discretion to resolve highly complex issues “outside the box” High-Quality Work - what top performance looks like Frontline Support - Moves smoothly from triage to deeper analysis without losing the customer - Communicates clearly and confidently with technical users - Maintains clean follow-ups and thread hygiene even with high context switching Troubleshooting - Rapidly isolates issues across monitoring/alerting configs, Linux runtime behavior, and network connectivity - Uses structured approaches to incident handling: hypothesis → test → evidence → resolution - Produces high-signal writeups that accelerate downstream resolution Documentation & Enablement - Documentation is clear enough that customers avoid opening tickets - Onboarding flows reduce time-to-value and prevent common misconfigurations - Captures “tribal knowledge” quickly and makes it reusable Operational Excellence - Obsessing over details: correct severity, accurate tagging, clean timelines, strong handoffs - Spots patterns early and proactively proposes improvements that scale support Typical Day / Work Patterns - ~50% Slack support, ~50% ticket handling - Deep-dive investigations during lower ticket volume periods - Documentation writing and lightweight tooling/process improvements when patterns emerge - Weekly team review of escalations, themes, and operational improvements - High rate of context switching and parallel issue management Required Skills & Experience (Non-Negotiable) - Several years supporting highly scalable applications and web services - Hands-on experience with open-source observability and cloud-native tooling, including: - Kubernetes (and container fundamentals) - Prometheus and AlertManager troubleshooting - OpenTelemetry and distributed tracing concepts - Strong understanding of the Linux operating system (command line, process/network debugging, logs) - Good understanding of infrastructure observability principles (signals, alerting strategy, SLO thinking, noise reduction) - Good understanding of the TCP/IP suite and practical networking troubleshooting - Strong experience troubleshooting ambiguous, multi-layer issues - Excellent analytical capability and strong attention to detail - Strong written and verbal communication (clear, structured, customer-friendly) - Comfortable working with a very technical customer base - Passion for Technical Support and a service mindset Nice-to-Haves - Experience improving or supporting internal support tooling or workflows (automation, templates, runbooks) - Experience operating at scale in a services environment (pattern detection, KPI/SLA awareness, operational process maturity) - Familiarity with Grafana, log aggregation, incident tooling, and production support practices - Prior SRE or platform support experience Minimum Qualifications - 3–7+ years in Technical Support Engineering, SRE support, DevOps, Platform Support, or similar - Demonstrated experience supporting distributed systems, IaaS, or cloud platforms - Strong Linux, troubleshooting, and customer-facing communication background - Evidence of documentation, knowledge-base contributions, and process improvement mindset Disqualifiers: weak Linux fundamentals, inability to troubleshoot systematically, poor written communication, or discomfort supporting highly technical users. What You’ll Love - Real technical problem solving with tangible customer impact - A role that blends deep troubleshooting with scaling support via docs, tooling, and process - High autonomy in a remote-first environment What May Be Challenging - High context switching and managing multiple threads in parallel - Repeated patterns that require discipline to convert pain into scalable improvements - Supporting high-visibility systems where speed and accuracy matter Differentiation Industry: Remote-first, trust-based culture; global team; autonomy; modern systems; meaningful technical challenges Internal: High-impact, customer-facing observability support; direct influence on tooling and process maturity; opportunity to shape scalable support practices
Related Guides
Related Categories
Related Job Pages
More Support Engineer Jobs
Customer Support Engineer- Tier 2 (Server)
CommvaultCommvault provides award-winning, intelligent data solutions and information management services that deliver backup and recovery for businesses and organizations. The company was
Recruitment Fraud Alert We’ve learned that scammers are impersonating Commvault team members—including HR and leadership—via email or text. These bad actors may conduct fake interviews and ask for personal information, such as your social security number. What to know: - Commvault does not conduct interviews by email or text. - We will never ask you to submit sensitive documents (including banking information, SSN, etc) before your first day. If you suspect a recruiting scam, please contact us at wwrecruitingteam@commvault.com About Commvault Commvault (NASDAQ: CVLT) is the gold standard in cyber resilience. The company empowers customers to uncover, take action, and rapidly recover from cyberattacks – keeping data safe and businesses resilient. The company’s unique AI-powered platform combines best-in-class data protection, exceptional data security, advanced data intelligence, and lightning-fast recovery across any workload or cloud at the lowest TCO. For over 25 years, more than 100,000 organizations and a vast partner ecosystem have relied on Commvault to reduce risks, improve governance, and do more with data. Commvault is world’s most powerful backup and recovery software in the cloud and on any infrastructure, helping companies transform their data into a powerful strategic asset. Commvault data protection and information management solutions enable companies and organizations of all sizes, in all industries, to protect, access and Share all their data- anywhere and anytime. As an organization, we committed to a great work culture that embraces our values and promote professional growth. Our vaulters are passionate innovators who work together to uncover new challenges that can be solved. We are proud that the focus of every vaulter is to drive our customer’s business forward. We’re all about getting the job done and having FUN doing it. As vaulters, we pride ourselves on transparency, integrity and respect in everything that we do. NOW is the time to join a growing company with strong roots, where you can take on your new challenge. We are looking for a Level 2 Support Engineer with a genuine passion for all things tech to join our expert and super-friendly Server team. Position Responsibilities - Working independently and in a team to come up with optimal solutions for customer problems - Strong written and verbal communication skills - Providing best-in-class phone based support for a variety of complex, time critical issues - Using and sharing your knowledge of a wide range of technologies. Main focus being on Windows Server operating systems, Networks, Hardware and Application troubleshooting via event logs, acquiring process crash dumps, analyzing network packets, making registry changes, understanding Windows firewall and VSS. - Working remotely on enterprise level customer sites and secure sites. - Recreation of problems in an internal lab environment and ability to provide Root Cause Analysis - Contributing to knowledge base and online forums - Alibility to multi-task with strong time management skills - Continuous professional development and maintaining technical expertise via training and certifications - When you join Commvault you do far more than move companies. You become a part of the Commvault family….a group of smart, quirky and inspirational women and men who love to tinker with technology, drive innovation, push the boundaries, strive for outstanding results and have fun in the process. - Our employees (Vaulters), often talk about our culture as inspiring, collaborative, rewarding, transparent and entrepreneurial. Commvault is a place where YOU really can be the best version of yourself. Position Requirements ***US based position*** - Position will require working with some federal customers - A bit of a technical whizz who loves to know and learn “how things work” - Enjoy troubleshooting and resolving complex problems and help customers across the globe fix their issues and make their day. - Real desire to provide extraordinary customer experience (that is why our APAC Support Team receives 98% customer satisfaction score!) - Support, Systems Administrator or Systems Engineer background - Advanced administration and troubleshooting skills in Windows Server environments - Proven experience of Networking and troubleshooting connectivity, Name Resolution, and performance based issues with OS and/or hardware - Understanding of Microsoft clustering technologies and experience with SAN Storage - Have experience with applications such as the following: - Microsoft SQL, 2008-2014 Installation, administration and troubleshooting, Basic SQL scripting - Working knowledge of VMware\Hyper-V - Microsoft Internet Information Services (IIS), Apache Tomcat Recognized as a leader in the Gartner Magic Quadrant for Data Centre Backup and Recovery Software, our industry’s definitive independent ranking. For the sixth straight year, Commvault has been named a leader. And this year we’re further on the “Completeness of vision” and highest on the “ability to execute”. Commvault offers its products through a broad array of distribution partner globally, while building upon its strong portfolio of strategic partnership with leading technology companies including Microsoft, Amazon web services, Cisco, Oracle, SAP, Nutanix, Pure, HP, Hitachi, NetApp & many others. Commvault’s global headquarter is in Tinton Falls NJ, with additional offices that support customers globally across the Americas, EMEA & APAC. Commvault is an equal opportunity workplace and is affirmative action employer. We are always committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital Status, disability, Gender identity or Veteran Status. We will not discriminate against based on such characteristic or any other status protected by laws or regulations in the location where we work. #LI-AM1 #LI-Remote Thank you for your interest in Commvault. Reflected below is the minimum and maximum base salary range for this role. At Commvault we use broad salary ranges in our job postings to reflect the diverse levels of expertise and experience among our candidates and is not reflective of the total compensation and benefits package. The specific salary offered will be determined based on your unique qualifications, including your relevant experience, skills, and the value you bring to the role. While the range provides a general idea of the compensation, it is important to note that placements within the range are not automatic and will be carefully considered to ensure a fair and competitive offer. We are committed to rewarding talent and experience. Pay Range $72,250—$140,300 USD Commvault is an equal opportunity workplace and is an affirmative action employer. We are always committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity or Veteran status and we will not discriminate against on the basis of such characteristics or any other status protected by the laws or regulations in the locations where we work. Commvault’s goal is to make interviewing inclusive and accessible to all candidates and employees. If you have a disability or special need that requires accommodation to participate in the interview process or apply for a position at Commvault, please email accommodations@commvault.com For any inquiries not related to an accommodation please reach out to wwrecruitingteam@commvault.com. Commvault's Privacy Policy
Remote Technical Support Agent
HopesglobalgetawaysHopes Global Getaways is a remote travel planning company powered by a full-service travel agency that specializes in destination weddings, honeymoons, cruises, family vacations, and luxury getaways. We partner with top global travel brands to design seamless, memorable travel experiences for clients worldwide. Our mission is to help travelers plan unforgettable vacations while offering flexible, remote opportunities for individuals who are passionate about travel and customer service.
Position Overview We are actively hiring motivated and customer-focused individuals for a Remote Technical Support Agent role. In this position, you will provide remote assistance to clients by handling travel-related inquiries, coordinating reservations, resolving booking issues, and ensuring accurate documentation. No prior travel industry experience is required. Comprehensive training and ongoing support are provided. Key Responsibilities - Respond to client inquiries via phone, email, and online platforms - Provide support for reservations, confirmations, updates, and changes - Research and verify travel options using approved systems - Troubleshoot booking-related issues and provide timely resolutions - Maintain accurate records and client documentation - Deliver professional, solution-oriented customer service Qualifications - Strong communication and customer service skills - Detail-oriented with strong organizational abilities - Comfortable working independently in a remote setting - Basic computer proficiency and reliable internet access - Must be at least 18 years of age - Legal eligibility to work in the U.S., U.K., Mexico, Australia, or Spain - English communication proficiency What We Offer - Fully remote position - Flexible scheduling options - Training and professional development - Incentive programs and performance-based perks - Opportunities for advancement
Technical Support Representative
HopesglobalgetawaysHopes Global Getaways is a remote travel planning company powered by a full-service travel agency that specializes in destination weddings, honeymoons, cruises, family vacations, and luxury getaways. We partner with top global travel brands to design seamless, memorable travel experiences for clients worldwide. Our mission is to help travelers plan unforgettable vacations while offering flexible, remote opportunities for individuals who are passionate about travel and customer service.
Overview We are currently hiring motivated individuals for a Remote Technical Support Representative position. In this role, you will assist clients with travel-related inquiries, booking support, reservation coordination, and issue resolution. No prior travel industry experience is required. Training and ongoing support are provided. What Youll Do - Respond to client inquiries via phone, email, and online platforms - Assist with travel bookings, reservations, confirmations, and updates - Research and compare travel options using approved systems - Troubleshoot booking issues and resolve service concerns - Maintain accurate records and documentation - Support clients with itinerary details and travel information What Were Looking For - Strong communication and customer service skills - Detail-oriented and organized - Comfortable working independently in a remote environment - Basic computer skills and reliable internet access - At least 18 years old - Legally authorized to work in the U.S., U.K., Mexico, Australia, or Spain - English proficiency What We Offer - 100% Remote position - Flexible scheduling - Training provided - Incentive programs and travel-related perks - Growth opportunities
Remote Support Agent
HopesglobalgetawaysHopes Global Getaways is a remote travel planning company powered by a full-service travel agency that specializes in destination weddings, honeymoons, cruises, family vacations, and luxury getaways. We partner with top global travel brands to design seamless, memorable travel experiences for clients worldwide. Our mission is to help travelers plan unforgettable vacations while offering flexible, remote opportunities for individuals who are passionate about travel and customer service.
Role Summary We are seeking a Remote Support Agent to serve as the primary point of contact for our clients, supporting travel coordination and ensuring booking details are accurate and complete. In this fully remote role, you will assist with itinerary organization, reservation management, and client communication to deliver smooth and reliable travel experiences. This position is ideal for detail-oriented individuals who enjoy problem-solving, customer service, and working independently in a virtual environment. No prior travel experience is required; training and tools are provided. Primary Responsibilities - Assist clients with organizing and coordinating personalized travel itineraries - Research, compare, and confirm reservation options using approved systems - Provide accurate information and thoughtful recommendations based on client preferences and budgets - Communicate professionally with clients via email, phone, and messaging platforms - Manage booking updates, schedule changes, and service-related inquiries efficiently - Maintain accurate client records, documentation, and booking details - Participate in required training sessions, meetings, and professional development Required Qualifications - Strong written and verbal communication skills with a professional approach - High attention to detail and strong organizational abilities - Basic computer proficiency and reliable high-speed internet access - Ability to work independently and remain productive in a remote setting - Must be at least 18 years of age - Legal eligibility to work in the U.S., U.K., Mexico, Australia, or Spain What We Offer - Fully remote role with flexible scheduling - Comprehensive training and ongoing professional development - Access to incentive programs and travel-related perks - Supportive, collaborative team environment

