knowmad mood

growing together

Senior SRE Engineer (Site Reliability Engineering)

DevOps EngineerDevOps EngineerFull Time Remote SeniorTeam 1,001-5,000Since 1994H1B No SponsorCompany Site LinkedIn

Location

Spain

Posted

48 days ago

Salary

Seniority

Senior

No structured requirement data.

Job Description

We are knowmad mood! Somos una compañía líder en transformación digital, en constante evolución y a la vanguardia de la tecnología. Nacimos para provocar un cambio real a través de la innovación y el desarrollo sostenible, con la misión de aportar valor a los clientes e impulsar nuestro talento. Formado por más de 3.000 personas creativas, digitales e innovadoras conectadas a un propósito y capaces de generar conexiones con personas de todo el mundo. Un equipo responsable, flexible y con alta capacidad de adaptación a las necesidades de nuestros clientes y del mercado, a la vez que proporciona valor, visión, creatividad, expertise, profesionalidad y pasión por la tecnología en cada proyecto. Los valores que marcan nuestro rumbo y nos guían hacia la excelencia son la colaboración, la innovación, el compromiso, la diversión y la conﬁanza. ¿Qué es lo que valoramos? - Compromiso, trabajo en equipo y capacidad para comunicar ideas técnicas complejas. - Experiencia sólida en monitorización y observabilidad (Prometheus, Grafana, ELK). - Conocimiento avanzado en arquitecturas de microservicios y patrones de resiliencia. - Dominio de Java 8/11 y prácticas de testing. - Experiencia con Docker y Kubernetes. - Experiencia en performance testing, resolución de incidencias y guardias on-call. - Conocimientos en SRE, Chaos Engineering y AIOps. - Perfil claramente senior y acostumbrado a trabajar en squads multidisciplinares. - Nivel alto de inglés. ¿Cuáles serían tus funciones? - Garantizar la fiabilidad y disponibilidad de los servicios en producción aplicando prácticas SRE. - Diseñar e implementar monitorización y observabilidad con Prometheus, Grafana y ELK. - Gestionar arquitecturas de microservicios, aplicando patrones de resiliencia (circuit breaker, bulkheading, service discovery). - Desarrollar y mantener automatizaciones y servicios en Java (8/11) con buenas prácticas de testing. - Administrar y optimizar contenedores y despliegues en Docker y Kubernetes. - Realizar performance testing, análisis de capacidad y mejora continua del rendimiento. - Participar en guardias rotativas y resolución de incidencias críticas, incluyendo post-mortems. - Aplicar Chaos Engineering para validar la resiliencia del sistema. - Implementar prácticas de AIOps para mejorar la detección y respuesta automatizada de incidentes. - Colaborar dentro de un squad multidisciplinar, aportando visión técnica y coordinándose con desarrollo, QA y producto. Además, valoraremos muy positivamente si tienes experiencia y/o conocimientos en: - Arquitecturas cloud (Azure, AWS o GCP). - Gestión avanzada de pipelines CI/CD. - Observabilidad de aplicaciones distribuidas a gran escala. - Metodologías ágiles y trabajo en squads. - Certificaciones en SRE, Kubernetes o cloud. Y con nosotros podrás disfrutar de: ✅Contrato Indefinido ✅ 100% remoto y flexibilidad horaria ✅Formación interna y acceso a certificaciones ♻Consulta nuestro calendario aquí: https://www.knowmadmood.com/es/talento/formacion ✅Plan de retribución flexible (seguro médico, transporte, tickets guardería, tickets restaurante) ✅Embajador de nuestra marca, a través de nuestro plan amigo ¡Recomienda a tus amigos y llévate un extra! ✅¡Eventos, meetups, techdays, charlas...y mucho más! En knowmad mood nos comprometemos con la igualdad de oportunidades y el respeto a la diversidad. Aplicamos nuestro Plan de Igualdad y el principio de no discriminación en todos nuestros procesos de selección. Para estar al corriente de nuestras novedades síguenos aquí -> knowmad mood

Related Categories

DevOps Engineer

Related Job Pages

Remote Full-time Jobs (US)More Remote Jobs

More DevOps Engineer Jobs

Senior Site Reliability Engineer

GoDaddy

GoDaddy is a web services platform that helps individuals and businesses worldwide start, grow, and manage their online presence. GoDaddy employs team members across North America,

DevOps Engineer48 days ago

Full Time Remote

Location Details: India, Remote At GoDaddy the future of work looks different for each team. Some teams work in the office full-time; others have a hybrid arrangement (they work remotely some days and in the office some days) and some work entirely remotely. This is a remote position, so you’ll be working remotely from your home. You may occasionally visit a GoDaddy office to meet with your team for events or meetings. Join our Team The Networking – Tools and Monitoring team is responsible for monitoring GoDaddy's global network infrastructure. We keep watch over data centers and backbone networks across the USA, Singapore, Germany, and France, as well as office locations in cities around the world. Our mission is simple but critical: deliver accurate, reliable network monitoring data and fast-firing alerts so that our engineers are notified of incidents as quickly as possible! As a Senior Site Reliability Engineer you will be a key technical owner on the team. You will deploy and operate Kubernetes-based services on OpenStack, drive GitOps workflows, and write Python tooling that underpins our Pulumi-based IAC platform. You will directly shape how we monitor the global GoDaddy network and how we deliver reliable, automated services to our internal customers. You will also maintain and extend devcontainer based working environments with scripting, AI rule governance and custom CLI tools. What you'll get to do... - Deploying and managing Kubernetes clusters and workloads hosted on OpenStack - Operating and improving GitOps delivery pipelines using Rancher Fleet - Developing new and maintaining existing Python tools and libraries within our Pulumi-based infrastructure-as-code environment - Supporting internal customers when they experience issues with our services or tooling Your experience should include... - 6+ years in an SRE, platform engineering, or equivalent DevOps / infrastructure role - Deep, hands-on Kubernetes experience: cluster administration, RBAC, networking — not just consuming managed services - Proficiency with at least one infrastructure-as-code tool: Pulumi, Helm, Kustomize, or Terraform - Very strong scripting and automation skills in Python and Bash - Very Strong Linux background - Experience with AI-assisted development tooling (Claude Code, GitHub Copilot, Codex, or equivalent) - Solid network troubleshooting fundamentals You might also have... - Production experience with Rancher for multi-cluster management, provisioning, and upgrades - Hands-on Rancher Fleet experience for GitOps-based workload and configuration management - OpenStack automation via API or Horizon UI - Experience managing network devices: Juniper / Arista, BGP, SNMP, streaming telemetry - Certifications (LPIC, CCNA, etc.) — note: hands-on experience always outweighs certificates We've got your back...  We offer a range of total rewards that may include paid time off, retirement savings (e.g., 401k, pension schemes), bonus/incentive eligibility, equity grants, participation in our employee stock purchase plan, competitive health benefits, and other family-friendly benefits including parental leave. GoDaddy’s benefits vary based on individual role and location and can be reviewed in more detail during the interview process. We also embrace our diverse culture and offer a range of Employee Resource Groups (Culture). Have a side hustle? No problem. We love entrepreneurs! Most importantly, come as you are and make your own way. We encourage you to apply even if your experience or skillset doesn’t align perfectly with every requirement. We value a wide range of backgrounds and transferable skills, and we are excited to support learning and growth. About us... GoDaddy is empowering everyday entrepreneurs around the world by providing the help and tools to succeed online, making opportunity more inclusive for all. GoDaddy is the place people come to name their idea, build a professional website, attract customers, sell their products and services, and manage their work. Our mission is to give our customers the tools, insights, and people to transform their ideas and personal initiative into success. To learn more about the company, visit About Us. At GoDaddy, we know diverse teams build better products—period. Our people and culture reflect and celebrate that sense of diversity and inclusion in ideas, experiences and perspectives. But we also know that’s not enough to build true equity and belonging in our communities. That’s why we prioritize integrating diversity, equity, inclusion and belonging principles into the core of how we work every day—focusing not only on our employee experience, but also our customer experience and operations. It’s the best way to serve our mission of empowering entrepreneurs everywhere, and making opportunity more inclusive for all. To read more about these commitments, as well as our representation and pay equity data, check out our Diversity and Pay Parity annual report which can be found on our Diversity Careers page. GoDaddy is proud to be an equal opportunity employer. GoDaddy will consider for employment qualified applicants with criminal histories in a manner consistent with local and federal requirements. Refer to our full EEO policy. Our recruiting team is available to assist you in completing your application. If they could be helpful, please reach out to myrecruiter@godaddy.com. GoDaddy doesn’t accept unsolicited resumes from recruiters or employment agencies.

View details: Senior Site Reliability Engineer

India

Apply

Regional Site Start Up

Parexel

DevOps Engineer48 days ago

Full Time RemoteTeam 10,001+Since 1983H1B Sponsor

Company Site LinkedIn

When our values align, there's no limit to what we can achieve. At Parexel, we all share the same goal - to improve the world's health. From clinical trials to regulatory, consulting, and market access, every clinical development solution we provide is underpinned by something special - a deep conviction in what we do. Each of us, no matter what we do at Parexel, contributes to the development of a therapy that ultimately will benefit a patient. We take our work personally, we do it with empathy and we're committed to making a difference. The Regional Site Start Up (SSU) role is responsible for leading and delivering site start-up and activation activities across clinical trials. This role will ensure timely site activation, maintain strong relationships with sites, and work cross-functionally with internal and external teams to efficiently achieve study site activation timelines. The role provides regional expertise, ensuring large areas of geographic-specific needs are addressed and adherence to study milestone timelines. This role must possess excellent interpersonal skills, attention to detail, and the ability to collaborate across teams to ensure timelines are achieved. CORE JOB RESPONSIBILITIES: Site Start Up and Activation: - Accountable to delivering individual site activation timelines to plan for assigned sites - Gather, organize and share, as appropriate, all required essential documents from clinical sites and Sponsor specific documents to ensure compliance with Regulatory and Sponsor requirements as part of the site activation process - Collect site intelligence to inform site discussions and maintain site information in CTMS - Ensure site regulatory packages meet country requirements, TMF standards and ICH-GCP compliance - Assist with reviewing Informed Consent Forms (ICF) as requested - Facilitate the translation of Essential Documents that may be required in languages other than English for purposes of submission to and approval from Regulatory Health Authorities and/or Independent Review Board/Ethics Committees - Provide regional expertise, addressing specific geographic challenges to facilitate site activation. Serve as the primary point of contact and escalation point for sites: troubleshoot issues and provide strategic solutions to ensure activation timelines are achieved - Update trackers with key study information, risks and mitigation strategies - Ensure all site start-up documents are filed in the TMF and are inspection ready - Support inspection readiness activities related to site start up documents Cross-Functional Collaboration: - Partner with internal, external stakeholders and clinical sites to ensure good communication and coordination through the site start-up phase - Ensure alignment with all global and local regulatory requirements Process Optimization and Compliance: - Maintain accurate records of site activation progress, including updates on document collections, submissions statuses, and timelines - Identify and escalate challenges or delays in document collection, regulatory submissions, or site activation processes for resolution - Identify opportunities for process improvement in site start-up activities and implement best practices to enhance efficiency and effectiveness Job Requirements: In addition to the core duties outlined, the following qualifications are required for the Regional Site Start Up II role: - Demonstrated interpersonal & leadership skills - A data driven approach to planning, executing, and problem solving - Effective communication skills via verbal, written and presentation abilities - Proactive and self-disciplined, ability to meet deadlines, effective use of time, and prioritization · Demonstrated vendor management experience - Technical proficiency in trial management systems (CTMS, TMF) and MS applications including (but not limited to) Project, PowerPoint, Word, Excel · Experience in the clinical drug development process, including study start-up - Knowledge of ICH/GCP and regulatory guidelines/directives - Ability to understand and implement operational strategic direction and guidance for respective clinical trials, fostering a culture of collaboration and trust across diverse teams and stakeholders. - Support stakeholders by addressing concerns promptly and professionally, building positive relationships, and ensuring clear communication to maintain alignment with trial objectives - Contribute to team productivity by maintaining open communication and supporting team members in their tasks - Education: Bachelor’s Degree, minimum - Years of Experience: 3 - 4 years #LI-KW1

View details: Regional Site Start Up

Canada

Apply

DevOps (k8s, terraform, Grafana, ELK Stack, Kafka, MongoDB)

coara

DevOps Engineer48 days ago

Full Time RemoteTeam 11-50

Are you a passionate DevOps professional with a love for Kubernetes and Terraform? Do you have experience with medium and large SaaS microservice architectures? Are you interested in running a platform that is stable, efficient, and most importantly, automated, so you never have to work late or be disturbed from your sleep? At coara, we build companies and develop digital products. We're looking for a DevOps specialist to bolster our team and ensure our established projects run 100% stably and efficiently. Tasks - Ensure and improve platform operations, stability and performance - Maintain automated k8s setup in Terraform - Optimize horizontal and vertical autoscaling with Terraform in an AWS or Scaleway cloud - Build and improve monitoring and alerts in Grafana and Kibana (CPU, Memory, Disk usage, Kafka Lag, Service Availability, Service Latency, etc.) - Monitor and improve latencies in high-performance environment - Monitor and configure KrakenD API Gateway - Keep disaster recovery plan up to date and perform dry runs - Maintain up-to-date maintenance plans and documentation - Monitor and improve latencies in high-performance environmenturce usage - Optimize AWS cost - Look into Backend code to help backend team optimizing (JAVA and NestJS/NodeJS) Requirements You should have sound knowledge of: - Kubernetes (in AWS and Scaleway) - Microservice Architecture (with KAFKA as connecting event engine) - Terraform, Helm - MongoDBs (and Postgres) - Redis - ELK Monitoring Stack - CI/CD pipelines in gitlab (and bitbucket) Bonus skills: - JAVA Spring Framework - NestJS - Kafka and other queuing systems like Bull or NATS Our team communicates in English, Spanish, and German so it would be great if you're proficient in one or two of these languages. We're based in Estonia and Mallorca but also work remotely. We run two k8s clouds in AWS and Scaleway with several hundred PODs and dozens of databases. We're a small and young team, which is sometimes challenging but can also be very motivating and rewarding. We don't operate like a corporate environment. An interesting and dynamic environment awaits with the opportunity to continually learn new things. Let me know if you need further assistance!

View details: DevOps (k8s, terraform, Grafana, ELK Stack, Kafka, MongoDB)

Spain

€35K - €65K / year

Apply

DevOps Engineer

Ciklum

At Ciklum, we are always exploring innovations, empowering each other to achieve more, and engineering solutions that matter. With us, you’ll work with cutting-edge technologies, contribute to impactful projects, and be part of a One Team culture that values collaboration and progress. As one of Ukraine’s largest IT companies and a top employer recognized by Forbes, we’ve spent over 20 years delivering meaningful tech solutions. We proudly support diverse talent and military veterans, recognizing their unique skills and perspectives they bring to shaping the future.

DevOps Engineer48 days ago

Full Time RemoteTeam 1,001-5,000

Ciklum is looking for a DevOps Engineer to join our team full-time in Poland. We are a custom product engineering company that supports both multinational organizations and scaling startups to solve their most complex business challenges. With a global team of over 4,000 highly skilled developers, consultants, analysts and product owners, we engineer technology that redefines industries and shapes the way people live. About the role: As a DevOps Engineer, you will be part of a Messaging Integrations team within a client operating in the online food delivery domain, helping to build world-class messaging solutions. You will work alongside a team of engineers supporting other engineering teams by providing capabilities, tools, and support that enable consistent, safe, and fast delivery. Messaging acts as the central nervous system of the platform and microservices landscape. The team focuses on enabling integrations in this space, with an emphasis on innovation, efficiency, and reliability. The goal is to simplify messaging, allowing product and engineering teams to focus on customer value. The role involves contributing to the design and architecture of the developer experience, as well as cloud-based messaging and microservices platforms, and is well-suited for engineers who enjoy solving complex, event-driven challenges and thinking outside the box. About Client: With almost 100 million active users across 25 countries, they’re a global food tech company. As a recently formed team, they have many opportunities and ideas for sharing value back to the customers within their continuously expanding platform. They're looking for talented and trusted engineers to help them impress their customers. Responsibilities: - Strong interest in event-driven and microservices architectures, with focus on how systems communicate and scale - Good understanding of APIs (both synchronous and asynchronous) and their role in distributed systems - Collaborate closely with engineering teams across multiple locations, supporting best practices, standards, and system design from a platform/infrastructure perspective - Act as a supportive partner for engineering teams, helping to solve challenges related to messaging, integrations, and system reliability - Contribute to improving platform capabilities, tooling, and processes that enable teams to deliver services efficiently and safely - Take ownership of workstreams, track progress, and communicate updates clearly within the team - Drive continuous improvement by suggesting and implementing better processes, designs, and operational practices - Ensure solutions are reliable in production, supporting stability, performance, and scalability - Participate in on-call rotation to support production systems and ensure smooth operation Requirements: - Deliver and support reliable platform and infrastructure solutions, ensuring quality and stability across the lifecycle - Basic understanding of coding/scripting (e.g. Go, C#, or similar) to support automation and platform tasks - Hands-on experience with messaging technologies (e.g. Kafka, SNS/SQS), event-driven architecture, and/or real-time data/streaming concepts - Ability to work independently and collaborate to solve complex infrastructure and system challenges - Hands-on experience with AWS - Experience working in Agile environments (Scrum, Kanban) - Strong analytical thinking and communication skills - Hands-on exposure to messaging systems, event-driven architecture, or real-time data/streaming concepts Desirable: - Knowledge of C# .NET - HTTP, REST or other API technologies (we use OpenSpec or AsyncAPI) - Modern DevOps mentality, frequent CI/CD release cycles, aware of the value of self service - Knowledge of infrastructure as code tools such as pulumi, CDK, terraform - Working in microservices and event-driven architecture - Serverless computing and cloud architecture patterns - Working within an e-commerce business where reliability is critical - Experience with frameworks to manage containerized workloads and services, such as Kubernetes What`s in it for you? - Strong community: Work alongside top professionals in a friendly, open-door environment - Growth focus: Take on large-scale projects with a global impact and expand your expertise - Tailored learning: Boost your skills with internal events (meetups, conferences, workshops), Udemy access, language courses, and company-paid certifications - Endless opportunities: Explore diverse domains through internal mobility, finding the best fit to gain hands-on experience with cutting-edge technologies - Flexibility: Enjoy flexibility – full remote working possibilities - Care: We’ve got you covered with company-paid medical insurance, mental health support, and financial & legal consultations About us: At Ciklum, we are always exploring innovations, empowering each other to achieve more, and engineering solutions that matter. With us, you’ll work with cutting-edge technologies, contribute to impactful projects, and be part of a One Team culture that values collaboration and progress. With delivery centers in Wrocław and Gdańsk, our 300+ professionals in Poland drive forward-thinking solutions for global clients. Join a community where collaboration sparks innovation—and your impact reaches millions. Want to learn more about us? Follow us on Instagram, Facebook, LinkedIn. Explore, empower, engineer with Ciklum! Interested already? We would love to get to know you! Submit your application. We can’t wait to see you at Ciklum.

View details: DevOps Engineer

Poland

Apply

Senior SRE Engineer (Site Reliability Engineering)

Job Description

Related Guides

Related Categories

Related Job Pages

More DevOps Engineer Jobs

Senior Site Reliability Engineer

Regional Site Start Up

DevOps (k8s, terraform, Grafana, ELK Stack, Kafka, MongoDB)

DevOps Engineer