Platform Engineer – AWS

Platform EngineerPlatform EngineerFull TimeRemoteSeniorTeam 201-500Since 2021H1B No SponsorCompany SiteLinkedIn

Location

United States

Posted

9 hours ago

Salary

0

Seniority

Senior

Job Description

Platform Engineer – AWS

Embrace Software Inc

• Own and evolve the AWS infrastructure that underpins our multi-tenant SaaS platform. • Design, provision, and manage production-grade AWS services including EC2, S3, RDS, ECR, VPC, IAM, CloudFront, Route 53, and EKS/ECS clusters. • Implement and maintain Infrastructure as Code (IaC) using Terraform or CloudFormation. • Architect and optimize PostgreSQL infrastructure including automated backups, replication, and performance tuning. • Drive high availability, disaster recovery planning, scalability, and cloud cost optimization initiatives. • Build and maintain delivery pipelines for rapid, safe, and reliable deployments. • Design and operate CI/CD workflows for Python (Django/Flask/FastAPI) and React applications. • Implement deployment strategies to reduce production risk. • Manage Docker-based development and production environments. • Design, operate, and optimize our container orchestration platform. • Build and maintain monitoring, alerting, and observability systems using CloudWatch, Datadog, Prometheus, and Grafana. • Partner with engineering teams to improve deployment reliability and operational efficiency.

Job Requirements

  • 5+ years of progressive DevOps/SRE experience in SaaS or enterprise environments.
  • Infrastructure as Code using Terraform (AWS provider, modules, multi-environment state management).
  • AWS core services: EKS, ECR, RDS, VPC, IAM, CloudWatch, ALB, EFS, S3, CloudFront, Route 53.
  • Kubernetes administration: Helm charts, pods, deployments, services, kubectl, autoscaling.
  • Docker containerization including multi-stage builds and registry operations.
  • CI/CD pipelines: AWS CodeBuild, GitHub Actions, GitLab CI, or Jenkins.
  • PostgreSQL production management: backup automation, replication, monitoring, performance tuning.
  • Linux systems administration (Ubuntu/Amazon Linux) and shell scripting proficiency.
  • Networking fundamentals: DNS, load balancing, TLS/SSL, firewall rules, VPN configurations.
  • Monitoring and observability: Datadog, FluentBit, CloudWatch Logs.
  • Security: AWS Secrets Manager, ACM certificates, security groups, IAM policies.
  • Application stack: Django, Celery, Redis, PostgreSQL, Nginx.
  • Git workflows, branching strategies, and pull request review processes.
  • Strong problem-solving skills with a proactive, ownership-driven approach.

Benefits

  • Competitive salary commensurate with experience.
  • Opportunities for career advancement and professional development.
  • Experience collaborating with a diverse, global team within a remote work setting.

Related Categories

Related Job Pages

More Platform Engineer Jobs

FanDuel logo

Staff AI Software Engineer - Platform

FanDuel

FanDuel is one of the leading online platforms for the fast-growing daily fantasy sports industry. The company launched in 2009 as the result of a “backyard brainstorm” between

Platform Engineer22 hours ago

Title: Staff AI Software Engineer - Platform Location: New York City Job Description: THE POSITION Our roster has an opening with your name on it As the senior-most technical Knowledge & Context Engineer, you will design and operationalized a centralized multimodal memory architecture spanning vector retrieval, knowledge graphs, ontologies, metadata systems, runtime memory injection, and lineage and governance frameworks. This role sits at the intersection of AI engineering, distributed systems, and applied intelligence. You will lead the strategy and execution for how enterprise knowledge is structured, represented, governed, retrieved, and operationalized across agents, copilots, automation systems, and customer-facing AI products. Importantly, you will sit at the cutting edge of building the cognitive substrate for an AI-native enterprise. Come join us as a hands-on thought leader who innovates by doing. In addition to the specific responsibilities outlined above, employees may be required to perform other such duties as assigned by the Company. This ensures operational flexibility and allows the Company to meet evolving business needs. THE GAME PLAN Everyone on our team has a part to play Build foundational content intelligence systems: Design systems for ingestion, indexing, embeddings, metadata, retrieval, lineage, governance, and auditability that can support both internal and customer-facing AI use cases. This includes customers, media and marketing assets, product features, production code, websites, game sounds, customer service experiences, operational workflows, and other enterprise content. Establish enterprise knowledge graphs and ontologies: Define and implement a regulatory-compliant knowledge graph strategy that creates deep context about FanDuel’s products, employees, operations, customers, and systems. Own the design of graph databases, semantic models, ontologies, entity resolution patterns, and relationships across vector and non-vector data. Build the connective tissue that allows AI systems and agents to reason over enterprise context with precision, transparency, and control. Design secure memory patterns for agents: Create reusable design patterns for how AI agents acquire, store, retrieve, update, and discard context during runtime. This includes short-term memory, long-term memory, episodic memory, summarization, context injection, retrieval augmentation, and governed memory sharing across tools and systems. Ensure memory systems are efficient, secure, auditable, and appropriate for regulated environments. Champion responsible AI development: Ensure knowledge, retrieval, graph, and memory systems meet regulatory requirements, ethical standards, privacy obligations, and responsible gaming principles. Build safeguards, access controls, provenance, explainability, monitoring, and auditability into the platform from day one. In six months, you’ll know you’re heading on the right path if you’ve built: - Reusable runtime memory patterns that are being used by multiple agents to securely acquire, retrieve, summarize, and apply context during execution. - A production-grade graph and ontology framework connecting products, customers, employees, operations, systems, and content with clear lineage, access controls, and regulatory compliance. - Teams can build AI solutions faster because retrieval, memory, metadata, governance, and knowledge graph capabilities are available as shared primitives rather than bespoke pipelines. THE STATS What we're looking for in our next teammate ABOUT FANDUEL FanDuel Group is the premier mobile gaming company in the United States and Canada. FanDuel Group consists of a portfolio of leading brands across mobile wagering including: America’s #1 Sportsbook, FanDuel Sportsbook; its leading iGaming platform, FanDuel Casino; the industry’s unquestioned leader in horse racing and advance-deposit wagering, FanDuel Racing; and its daily fantasy sports product. In addition, FanDuel Group operates FanDuel TV, its broadly distributed linear cable television network and FanDuel TV+, its leading direct-to-consumer OTT platform. FanDuel Group has a presence across all 50 states, Canada, and Puerto Rico. The company is based in New York with US offices in Los Angeles, Atlanta, and Jersey City, as well as global offices in Canada and Scotland. The company’s affiliates have offices worldwide, including in Ireland, Portugal, Romania, and Australia. FanDuel Group is a subsidiary of Flutter Entertainment, the world's largest sports betting and gaming operator with a portfolio of globally recognized brands and traded on the New York Stock Exchange (NYSE: FLUT). PLAYER BENEFITS We treat our team right We offer amazing benefits above and beyond the basics. We have an array of health plans to choose from (some as low as $0 per paycheck) that include programs for fertility and family planning, mental health support, and fitness benefits. We offer generous paid time off (PTO & sick leave), annual bonus and long-term incentive opportunities (based on performance), 401k with up to a 5% match, commuter benefits , pet insurance, and more - check out all our benefits here: FanDuel Total Rewards. *Benefits differ across location, role, and level. FanDuel is an equal opportunities employer and we believe, as one of our principles states, “We are One Team!”. As such, we are committed to equal employment opportunity regardless of race, color, ethnicity, ancestry, religion, creed, sex, national origin, sexual orientation, age, citizenship status, marital status, disability, gender identity, gender expression, veteran status, or any other characteristic protected by state, local or federal law. We believe FanDuel is strongest and best able to compete if all employees feel valued, respected, and included. The applicable salary range for this position is $170,000 - $213,000 USD, which is dependent on a variety of factors including relevant experience, location, business needs and market demand. This role may offer the following benefits: medical, vision, and dental insurance; life insurance; disability insurance; a 401(k) matching program; among other employee benefits. This role may also be eligible for short-term or long-term incentive compensation, including, but not limited to, cash bonuses and stock program participation. This role includes paid personal time off and 14 paid company holidays. FanDuel offers paid sick time in accordance with all applicable state and federal laws. FanDuel is committed to providing reasonable accommodations for qualified individuals with disabilities. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please email Benefits@fanduel.com. It is unlawful in Massachusetts to require or administer a lie detector test as a condition of employment or continued employment. An employer who violates this law shall be subject to criminal penalties and civil liability. #LI-Hybrid 7+ years of engineering experience, preferably working with large distributed systems that support a mix of data and software development activities. Bonus to consulting engineers and people with experience on early start up teams Track record of taking products from concept to launch in fast-moving, ambiguous environments. - Practical fluency with generative AI tools and concepts—especially graph, RAG, AgenticRAG, fine-tuning, and anti-RAG patterns - Experience building or operating vector and graph DBs, otnology, semantic search, and runtime memory evaluations - Ability to operate autonomously, create clarity from ambiguity, and influence across a matrixed organization. - Strong communication skills—able to translate AI concepts into clear value narratives for both technical and non-technical stakeholders. - High ownership mindset with a bias for action and outcomes over outputs. - For bonus points, experience and background in gaming, fintech, marketplace, or other regulated industries. - The first context sets of centralized, governed multimodal vector store and retrieval layer supporting AI applications across customer, product, marketing, engineering, operations, and support domains.

New York
$170K - $213K / year
Elfonze Technologies logo

OCI Platform Engineering, Administration

Elfonze Technologies

In a world of quantity, we offer quality...

Full TimeRemoteTeam 201-500Since 2020H1B No Sponsor

• Lead design and implementation of OCI architectures including landing zones, compartment strategy, and governance guardrails. • Own OCI IAM design and operations: policy authoring, dynamic groups, access reviews, and audit-ready evidence. • Design and administer enterprise OCI networking and connectivity: DRG, DRG route tables, route distributions, IPSec VPN/FastConnect, DNS strategy, load balancers, and security controls (NSGs/security lists). • Define and enforce tagging, naming, and cost management practices across environments. • Own operational administration for Oracle PaaS services as applicable (e.g., Autonomous Database, Oracle Integration Cloud, API Gateway, Functions, Streaming, Vault/KMS, Logging, Monitoring/Alarms) and their integrations. • Establish monitoring and alerting, backup/restore strategy, certificate management, and upgrade/patch planning for PaaS services. • Perform advanced troubleshooting across platform dependencies including identity, network paths, TLS/certificates, and service integrations; execute root-cause analysis (RCA) and implement preventive actions. • Lead Fusion instance provisioning end-to-end: intake validation, provisioning orchestration, baseline configuration, access enablement, and post-provision readiness validation. • Administer Fusion environments: user and role access governance, environment refresh coordination, release/patch impact planning, and integration enablement (OIC, SSO/IdP, allowlisting). • Create and maintain standardized runbooks, checklists, and readiness gates for Fusion lifecycle operations and change validation. • Build and maintain Terraform automation: reusable modules, remote state strategy, environment promotion (Dev/Test/Prod), code reviews, and CI/CD pipeline integration. • Implement operational automation using scripting (Python/Bash) for provisioning, compliance checks, reporting, and routine admin tasks. • Define change control standards including peer review, automated validation, and release governance. • Lead small technical workstreams; plan execution, manage dependencies, and drive delivery outcomes. • Mentor Associates; support onboarding, training, and operational maturity improvements. • Drive operational excellence: service health reporting, dashboards, SLIs/SLOs, and continuous improvement initiatives.

India
Thoughtworks logo

Lead Platform Engineer

Thoughtworks

Thoughtworks is a dynamic and inclusive community of bright and supportive colleagues who are revolutionizing tech. As a leading technology consultancy, we’re pushing boundaries through our purposeful and impactful work. Over 30 years of delivering extraordinary impact with clients. Helping clients solve complex business problems with technology as the differentiator.

Full TimeRemoteTeam 10,001

Role Description Lead Platform Engineers help clients build and evolve systems that client organizations use to deliver and run software. They are comfortable working within teams of people with diverse roles and levels of experience to find solutions that meet the needs of the organization. They combine technical expertise and understanding with consideration of different situational needs. They champion technical quality and effective ways of working as a means to better outcomes for clients, rather than an end in themselves. They help clients to understand agile ways of working and DevOps as a mindset for collaboration. - You will explore the client’s needs and drive the building of a technical roadmap and impactful solution that will support their ambitious business goals. - You will help shape and build Thoughtworks’ cloud and infrastructure practice through collaboration with business development, marketing, and capabilities development teams. - You will ensure and build the controls and processes for continuous delivery and evolution of infrastructure and applications, driving automation through all stages of the process. - You will take a leading role in monitoring and ensuring that technical expectations of deliverables are consistently met on projects. - You will provide expertise and guidance in the areas of DevOps, cloud, platform and infrastructure engineering, both internally and in client sites. - You will establish trusting and thoughtful partnerships with a client’s leadership across engineering and commercial functions. - You will lead the design and implementation of innovative solutions to current constraints and business policies. Qualifications - You can lead the collaborative design of enterprise and/or web-scale hosting platforms and can administer application servers, web servers and databases. - You have a deep understanding of cloud and virtualization platforms, infrastructure automation and application hosting technologies. - You have experience working with software delivery teams and understand DevOps philosophies, Agile methods, Infrastructure as Code and how to apply them to your work. - You regularly apply DevOps philosophy, Agile methods, Infrastructure as Code to your work and lead infrastructure and operations with these approaches. - You have a history of working with server virtualization, at least one IaaS cloud platform, and two or more application runtime platforms including physical servers, virtual servers, container clusters, serverless and databases. - You can write scripts using at least one scripting language and are comfortable with building one or more of: Linux servers, Windows servers or container clusters, and/or Windows server systems. - Experience with continuous integration and continuous delivery tools with different tech stacks, web or mobile. - You’ve previously worked with monitoring systems for availability, performance or security, stress and performance testing with observability patterns: Distributed Tracing/OpenTracing, Log Aggregation, Audit Logging, Exception Tracking, Health Check API, Application Metrics, Self-Healing/Multi-Cloud. - You have an understanding of security concerns, threats and approaches for dealing with them, including infrastructure platform vulnerabilities, secrets management, network security and software supply chain security. - Bonus points if you have experience with unit testing and automated testing tools, stress and performance testing. Requirements - You genuinely enjoy interacting with teammates from across the business and have a knack for communicating technical concepts to nontechnical audiences. - You love creating robust, scalable, flexible and relevant solutions that help transform businesses and industries. - You’re comfortable partnering directly with leadership teams (CTO/CIO/COO) to design technical strategies while simultaneously collaborating with senior IT groups in an advisory capacity. - You adapt effortlessly to uncertainty, embrace change and confidently make decisions with limited information to achieve positive outcomes. Benefits - There is no one-size-fits-all career path at Thoughtworks: however you want to develop your career is entirely up to you. - Your career is supported by interactive tools, numerous development programs and teammates who want to help you grow. - We see value in helping each other be our best and that extends to empowering our employees in their career journeys. Company Description Thoughtworks is a dynamic and inclusive community of bright and supportive colleagues who are revolutionizing tech. As a leading technology consultancy, we’re pushing boundaries through our purposeful and impactful work. For 30+ years, we’ve delivered extraordinary impact together with our clients by helping them solve complex business problems with technology as the differentiator. Bring your brilliant expertise and commitment for continuous learning to Thoughtworks. Together, let’s be extraordinary.

Chile
Full TimeRemoteTeam 201-500

Role Description We are looking for a skilled individual to join our rapidly growing team at Bluelight. This position is ideal for someone who thrives in a fast-paced, dynamic environment where everyone's opinions and efforts are valued and appreciated. You will have the opportunity to contribute to challenging and meaningful projects, developing high-quality applications that stand out in the market. We value continuous learning, personal growth, and hard work, offering a collaborative environment that promotes professional development. If you are passionate about software development and eager to be part of a growing software consultancy, we invite you to apply and join us on this exciting journey. - ETL Data Engineering: Develop and maintain ETL data engineering processes using Python (PySpark) within Azure Synapse Analytics Notebooks, and/or Azure Synapse Analytics Pipelines, to ensure efficient data extractions, transformation, and loading. - Data Warehousing: Apply your expertise in data warehousing, understanding star schemas, facts, and dimensions, to design and build effective data storage structures in a Massively Parallel Processing (MPP) SWL Pool. - Data Source Expertise: Extract data from various sources, including REST APIs, SWL database tables, and CSV files. - Azure Synapse Analytics Expertise: Utilize your deep knowledge of Azure Synapse Analytics to design and optimize data notebooks/pipelines for scalability and performance. - Data Fabric Concepts: Contribute to the implementation and understanding of other Data Fabric concepts, such as data lakes, lakehouses, delta lakes, and data cataloging, to enhance data management capabilities. - Data Modeling: Collaborate with data architects to create data models and schemas that align with business requirements. - Data Quality: Implement data quality checks and validation processes to maintain data accuracy and consistency. - Performance Tuning: Identify and resolve performance bottlenecks and optimize ETL data notebooks/pipelines to meet SLAs. - Monitoring and Troubleshooting: Monitor ETL jobs, diagnose issues, and implement solutions to ensure data pipeline reliability. - Documentation: Maintain comprehensive documentation of ETL data engineering processes, data flows, and data transformations. - Collaboration: Work closely with cross-functional teams to understand data requirements and provide support for data-related initiatives. - Security and Compliance: Ensure data security and compliance with data governance and privacy standards. Qualifications - Bachelor’s degree in Computer Science, Information Technology, or a related field; or equivalent work experience, with certifications related to data engineering or data science (e.g. Azure Data Engineer) being a plus. - Proven experience in ETL data engineering with significant expertise in using Python (PySpark) to perform data extraction, transformation, and loading from REST APIs, SQL database tables, and CSV files. - Proficiency in using Azure Synapse Analytics resources including Notebooks, Pipelines, Linked Services, and Azure Key Vault. - Demonstrated ability to write complex SQL queries, optimize query performance, and work with both SparkSQL and MS SQL to effectively extract, transform, and load data. - Knowledge of data integration best practices and tools. - Experience with version control systems, such as Git (Azure DevOps). - Strong problem-solving and analytical skills, with a keen attention to detail. - Excellent communication skills, both verbal and written, with the ability to work collaboratively in a team environment with shifting priorities. - Familiarity with big data technologies, machine learning, and data analysis preferred. - Experience with data visualization tools (e.g. Power BI, Tableau) and Agile Methodologies a plus. Benefits - Competitive salary and bonuses, including performance-based salary increases. - Generous paid-time-off policy. - Flexible working hours. - Work remotely. - Continuing education, training, conferences. - Company-sponsored coursework, exams, and certifications.

Latin America (LATAM)