Illumination Works logo
Illumination Works

Digital Transformation, Data Science, Data Engineering, Augmented Reality, IoT, Cloud, and More

Senior Cloud Engineer

Cloud EngineerCloud EngineerFull TimeRemoteSeniorTeam 51-200Since 2006H1B No SponsorCompany SiteLinkedIn

Location

United States

Posted

3 days ago

Salary

0

Seniority

Senior

Bachelor Degree4 yrs expEnglishAWSCloudDynamoDBPython

Job Description

Senior Cloud Engineer

Illumination Works

• Design and implement serverless workloads on AWS using Lambda, API Gateway, EventBridge, SQS, SNS, Step Functions, DynamoDB, and S3 • Author and maintain CloudFormation (YAML) templates as the source of truth for infrastructure, including nested stacks, change sets, and drift detection • Build and maintain CI/CD pipelines that lint, test, package, and deploy CloudFormation stacks and code across environments with defined promotion and rollback strategies • Develop Python based Lambda functions, internal tooling, and automation with appropriate testing, logging, error handling, and dependency management • Provide architecture guidance for distributed event-driven systems, including choreography vs. orchestration, idempotency, delivery semantics, dead-letter and replay strategies, and schema evolution • Define event contracts and async integration patterns in partnership with application teams • Implement observability across services: structured logging, distributed tracing (OpenTelemetry), metrics, and actionable alarms • Participate in incident response and post-incident reviews and convert findings into automated guardrails and runbooks • Conduct code and design reviews; mentor junior and mid-level engineers on serverless patterns, IaC standards, and operational practices

Job Requirements

  • 4+ years of professional cloud or software engineering experience
  • 2+ years building production AWS serverless workloads
  • Strong Python proficiency, including pytest, packaging, and async patterns
  • Hands-on CloudFormation experience leveraging Stacks and StackSets
  • Demonstrated experience designing and operating CI/CD pipelines that deploy infrastructure, application code, and provision managed services
  • Working knowledge of event-driven architecture patterns and the trade-offs between EventBridge, SNS, SQS, and Kinesis
  • Practical understanding of AWS IAM, VPC networking, KMS, and least-privilege design for serverless workloads
  • Experience producing design documentation and driving technical decisions from ambiguous requirements
  • Light web application development experience to build internal tools, admin UIs, or thin frontends that integrate with serverless APIs
  • Must hold an active security clearance CompTIA Security+ certification
  • Acceptable candidates must successfully pass a drug test and background screen

Benefits

  • market-competitive salary
  • generous PTO package
  • comprehensive medical, dental, vision and life insurance plans
  • 401K
  • short/long-term disability insurance
  • fun and engaging culture
  • training opportunities to keep you up to speed on the latest technologies

Related Categories

Related Job Pages

More Cloud Engineer Jobs

Full TimeRemoteTeam 1,001-5,000Since 1994H1B No Sponsor

• Адмініструвати та підтримувати Microsoft 365 (Exchange Online, Teams, SharePoint Online, OneDrive). • Керувати користувачами, групами, ліцензіями, ролями та доступами. • Підтримувати Microsoft Intune (політики, профілі, захист застосунків, розгортання ПЗ, реєстрація пристроїв). • Адмініструвати Microsoft Entra ID (Conditional Access, MFA, Enterprise Applications, RBAC). • Підтримувати Azure-інфраструктуру та compute-ресурси (Virtual Machines, Networking, Backup, Monitor, Log Analytics), виконувати моніторинг, troubleshooting та планування ресурсів. • Автоматизовувати адміністративні задачі за допомогою PowerShell. • Аналізувати інциденти, працювати з журналами аудиту та взаємодіяти з Microsoft Support. • Документувати рішення та підтримувати актуальність технічної документації.

Ukraine

Senior Cloud Engineer – Database

Thinkahead Consultant Psychologist Pty Ltd

We get to the heart of the matter.....real people......real solutions

Cloud Engineer3 days ago
Full TimeRemoteTeam 1-10H1B No Sponsor

• Lead the day-2 management, support, and continuous improvement of contracted and onboard cloud-managed database platforms across customer environments. • Own complex database incidents, escalations, and problem investigations involving availability, performance, replication, connectivity, failed jobs, schema issues, storage growth, backup failures, restore events, and recovery scenarios. • Administer and optimize contracted cloud-managed database services across AWS, Azure, and GCP, including approved relational, NoSQL, in-memory, and our supported analytics platforms. • Plan and execute approved database operational changes and lifecycle activities including scaling where supported, parameter changes, maintenance coordination, backup validation, restore testing, failover readiness validation, and platform hygiene in compliance with client change control procedures. • Perform database performance management activities including query analysis, slow-query identification, missing/duplicate/unused index analysis, execution plan review, regression detection, wait-state and lock contention analysis, parameter tuning within approved change windows, and performance baseline and trend tracking. • Validate and support backup, restore, retention, point-in-time recovery, replication health, high-availability configuration, failover readiness, and database-layer recovery operations within the technical capabilities of the platform and the approved service scope. • Validate and support database security and compliance posture, including encryption-at-rest and encryption-in transit settings, audit logging and retention validation, CIS benchmark assessment, and network access control assessment for contracted database platforms. • Coordinate operational readiness input for separately scoped migration, upgrade, or modernization efforts when required, while maintaining day-2 operational support for in-scope database platforms. • Maintain monitoring, threshold-based alerting, runbooks, operational documentation, capacity tracking, and health checks for contracted database instances, including alert triage, acknowledgment, and initial remediation within contracted response expectations. • Use approved operational tooling, SQL, scripting, and existing automation capabilities to improve repeatability and service delivery, while aligning any new automation development to separately scoped project work when required. • Partner with client application and infrastructure teams on cross-tier incidents, approved operational changes, workload onboarding, and production operational requirements within the contracted service scope. • Mentor Cloud Engineers and other team members on cloud database operations, troubleshooting methods, standards, and best practices; review work for quality and completeness. • Follow and reinforce ITSM and client change control processes for incident, request, problem, and change management, including pre-change impact assessment, rollback planning, emergency change handling, RCA documentation for SEV-1 and SEV-2 incidents, and customer-facing status communication; participate in after hours/on-call support for critical incidents and approved changes as needed. • Escalate platform-level issues to cloud provider support and manage provider support cases on behalf of the client when required. • Track engine minor version releases and associated CVEs for contracted database platforms and coordinate patch planning within the boundaries of the service scope; major engine version upgrades are handled through separate project scoping. • Prepare and deliver recurring service governance outputs, including weekly operational touchpoints, monthly operational reporting, quarterly business reviews, and ad hoc reporting within the monthly allocation of the service. • Other job duties as assigned.

India
Evry Health logo

Senior Software Engineer - Node

Evry Health

Bringing humanity to health insurance

Cloud Engineer3 days ago
Full TimeRemoteTeam 51-200Since 2017H1B No Sponsor

Title: Sr. Software Engineer (Node) Location: Dallas-Fort Worth, Texas Department: Engineering – Shared Services - IT Job Description: Roles and Responsibilities - System Architecture & Design: Lead the design and implementation of scalable and maintainable systems, ensuring alignment with business requirements and technical standards. - Development & Coding: Write efficient and maintainable code using Node 20 LTS (back-end), Next 15+ (Backend + Front end), React 19+, React Native 0.80+, and Expo SDK 54+ technologies, following best practices for software development, including test-driven development and continuous integration. Primary focus will be on backend services, RESTful API development, Next.js API routes, and SQL database design and optimization. - Backend & Database Development: Design and optimize SQL database schemas, write complex queries, and implement ORMs (Prisma, TypeORM, Sequelize, or similar). Build and maintain robust backend services using Express.js, Fastify, or similar Node.js frameworks. - API Design & Documentation: Create well-documented RESTful APIs using OpenAPI/Swagger specifications, ensuring consistency and ease of integration for frontend and mobile applications. - Technical Leadership: Provide technical guidance and mentorship to junior engineers, conducting code reviews, and ensuring adherence to established coding standards and practices. - Performance Optimization: Identify and resolve performance bottlenecks in systems, databases, and APIs, ensuring high availability and reliability of services. - Collaboration: Work closely with cross-functional teams, including .NET core developers and product managers, to deliver software solutions. - Documentation: Create and maintain technical documentation for systems, processes, and codebases to ensure knowledge sharing and continuity. - Security & Compliance: Implement and enforce security best practices, ensuring that backend systems are secure and compliant with relevant regulations and standards. - Problem Solving: Troubleshoot and resolve complex technical issues, providing timely and effective solutions to minimize downtime and ensure smooth operation of systems. - Cloud & DevOps: Deploy and manage applications on Azure cloud platform, implement CI/CD pipelines, and work with containerization technologies (Docker). Familiarity with GitHub actions and workflows. - Experience and Skills Desired - Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent experience). - 10+ years of professional experience in software development, with a focus on Node and React technologies, primarily in backend development. - Strong backend expertise: Deep proficiency in Node.js 20 LTS for backend development, including experience with Express.js, Fastify, NestJS, or similar frameworks. - Database proficiency: Advanced SQL skills with PostgreSQL, MySQL, or SQL Server, including schema design, query optimization, indexing, and performance tuning. Experience with ORMs such as Prisma, TypeORM, or Sequelize. - API development: Proven experience designing, building, and documenting RESTful APIs. Familiarity with GraphQL or tRPC is a plus. - Experience with full-stack development and back-end technologies (.NET core, Azure, C#, etc.) - Proficiency in Node 20 LTS (back-end), Next 15+ (Backend + Front end), React 19+, React Native 0.80+, and Expo SDK 54+ technologies. - Azure cloud platform: Experience with Azure services (App Service, Azure Functions, Azure SQL Database, Blob Storage, API Management, etc.). - DevOps & containerization: Hands-on experience with Docker, CI/CD pipelines (Azure DevOps, GitHub Actions), and Azure Container Instances or Azure Kubernetes Service. - Testing: Experience with backend testing frameworks (Jest, Mocha, Supertest) and test-driven development practices. - Version control: Strong Git workflow experience, including pull request reviews, branching strategies, and collaborative development. - Understanding of services architecture, distributed systems, and microservices patterns. - Authentication & security: Experience implementing authentication and authorization (JWT, OAuth, Azure AD, or similar). - Monitoring & logging: Experience with Sentry for error tracking and Azure Application Insights for application monitoring and performance analysis. - Caching & message queues: Experience with Redis for caching and BullMQ for job queue management. - Bonus: Healthcare software development experience - Telecommuting Requirements - This is a remote position. Our whole company works remotely. Company headquarters are in Dallas, Texas. While this position is remote, candidates must live in the Dallas-Fort Worth, TX area or be willing to relocate. - Company business hours are weekdays 9-5 CST. - Required to have a dedicated work area established that is separate from other living areas and provides information privacy. - Ability to keep all company sensitive documents secure. - Must live in a location that receives an existing high-speed internet connection/service. - Benefits - Competitive salary - Comprehensive health, dental, and vision insurance as well as life and disability - Retirement savings plan with company match - Generous time off/vacation - Professional development opportunities - Flexible work environment - We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses and identifying potential inconsistencies or verification signals in application materials based on available information. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.

Texas
TaskUs logo

Cloud Engineer

TaskUs

Digital Customer Experience. Trust & Safety. AI Services.

Cloud Engineer4 days ago
Full TimeRemoteTeam 10,001+Since 2008H1B Sponsor

• Provide operational and engineering support. • Handle incident management, problem management, change management (implementation of changes), release management, and capacity management. • Provide L2 and L3 escalation support for the NOC and Service Desk. • Provide project representation in your area of expertise and maintain system documentation.

India