Senior Cloud Software Engineer
Location
Maryland
Posted
1 day ago
Salary
0
Seniority
Senior
Job Description
Senior Cloud Software Engineer
INNOVIM
- Build innovative tools that allow scientists and students alike to discover, transform, update, and improve the quality of Earth Science data in the pursuit of solving a wide range of environmental and socio-economic issues
- Work in a fast-paced Agile development environment performing operations, design, and development for the NASA Earthdata Cloud
- Ensure adherence to operational agreements and policies (including response times), compliance with program and NASA standards and requirements (including Section 508 compliance), use of industry best practices for automation and scalability of assets, and implementation of issue detection and alerts
Job Requirements
- Bachelor’s degree in Computer Science, Engineering, or a related technical field
- 6 years of engineering experience
- Knowledge and understanding of application hosting, with 5 years of experience using cloud services in an Infrastructure as a Service (IaaS) or Platform as a Service (PaaS) environment
- In-depth knowledge of Amazon Web Services (AWS), including networking and serverless services such as Lambda and Step Functions
- 5-7 years of programming/scripting experience in C++, Bash, Python, Java, and JavaScript, particularly within an Agile development environment
- Experience in supporting infrastructures and operations using DevOps automation tools such as Puppet, Chef, Ansible, Terraform
- Familiarity with the core suite of AWS services related to DevOps, with depth in those that are heavily used when providing DevOps automation solutions, including the Management and Deployment services such as Terraform and IAM
- A strong desire to automate processes, build software tools, and create infrastructure-as-code solutions in a DevOps environment
- Experience in designing and developing data management systems
- Ability to analyze current cloud infrastructure and define, plan, communicate, and implement improvement solutions
- Ability to work and communicate effectively with customers, all levels of management, and individual contributors on the team
- Experience in mentoring team members in leveraging AWS cloud services for optimal design
Benefits
- Comprehensive nationwide medical/dental/vision insurance programs
- Life insurance
- Matching 401(k) contribution
- Educational/training support




