Electrosoft

A leader in cybersecurity services and solutions

Data Scientist

Full Time, Remote, Mid Level, Team 51-200, Since 2001

Location

United States

Posted

111 days ago

Salary

$130K - $140K / year

Seniority

Mid Level

Salesforce AI/ML Tableau Power BI ETL CMS Python Pandas NumPy scikit-learn Jupyter AWS Azure Amazon S3 Amazon Redshift Databricks SQL

Job Description

Electrosoft Services, Inc. is an award-winning company that provides comprehensive technology-based solutions and services to federal customers. While cybersecurity is our specialty, we also focus on ICAM, enterprise IT modernization, and software solutions. We always seek to delight our customers, so we retain highly qualified employees and offer them meaningful work, growth opportunities, and work-life balance. What sets us apart from all other contractors is the sense of teamwork our employees feel – and the knowledge that outstanding effort is recognized and rewarded. The camaraderie we share emanates from Lunch & Learn sessions where we explore new ideas together, fun group activities ranging from escape rooms to miniature golf, and much, much more. If we’ve described you and your dream workplace, please apply and share in the many benefits and opportunities we offer.

Data Scientist

Seeking a highly skilled and mission-driven Data Scientist to support modernization efforts, enhance operational efficiency, and improve data-driven decision-making across multiple government Cybersecurity Operations divisions. This role will contribute to the development and implementation of innovative data solutions, support security initiatives, and manage key platforms such as Salesforce to ensure effective communication and access control. The ideal candidate will bring expertise in data science, machine learning, and visualization, along with experience in federal environments and a strong understanding of HHS-specific operational and security requirements.

Responsibilities:

- Develop and maintain predictive models for vulnerability trending, risk scoring, and program performance.
- Build dashboards (Tableau/Power BI/Plotly) used by HHS leadership to track VAPT metrics, ATO/RMF status, and cybersecurity indicators.
- Conduct ETL and data cleansing across large, heterogeneous datasets (HC3, CMS, security tools, compliance repositories).
- Support CTI-driven analytics, integrating structured and unstructured data (RSS, API feeds, OMB reporting, CVE/NVD data).
- Automate data ingestion and pipelines using Python, APIs, and cloud-native tools.
- Produce executive-level reports, presentations, and insights aligned with HHS PMO standards.
- Ensure proper handling of PHI/PII consistent with HIPAA, FISMA, and federal privacy guidelines.
- Collaborate closely with PMs, VAPT leads, CTI analysts, and HHS stakeholders on data needs and requirements.

Required Skills:

- Bachelor’s or master’s degree in Data Science, Computer Science, Statistics, Mathematics, or a related field.
- Expert Python skills (Pandas, NumPy, SciPy, scikit-learn, Jupyter).
- Strong foundation in statistics, modeling, clustering, anomaly detection, and ML.
- Hands-on experience with Power BI or Tableau for visual analytics.
- Strong ETL/ELT and large dataset processing experience.
- Familiarity with federal cybersecurity data (CVE, NVD, CISA KEV, RMF artifacts, VAPT deliverables).
- Understanding of HIPAA, PHI/PII controls, and federal privacy requirements.
- Experience with AWS or Azure data workflows (S3, Glue, Redshift, Synapse, Databricks, etc.).
- Familiarity with Tines or similar automation/orchestration tools.
- Experience supporting federal cybersecurity programs (HHS, DHS, CISA, or similar strongly preferred).
- Strong proficiency in:
  - SQL (data extraction, transformation)
- Familiarity with:
  - Cybersecurity data sets (vulnerability data, threat intel, asset inventories)
  - CDM (Continuous Diagnostics and Mitigation) ecosystem
  - NIST frameworks (CSF, RMF concepts)
- Ability to operate in cross-functional environments (technical + policy + leadership).
- Ability to translate data into clear, concise, executive-ready narratives.

Pay Range: $130,000 - $140,000 USD

More Data Scientist Jobs

Data Scientist (Forward Deployed)

Fundamental

The Power to Predict. See the future in your data.

Data Scientist, posted 111 days ago

Full Time, Remote, Team 51-200, Since 2024, H1B No Sponsor

About Fundamental

Fundamental is an AI company pioneering the future of enterprise decision-making. Founded by DeepMind alumni, Fundamental has developed NEXUS – the world's most powerful Large Tabular Model (LTM) – purpose-built for the structured records that actually drive enterprise decisions. Backed by world-class investors and trusted by Fortune 100 companies, Fundamental unlocks trillions of dollars of value by giving businesses the Power to Predict. At Fundamental, you'll work on unprecedented technical challenges in foundation model development and build technology that transforms how the world's largest companies make decisions. This is your opportunity to be part of a category-defining company from the ground up. Join the team defining the future of enterprise AI.

About the role

Fundamental is seeking a Forward Deployed Data Scientist to facilitate the adoption of NEXUS and collaborate with customers to address complex technical challenges. The Data Scientist is an integral part of our FDE team, which is dedicated to driving the successful deployment of Fundamental products and proving value over legacy baselines or net new use cases. They work hand-in-hand with customers from the Proof of Value stage to post-implementation, ensuring our solutions run securely in the client's production heartbeat. In this role, you’ll manage customer relations involving multiple stakeholders (IT, C-suite, and data science teams) and function as a key bridge, translating field insights into our product roadmap.

Key responsibilities

- You’ll individually help deploy into production use cases with considerable business impact, moving from "science experiments" to definitive ROI
- You’ll work on rigorous head-to-head benchmarking against client baselines (XGBoost, LightGBM), executing the work of data engineering, feature engineering, and validation
- You’ll work in collaboration with our research and product teams to translate operational pain points and data anomalies into essential inputs for the Fundamental roadmap
- You’ll be involved in technical strategy to identify the right business problems, prevent data leakage, and handle the "last mile" integration (VPC, on-prem, air-gapped)
- Your collaboration with the Sales and Solution Architect teams will help align diverse stakeholders and explain predictions to business users

Must have

- You hold a PhD or master's in CS, Math, or Stats, or you have equivalent deep statistical literacy
- You have 2+ years as a technical individual contributor (data scientist or software engineer)
- You have experience with containerization (Docker), orchestration, and writing performant APIs (FastAPI/Flask)
- You master the end-to-end pipeline, from framing and pre-processing to ML algorithms and validation strategies
- You have a deep understanding of data handling (PySpark, Pandas) and memory optimization
- You have demonstrated experience optimizing models for a specific business problem
- You hold strong communication skills with an ability to translate architectural nuances into clear business value

Nice to have

- Experience with PyTorch and cloud-native ML pipelines (AWS, GCP, Azure)
- Experience as a Forward Deployed Engineer, Staff Engineer, Machine Learning Engineer, or Staff Data Scientist
- Industry-based subject matter expertise

Benefits

- Competitive compensation with salary and equity
- Comprehensive health coverage, including medical, dental, vision, and 401K
- Paid parental leave for all new parents, inclusive of adoptive and surrogate journeys
- Relocation support for employees moving to join the team in one of our office locations
- A mission-driven, low-ego culture that values diversity of thought, ownership, and bias toward action

AI C LightGBM Data Engineering Docker/Containers Docker FastAPI Flask AI/ML PySpark Pandas PyTorch AWS GCP Azure

United States

Citizen Science Volunteer Coordinator and Science Communicator

Friends of the River Wye

Data Scientist, posted 111 days ago

Contract, Remote

Friends of the River Wye (FORW) are seeking an exceptional Citizen Science Volunteer Coordinator and Science Communicator to help coordinate our network of citizen scientists and drive public awareness of the plight of the Rivers Wye and Lugg. This role will work at the intersection of citizen science, volunteer coordination, science communication, community mobilisation and partnership building, helping ensure local voices and evidence guide the river back to health. A significant focus of this role will be on turning complex science and data into compelling narratives of the degradation and restoration of our beloved river. It will suit a passionate, environmentally committed and scientifically literate person who combines creativity with ecological understanding and is as comfortable on social media as they are in meeting with farmers in their kitchens. If you think you can help bring together the power of people, data, and persistent, positive pressure to bring a river back to life, we want to hear from you.

Responsibilities

1. Community & Volunteer Engagement

● Work closely with volunteer citizen scientists, amplifying their findings, and strengthening local action on pollution.
● Maintain stocks of equipment and consumables required by citizen scientists, and arrangements for their distribution.
● Recruit, train, organise and coordinate citizen scientists, including attending events, coordinating meet-ups and running training and feedback sessions. Note that these tend to be evening or weekend sessions to meet volunteer availability, so flexibility is needed.
● Coordinate with the existing citizen science management team and the board of Trustees, ensuring efforts are aligned with FORW’s strategic priorities.

2. Communications & Social Media Leadership

● Be highly active across social platforms: posting regular updates, replying to comments, mythbusting misinformation, and amplifying positive stories.
● Maintain a consistent, evidence-led voice for the Wye across channels.
● Grow the reach and engagement of FORW’s social media presence.

3. Science Interpretation & Public Education

● Translate complex scientific and regulatory information into clear, engaging, accurate explainers.
● Produce accessible analyses of water quality and related data, trends, and emerging issues.
● Use basic GIS and mapping tools (e.g. QGIS, Google Earth, Google Maps) to help visualise citizen science results and identify key locations for sampling and storytelling.
● Use simple data visualisation tools (e.g. Tableau, Flourish, Google Sheets) to present findings in a compelling and public-friendly way.
● Promote and interpret tools like Wye Viz, helping the public understand what the data means.
● Create regular high-quality videos, graphics, explainers, and other compelling content to explain what is happening to the Wye and what people can do to help.
● Work with community groups and other river-focused organisations to ensure aligned communication, shared messaging, and joint opportunities.

4. Farmer and Community Engagement & Positive Practice Amplification

● Build strong, trust-based relationships with community groups, farmers and farming groups to promote meaningful steps to reduce pollution.
● Showcase, celebrate, and elevate farmer voices engaged in genuine action.

Skills and experience

● Highly organised, self-motivated, and capable of managing multiple strands of work.
● Confident working with diverse groups, including volunteers, farmers, community members, NGOs, and public bodies.
● Strong communicator across written, verbal, and digital channels.
● Proven experience in social media management, campaigning, or communications.
● Ability to interpret scientific or technical information and communicate it clearly to the public.
● Working knowledge of GIS or mapping tools to support spatial understanding of citizen science data and river pressures.
● Experience presenting data visually (e.g. dashboards, maps, charts) in ways that are accurate, accessible and engaging for a public audience.
● Strong analytical skills; experience working with large datasets.
● Science qualifications or background a strong positive.
● Experience producing videos, digital content, or graphic explainers.
● Understanding of environmental issues, water quality, and agriculture (highly desirable).
● Willingness to work flexibly, including occasional evenings/weekends for events or reactive communications.
● Current driver's license and consistent access to a car (for transport around the large Wye catchment area).
● As a contracting role, the successful candidate will be expected to provide their own equipment (recent model laptop, mobile phone).
● Welsh language a strong positive.

This position will jointly report to the Chair of the FORW Trustees and the Programme Manager. Where appropriate, they will ensure Welsh translation is provided, and appropriate recognition of grants and sponsorship that have supported our work. A project associated with this role is funded by the Nature Networks Programme. It is being delivered by the Heritage Fund, on behalf of the Welsh Government.

Type: 1-Year contract (2 days per week, flexible working hours)
Rate: £130 per day (contract rate)
Location: Flexible working, based within / across the Wye catchment

GIS Tableau

United Kingdom

Clinical Data Scientist Manager

Dandelion Health Inc

Data Scientist, posted 111 days ago

Full Time, Remote

Our Team

Dandelion Health was founded in 2020 by experts in health tech, hospital systems, academia, and clinical AI. We are building the world’s largest AI training and clinical development platform. Today, we pride ourselves on our ability to make data access as easy as possible for AI developers, pharma, and medical devices, while raising the bar for patient safety and data quality. Tomorrow, we will be the place where any healthcare organization can go to build a responsible clinical AI product. Our culture is all about learning from data and improving, so we can help our clients improve health through AI.

Our Data

We partner with health systems to safely and ethically make their de-identified patient data available to AI developers. Currently, the data is acquired from Sharp HealthCare, Sanford Health, and Texas Health Resources – with two additional U.S. health systems joining soon. We have clinical data dating back to July 1, 2016. This data represents over 10 million patients and includes but is not limited to:

- Structured data (e.g., 100% of the EMR, including some claims)
- Unstructured text (e.g., clinical notes, radiology reports)
- Images (e.g., DICOM, pathology)
- Video
- Waveforms
- Continuous streaming monitoring data

Your Role

You are a data scientist manager who knows your way around healthcare data and has experience both as an individual contributor and as a leader of a team of data scientists. Your primary responsibility is to lead and oversee the creation and maintenance of AI-ready datasets for clients in our AWS environment. You will work across the organization to drive impact by managing data scientists, curating datasets, and pooling multimodal data to enable rapid exploratory AI/ML analyses, model experimentation, and/or model validation. You will use your people leadership, project management, analytical expertise and critical thinking skills to support our technical, product and growth teams. Your team’s ultimate goal is to deliver the highest-quality data possible to our clients, who are building products that improve patient health. You will report to the Chief Data Officer.

Responsibilities

Your responsibilities will include the following:

- Structure, plan, and lead multimodal clinical data projects from ideation to delivery
- Manage direct reports as a player-coach, including hiring, developing, and motivating technical team members
- Query complex health data sources (e.g., EMRs, ECG data, DICOM data) to explore, clean, and operationalize new data sources as they become available and lead the development of streamlined, scalable data models
- Support the design, testing, validation, analysis, and merging of multimodal data structures from various source systems
- Provide thought leadership and technical insight for developing harmonized datasets
- Support all phases of SQL/analytical programming, data management, quality control, and reporting
- Develop and review code and documentation to deliver high-quality, HIPAA-compliant data products to customers on time
- Support the technical product team in advancing development of the suite of data-related product offerings
- Act as the primary technical point of contact (with Customer Success) for client deliverables, communicating clearly and professionally
- Present confidently to senior leadership and external audiences, including client stakeholders
- Ensure accuracy, data integrity, and validity of all data and analysis
- Mentor junior staff with project feedback and code review

You are not afraid to tackle massive, confusing, and disorganized new datasets and get them under control. You are eager to learn new environments, languages, and skills. As a member of an early-stage company with big ambitions, everyone pitches in across the team as needed.

Qualifications

- M.S. or PhD in Data Science, Biomedical Informatics, Biostatistics, Statistics, or a related field with at least 3 years of relevant experience, or B.S. with at least 7 years of experience in an applied quantitative field
- At least 2 years of experience directly managing and leading data science teams, including experience with people and project management
- Experience communicating both technical details and overall team strategies
- Strong background in statistical modeling and information theory concepts
- Fluency in Python and SQL; experience with dbt is a strong plus
- Hands-on experience with extracting, curating, and analyzing data created within HIT and healthcare delivery (e.g., EMR, claims, registry)
- Understanding of data exchange/content standards (e.g., FHIR, CDA, CQL) and clinical terminology standards (e.g., ICD, CPT, LOINC, SNOMED-CT, NDC, RxNorm)
- Strong technical writing, editing, communication skills, and a collaborative mindset
- Excellent organizational skills, adaptability to change, and ability to manage multiple projects to meet deadlines
- Experience working in or with startups is a plus

Technology Experiences and Skills

Candidates should be interested in growing their skills and working with healthcare data in all its complexity. The Data Team works closely with Engineering to deliver client needs.

- SQL
- Python and/or R
- Git and version control
- Familiarity with encryption and regular expressions
- Experience querying EDWs/databases and creating reports/analytics for healthcare data
- Familiarity with electronic medical record systems (e.g., Epic, Cerner, Allscripts)
- Experience with insurance claims data
- Knowledge of medical terminologies and controlled vocabularies (ICD-10, SNOMED-CT, LOINC, CPT/HCPCS, NDC, RxNorm)
- Medical ontology experience (a plus)
- NLP experience (a plus)
- Experience with DICOM or other imaging modalities (a plus)
- Experience with OMOP common data model
- Familiarity with machine learning concepts
- AWS experience

Note: Familiarity with machine learning model development/deployment is not required, but an understanding of high-level ML concepts is a plus.

Nature of Our Work

Our work is fast-paced and iterative. We are growing and support our team members’ development. We value diversity of perspectives, experimentation, and continuous improvement. We work with the full spectrum of healthcare data: tabular, videos, images, waveforms, and more. If a health system collects it, we may work with it! If this seems like a partial fit, please reach out. We would love to share more about the work we do to help you determine if it’s the right fit for you.

Other Details

- Occasional travel for in-person company working days (approximately quarterly)

Team Benefits

- Remote work and flexible hours. Availability needed for meetings, which we try to keep to a healthy minimum
- Complete wellness benefits including healthcare, dental, vision, PTO, sick days and more. Ask for details
- Professional development days to build your skills
- Collegial work environment
- Academic bent towards inquiry and problem solving but start-up speed and flexibility
- Great balance of focus time to work on projects but easy to access team members to discuss issues and work collaboratively
- Dandelion is a mission-driven company that is focused on improving patient care

AI Observability/Monitoring AWS AI/ML SQL Python dbt R Git

United States

$165K - $185K / year

Job Closed

Data Scientist (Applied AI)

Cohere Commerce

Data Scientist, posted 111 days ago

Full Time, Remote

About Cohere Commerce

Cohere Commerce is the leading AI-driven platform for retail sourcing and vendor evaluation, trusted by buyers, consultants, and investors across North America. Backed by top-tier investors with tens of millions in funding and partnered with major retailers like Target and Sprouts, we deliver real-time, validated insights that help the industry make smarter product decisions, streamline supply chains, and enhance the customer experience. As we continue to grow, you’ll play a foundational role in shaping the future of retail intelligence.

The Role

We are looking for a highly motivated Data Scientist (Applied AI) to bridge the gap between traditional data science and cutting-edge Generative AI. In this role, you won’t just be crunching numbers in a vacuum; you will be building the intelligent engines and AI agents that drive our platform forward.

What You'll Do

- Build the Foundation: Design and build high-quality data pipelines by extracting, cleaning, and integrating multi-source retail data (e-commerce transactions, social media, supply chain, user behavior, and funding info).
- Develop Applied GenAI: Play a core role in developing LLM-powered applications. You’ll build RAG (Retrieval-Augmented Generation) systems, manage vector databases, and continuously iterate on Prompt Engineering.
- Architect AI Agents: Help design and implement AI Agent workflows. You will use function calling and multi-step reasoning to automate complex, retail-specific business logic.
- Evaluate & Iterate: Write evaluation scripts to test model accuracy and prep datasets for fine-tuning, ensuring our models become leading "domain experts" in retail.
- Drive Business Value: Apply machine learning and statistical methods to power personalized recommendations and uncover deep insights into customer behavior.
- Ensure Data Integrity: Champion data governance, monitor data quality, and set up anomaly detection. You will also design and evaluate A/B tests to guide product decisions with hard data.

What We're Looking For

Required

- Education: Bachelor’s degree or higher in Computer Science, Data Science, Statistics, AI, or a related field. (Recent grads, current Master's/Ph.D. students, and early-career professionals with 1-3 years of experience are highly encouraged to apply!)
- Data Engineering Chops: Strong SQL skills with experience in massive data processing, ETL pipeline development, and an understanding of modern Data Warehouse/Data Lake architectures.
- Programming & ML: Fluency in Python. You are comfortable with data wrangling, feature engineering, and standard machine learning workflows (classification, clustering, evaluation metrics).
- LLM Foundations: You understand how Large Language Models work under the hood and have tinkered with frameworks like LangChain, LlamaIndex, LLM APIs, and vector databases.
- Hacker’s Mindset: You are highly self-directed. You can read documentation, quickly pick up new open-source tools or protocols (like MCP), and creatively solve engineering roadblocks.
- Analytical Rigor: You have a sharp sense for data, strong logical reasoning skills, and experience with hypothesis testing, A/B testing, and metrics management.
- Collaboration: Excellent communication skills with the ability to work cross-functionally and turn data into actionable business insights.

Bonus If You Have

- Previous data project experience in retail, CPG, or e-commerce.
- Familiarity with managed AI services on AWS or GCP.
- Hands-on, demonstrable experience building RAG pipelines or AI Agents from scratch.

Why You’ll Love It Here

- Full-Stack AI Impact: You will get hands-on experience across the entire AI lifecycle, from raw, underlying data pipelines to top-level AI Agent applications.
- Global Scale: You will tackle the industry's toughest challenges using real-world data from top-tier North American retailers.
- Massive Growth Potential: We are a fast-moving, flat-structured startup. Your code, your models, and your ideas will directly shape the product and the industry.
- Snacks and Products: Unlimited access to the latest unreleased snacks, drinks, beauty and more.

AI AI Agents LLM AI/ML Data Engineering SQL ETL Python LangChain LlamaIndex A/B Testing AWS GCP

United States
