Analytica logo
Analytica

Data-driven consulting and technology services

Data Scientist – NLP

Data ScientistData ScientistFull TimeRemoteSeniorTeam 51-200H1B No SponsorCompany SiteLinkedIn

Location

United States

Posted

2 days ago

Salary

0

Seniority

Senior

Job Description

Data Scientist – NLP

Analytica

• Pre-processing - Demonstrate the skills and experience to collect, clean, and prepare data sets for input into a computational model using Python • Strong candidates will explain various methods you have applied using common pre-processing functions such as stop word removal, stemming, lemmatization, and tokenization • Feature Engineering and Attribute Evaluation - Candidate must demonstrate experience with NLP feature engineering methods such as TF-IDF, word2vec, GloVe, and FastText identifying the key determinants for modeling that exist in the business process and within existing data sets as well as selecting evaluation protocols (model techniques) • Modeling - Candidates will have practiced skills and experience selecting classification modeling techniques to fit the business problem. Examples will include techniques such as machine learning (ML) supervised and unsupervised learning, regression, neural networks and deep learning, natural language processing, etc. • Validation - Strong candidates will describe their experience with investigating, reporting, and justifying model results • Visualization- Experience in presenting the results of their modeling activities, depicting the insights realized, and explaining the relevance of their results to the organization’s business challenges

Job Requirements

  • Master's degree required, and PhD preferred in Statistics, Mathematics, Computer Science, or similar
  • High degree of experience utilizing SAS, R, or Python to support NLP use cases such as Document Summarization, Named Entity Recognition, Sentiment Analysis, and/or Topic Modeling
  • At least four years of experience developing scalable, production-ready NLP solutions using sci-kit learn, Keras, TensorFlow, PyTorch, Spark NLP
  • Experience using git/github to version control source code
  • Experience leveraging transformer architecture to develop NLP models
  • Experience with open source NLP packages such as Gensim, SpaCy, or NLTK
  • Experience with BERT, GPT-J, RoBERTa, T5 or other transformers
  • Experience with GenAI and Prompt Engineering is a plus
  • Experience in Databricks and MLFlow is a plus
  • Experience with machine translation and transcription of foreign language documents using Microsoft Azure translation services is a plus
  • Experience working in an AWS cloud environment and with related AWS services such as Bedrock and Textract
  • Experience coordinating and maintaining user stories
  • Must be a US citizen
  • Must be able to obtain and maintain a Public trust security clearance

Benefits

  • Competitive compensation with opportunities for bonuses
  • Employer paid health care
  • Training and development funds
  • 401k match

Related Categories

Related Job Pages

More Data Scientist Jobs

Full TimeRemoteTeam 1-10H1B No Sponsor

• Advance Root’s performance marketing capabilities through improvements in bidding systems, automation, and applied machine learning • Lead and develop a multidisciplinary team responsible for the deployment, optimization, and performance of a nine-figure annual marketing budget • Drive fast, disciplined experimentation across channels and partners • Develop strategic relationships with marketing and technology partners • Align priorities across Marketing, Analytics, Engineering, and Product

United States
$250K - $300K / year
Full TimeRemoteTeam 10,001+Since 1961H1B Sponsor

• Serve as the primary finance lead for assigned markets/focus areas • Own end‑to‑end financial support for IDN and PCP risk negotiations • Lead complex scenario analysis and large‑scale model adjustments • Translate enterprise models into market‑specific financial narratives • Partner with market presidents, contracting, and network strategy leaders • Set analytic priorities and manage multi‑deal demand • Provide direction, review, and coaching to support analysts and associates • Represent finance in market‑level strategy discussions and escalations

United States
$104K - $143K / year
Full TimeRemoteTeam 10,001+Since 1990H1B No Sponsor

• Leveraging advanced analytical techniques and machine learning models to derive actionable insights from complex data sets. • Designing and implementing advanced data models and algorithms to solve complex business problems and drive strategic insights. • Conducting in-depth analyses of large and diverse data sets to uncover patterns, trends, and correlations that inform decision-making. • Collaborating with cross-functional teams to understand business needs and translate them into analytical solutions. • Developing and deploying machine learning models to enhance predictive capabilities and optimize business processes. • Providing thought leadership on data science best practices and emerging technologies to drive innovation and continuous improvement.

Pennsylvania
DICK'S Sporting Goods logo

Senior Data Scientist, Search & Recommendations

DICK'S Sporting Goods

Headquartered in Coraopolis, Pennsylvania, DICK’S Sporting Goods offers sports fans and enthusiasts a “big store” selection of name-brand sports equipment

Data Scientist2 days ago

• Design and implement machine learning models for search and recommendation systems, including ranking, retrieval, personalization, and query understanding • Build ranking and recommendation models using user behavior, embeddings, content signals, and contextual features • Develop personalization systems that tailor results based on user behavior, preferences, and contextual signals • Collaborate with data and search engineers to build scalable data pipelines supporting search and recommendation systems • Partner with software engineers to integrate ML models into production services via APIs • Design and execute A/B tests to evaluate model performance and business impact • Monitor offline and online metrics to identify opportunities for improving relevance, ranking, and engagement • Apply modern ML and GenAI techniques to improve search and discovery experiences • Contribute to best practices in modeling, experimentation, and production ML systems

United States
$83K - $138.2K / year