Founded in 1967, Capgemini is revered as one of the world's leading consulting, technology, and outsourcing agencies. In 2016 alone, the company reported global
Data Scientist Predictive Analyst
Location
Worldwide
Posted
1 day ago
Salary
0
Seniority
Mid Level
Job Description
Data Scientist Predictive Analyst
Capgemini
Role Description Help support Auto and Home Actuarial team update Base Rate Offset tool into Python. This role requires strong Python skills as well as self-learning skills. - Contributes to the development and implementation of predictive analytics through the application of advanced statistical and analytical techniques in order to deliver data-driven insights supporting business objectives. - Uses appropriate modeling techniques to address business needs. - Utilizes broad knowledge of advanced modeling techniques and procedures to develop new modeling techniques and skills. - Conducts appropriate evaluation of model performance. - Designs and publishes reports to communicate results and track model performance. - Develops various programs including predictor and response variable programs. - Reviews programs to ensure they conform to quality standards. - Supports preparation of internal/external, structured/unstructured data sets to build/rebuild/refresh predictive models. - Creates ad-hoc data analyses, as needed. - Communicates analytics to other modelers as well as to non-technical business partners. - Contributes to the continuous improvement of the modeling process. Qualifications - English Proficiency: Fluent (We work 100% in English) Requirements - Python - Advanced - SQL - Intermediate - Power BI - Intermediate - Excel - Intermediate Benefits - Competitive salary and performance-based bonuses - Comprehensive benefits package - Career development and training opportunities - Flexible work arrangements (remote) - Dynamic and inclusive work culture within a globally renowned group - Private Health and Dental Insurance - Pension Plan - Meals tickets - Life Insurance
Related Guides
Related Categories
Related Job Pages
More Data Scientist Jobs
Senior Associate Scientist, Molecular Data
Terray TherapeuticsChemistry is the key to drug discovery, but chemical data is stuck in the twentieth century. We’re generating precise chemical datasets purpose-built to propel drug discovery into the information age — and we’re doing it on a larger scale and faster than has ever before been possible. Terray Therapeutics is a biotechnology company led by pioneers and long-time leaders in artificial intelligence, synthetic (medicinal) chemistry, biology and preclinical development, automation, and nanotechnology. Chemical datasets generated using our novel ultra-dense microarray technology work seamlessly with our integrated machine learning and computational platform to systematically map biochemical interactions between small molecules and causes of disease. Iterative cycles of virtual molecular design and experimentation power AI and machine learning models, which in turn guide the next cycle of design. With a chemistry engine that measures billions of interactions daily and becomes increasingly precise with every cycle, we can answer an unprecedented array of questions — deriving insights that enable us to predictably create drugs for patients in need.
Role Description Terray is currently seeking a motivated and creative senior associate data scientist. As an integral member of our Computational and Data Sciences (CDS) team, the candidate will be responsible for analyzing and visualizing data from our computational discovery platforms. - Collaborate with project teams in applying platform tools to identify and prioritize hits for follow-up synthesis, testing, and pipeline progression. - Work closely with computational and assay teams to perform hit validation analyses to evaluate how tArray platform binding profiles translate to off-platform assays. - Partner with screening teams to develop and refine target-specific screening strategies. Qualifications - BS/MS in Cheminformatics, Computational Chemistry, data science, or a related scientific discipline with a strong interest in programming and data-driven research. - Understanding of chemical structures, molecular properties, and compound profiling in the context of high-throughput screening and hit identification. - Working knowledge of Python for scientific data analysis, automation, and workflow support, including experience with libraries such as pandas, numpy, scipy, and matplotlib. - Strong expertise in SQL, including building and querying custom tables for internal workflows. - Strong problem-solving and data analysis skills with the ability to communicate scientific insights effectively. - Experience working with large, real-world scientific or laboratory datasets. - Familiarity with Linux environments and version control tools (e.g., Git). Requirements - Exposure to machine learning methods such as clustering, regression, or pattern recognition is a plus. - Exposure to cheminformatics or scientific software tools (e.g., RDKit, Spotfire, MOE, Schrodinger) is a plus. Benefits - Compensation: $120,000 – $182,000 annually, depending on experience. - Participation in the Company’s stock option plan. - 3% retirement safe harbor contribution. - Fully paid health, dental, and vision insurance for employees, spouses, partners, and families. - Above-market life insurance and disability coverage. - Additional benefits to explore during the offer process.
Bolsista Mestre – Cientista de Dados, NLP, Sistemas de Busca Semântica
Sistema FibraPelo Futuro da Indústria | Pelo Futuro do Trabalho
• Pré-processamento e enriquecimento de textos: Limpeza, tokenização, lematização e remoção de ruídos em documentos textuais. • Geração e gestão de embeddings: Criar embeddings de documentos e consultas usando modelos como Sentence‑BERT, OpenAI ada ou similares. • Construção de pipelines de busca semântica, recuperação de informação e aplicações com RAG: Desenvolver pipelines que combinam: consulta → embedding → busca vetorial → re-ranking (opcional) → uso do contexto recuperado em aplicações com RAG. • Adaptação e experimentação com modelos em PyTorch ou TensorFlow: Utilizar modelos pré-treinados e adaptá-los para tarefas específicas. • Documentação e versionamento: Documentar pipelines, decisões técnicas e resultados de experimentos.
Head of Data
DLG (Digital Luxury Group)Luxury is in our DNA. We are the leading strategic partner for luxury & lifestyle brands.
• Own the data lifecycle from ingestion to delivery • Build and maintain crawlers, API integrations, and vendor data feeds • Monitor collection reliability and onboard new data sources • Manage the full data pipeline from raw to production • Implement automated QA at every stage • Maintain schema integrity in BigQuery and PostgreSQL • Ensure downstream consumers receive consistent, documented datasets • Document every transformation rule and edge case • Contribute to building a data product that external customers pay for
• Architect and ship production-grade LLM applications • Design RAG pipelines that actually work at scale • Collaborate closely with product and engineering to own problems end-to-end • Design, build, and deploy AI models, tools, and agents • Partner with the Head of Data & AI to define the technical architecture • Implement retrieval augmented generation (RAG) pipelines • Design and implement Model Context Protocol (MCP) integrations • Build and maintain MCP servers • Establish patterns for MCP host/client configuration, access control, and observability • Implement MLOps and LLMOps practices • Collaborate with data engineers and platform teams




