MariaDB Corporation is a leading provider of open source database solutions for scalability, high availability, and security. MariaDB Corporation's flagship product, MariaDB, is th
Senior Data Scientist
Location
Bulgaria
Posted
65 days ago
Salary
0
Seniority
Senior
Job Description
Senior Data Scientist
MariaDB
• Apply the right analytical methods-statistical analysis, regression, clustering, forecasting, classification-to solve real business problems at scale • Design and build end-to-end analytical solutions-from data extraction in BigQuery and MariaDB to actionable insights and automated reporting • Develop and maintain data pipelines, analytical frameworks, and tooling using Python and SQL • Use AI-powered development tools (Windsurf, Gemini, Claude, Cursor) to accelerate experimentation, code generation, and analysis • Leverage the latest AI features and tools (LLMs, embeddings, generative AI) to augment and automate analytical workflows • Build lightweight applications and APIs (FastAPI) to put models and insights into the hands of stakeholders • Own the data architecture decisions within your domain-schema design, data modeling, and pipeline reliability • Communicate results, trade-offs, and recommendations clearly to both technical and business audiences • Map and improve business processes through data-driven analysis and process design
Job Requirements
- 7+ years of professional experience in data science, advanced analytics, or a quantitative field
- Expert-level SQL - deep fluency in BigQuery and MariaDB or similar data platforms; you understand query optimization, partitioning, and data modeling
- Strong Python and/or JavaScript skills - production-quality code, not just notebooks; experience with modern data and ML libraries
- Deep understanding of analytical methods - regression, clustering, time series, A/B testing, classification; you know when to apply what and why
- Solid understanding of data architecture - how warehouses, lakes, and pipelines are designed; how to structure data for both analytical and ML workloads
- Proficiency with AI-powered development tools - Windsurf, Gemini, Claude, Cursor, or similar; you leverage AI assistants as a core part of your workflow
- Excellent communication skills - you present complex technical work in a way that drives decisions
- Self-managing and self-sufficient - you set your own priorities, unblock yourself, and deliver without hand-holding
- Relentlessly curious and ambitious - you're always learning, always building, always improving
- A public GitHub profile showcasing your projects, experiments, or contributions
Benefits
- Globally distributed team environment
- Remote or hybrid work options (location-dependent)
- Challenging projects with company-wide impact
- Competitive compensation, 25 days paid annual leave, plus holidays
- A culture that values curiosity, clean engineering, and people who genuinely love what they do
Related Guides
Related Categories
Related Job Pages
More Data Scientist Jobs
Excellent compensation and benefits package available for the right candidate. A leading specialty insurance provider is seeking a Principal Data Scientist who will play a key role in shaping Artificial Intelligence &Â Machine Learning (AI/ML) strategy. The ideal candidate would have 6+ years of data science/predictive analytics experience in P&C insurance, familiarity with big data platforms, and a background in building AI &Â machine learning models, as well as deploying GenAI solutions. Must have expert-level proficiency in Python, strong project management skills, and experience working in an agile team environment. (#58368) Compensation: - A salary range of $150-170K Locations: - Amelia, OH - Hybrid - Remote in EST
• Understand business problems that can be solved with data science using statistical analysis and machine learning models; • Ensure integrity in data collection from all sources for all analyses performed by the data team; • Collect and prepare datasets for analysis; • Perform data exploration, test hypotheses, identify insights and validate them with business stakeholders; • Develop statistical and machine learning models; • Deploy models for consumption by the business; • Produce analyses and reports for business areas; • Contribute business-focused ideas to the department's data science analyses.
• Extract data from primary and secondary sources and conduct data analysis to generate actionable insights • Develop and maintain databases, data systems – reorganize data in a readable format • Design charts and tables that can easily show the key messages presented to different audiences • Filter Data by reviewing reports and performance indicators to identify and correct problems • Assign numerical value to essential business functions so that business performance can be assessed and compared over periods of time • Prepare reports for management: stating trends, patterns, and predictions using relevant data • Work with engineers, designers and project managers to create engaging and insightful business reviews and reports
Bioinformatician (Spatial & Single-Cell)
Deep Science VenturesDeep Science Ventures (DSV) combines scientific research and entrepreneurship to build companies that address global challenges in areas like agriculture, clima
Big Picture Bio is a seed-stage techbio company, backed by DSV, building a computational drug discovery platform that constructs causal biological networks from two primary sources: structured extraction of published experimental literature and large-scale primary human single-cell omics data. Our multi-agent AI system reasons over these networks to generate, simulate, and rank mechanistic hypotheses for combination therapies — with system accuracy verified against top-tier researchers at the Allen Institute. We are initially focused on oncology. The Role (remote, timezone-restricted) You will design and build production bioinformatics pipelines for new modalities—spatial transcriptomics, single-cell proteomics, and spatial proteomics—extending our existing scRNA-seq infrastructure. These pipelines feed directly into an agentic hypothesis generation system: the quality of what goes in determines the quality of every therapeutic hypothesis that comes out. You’ll work closely with our Head of AI & Technology (Dr. Francesco Moramarco) and Head of Platform (Dr. Moustafa Khedr) to: - Build end-to-end pipelines (ingestion, QC, normalization, integration, annotation, differential analysis) - Design modality-specific statistics: spot deconvolution, spatial autocorrelation, ADT normalization, protein-RNA joint embedding, segmentation, spillover correction - Extend hierarchical cell type annotation across modalities - Codify best-practice workflows into reusable templates for agent execution - Sanity-check outputs to catch batch effects and artifacts before they propagate

