Mathew (Dr.) Divine

Data Science Expert and AI Strategist

Schlangenbad, Germany

Experience

Jan 2024 - Present
1 year 7 months

Data Science Expert and AI Strategist

Freelancer

I specialize in designing and deploying scalable machine learning solutions for organizations aiming to boost operational efficiency and profitability. My expertise spans from strategic consulting to hands-on MLOps, leveraging modern ML frameworks, Docker, and AWS to deliver automated, cloud-native pipelines that drive tangible impact.

  • Strategy consulting and requirements gathering.
  • Coaching and upskilling team members.

Technologies: Python, CI/CD, AWS, Docker, SQL, genAI

Jan 2024 - Dec 2025
1 year

Data Science Expert and AI Strategist

Freelancer

API for Smart Search for EU-Tenders Platform

  • Built an API to ingest, clean, translate, and index EU tenders documents in Neo4j, enabling hybrid search with RAG and Cypher queries via a Streamlit dashboard.

  • Deployed the API on AWS Lightsail container services with CI/CD automation via GitHub Actions, ensuring stability through pytest unit and integration tests.

  • Requirements gathering and project management.

  • MLOps strategy execution.

Technologies: Neo4j, LangChain, GitHub Actions, Python, AWS LightSail, Docker

Creator of Online Content for a Course on Data Analysis in ChatGPT

  • Designed and developed a comprehensive online course to teach data analysis using ChatGPT, aimed at professionals and learners seeking practical, hands-on AI skills.

  • Created engaging instructional videos complemented by interactive Jupyter notebooks to teach concepts ranging from foundational data analysis principles to best practices in prompt design.

  • Utilized professional recording tools, including Open Broadcast Software and audio equipment, ensuring high-quality video and audio content.

  • Curriculum design and instructional delivery.

  • Data visualization and generative AI prompt design.

Technologies: ChatGPT, Open Broadcast Software (OBS), Jupyter Notebooks

Project Lead of CRM Cleaning and Development of AI-first Web-Scraping Technology

  • Led CRM data normalization and cleaning project, with multiple steps visualized as a data scorecard flow using a Sankey Diagram, aiding customer understanding.

  • Implemented and validated a genAI-first web crawling strategy, outlining data quality challenges and ensuring high quality and scalability on AWS.

  • Project management and requirements gathering.

  • Data visualization and LLM prompt prototyping.

Technologies: AWS-Sagemaker, Python, LangChain, AWS Lambda, Snowflake

Aug 2021 - Dec 2023
2 years 5 months

Head of Digital Lab (CD&O)

Boehringer-Ingelheim

Digital Execution Strategy Development

  • Led an 8-person global cross-functional team, embedding MLOps best practices to streamline AI-driven cloud deployment and operational execution.

  • Developed end-to-end cloud-based ML workflows, aligning business objectives with technical strategy to reduce time-to-market for digital transformation initiatives.

  • Established processes for empowered decision-making, driving agile execution across AI product development.

  • Upskilled team on design thinking methodologies, AI strategy, and qualitative user research.

  • Team leadership and strategic planning.

  • Cloud-native AI development and agile methodologies.

Technologies: Jira, Confluence, Mural, AWS, CI/CD

Digital Transformation in Medical Writing

  • Conducted process mapping and qualitative analysis to define AI-driven business use cases.

  • Led generative AI adoption by medical writers and conducted PoCs on LLM-driven document generation, redaction, and translation.

  • Performed market scouting and vendor analysis, streamlining RfP processes.

  • Technical leadership and cross-functional facilitation.

  • genAI and Azure Cognitive Services technologies.

Lay Document Generation Using RAG System

  • Advised Data Scientists and Medical Writers on LLM prompt engineering and AI-powered document structuring to enhance faster content generation.

  • Co-deployed a cloud-native prototype using Streamlit for AI-assisted content creation for the lay language protocol.

  • Technical leadership and product innovation.

  • RAG architecture implementation.

Technologies: Python, Llama Index, Azure Cognitive Services, OpenShift, Jenkins

Product Owner of Analytics Landing Page

  • Served as Product Owner for an AI-driven analytics dashboard integrating recommendation systems and LLM-based cognitive search for real-time insights.

  • Defined product vision, roadmap, and backlog grooming.

  • Managed dependencies and aligned stakeholders for enterprise-wide data-driven decision-making.

  • Product ownership and stakeholder management.

  • AI model deployment and cloud-based API integration.

Technologies: LLMs, Azure Cognitive Services, AWS, Jira, Confluence, Git

Jan 2020 - May 2021
5 months

Interim Head of Data Science

BI X GmbH

  • Responsible for coaching and development of a self-organized team of 9 data scientists, ensuring production-ready ML solutions across multiple initiatives.

  • Provided coaching to Product Owners and worked with BI X leadership during weekly calls.

  • Responsible for staffing new initiatives and internal hiring.

  • Strategic coordination and team leadership.

  • Adhering to budgets and timelines.

Technologies: Jira, AWS, Atlassian, Git, Python, Machine Learning

Jan 2018 - Jul 2021
3 years 7 months

(Senior) Data Scientist

BI X GmbH

Development of Bioinformatics Pipelines for Genome Assembly Algorithms

  • Led a cross-functional team to develop bioinformatics pipelines on AWS cloud services.
  • Focused on creating ultra-fast genome assembly algorithms.
  • Utilized AWS CDK for infrastructure management.

Technologies: AWS Batch, AWS IAM, AWS Step Functions, AWS ECS, AWS CodePipeline, Python, Docker

Clinical Data Catalogue

  • Developed ETL processes in Python to produce a clinical data catalogue integrating historical and in-house data.
  • Implemented semantic search capabilities using advanced NLP pipelines.

Technologies: Python, Jupyter Notebooks, Docker, SPARQL, SQL, Stardog, PostgreSQL

Interdisciplinary Hackathon Creating Gamified NLP Annotation in a Web App

  • Led a 2-day hackathon to gamify NLP annotation for biomedical researchers.
  • Developed a game to validate NLP entity linking models.

Conceptualization of Bespoke Knowledge Management System Including PoC

  • Conducted qualitative user research and led design sprints to produce advanced NLP-enhanced prototypes.

Technologies: Python, TensorFlow, Elasticsearch

Jan 2018 - Dec 2019
1 year

Data Scientist

BI X GmbH

Bespoke Drug Discovery and Market Intelligence Web App

  • Led development of a web application for drug discovery and market intelligence.
  • Participated in user interviews alongside UX researchers.

Technologies: Postgres, Neo4j, Python, TensorFlow, OpenShift, Airflow

Apr 2016 - Dec 2017
1 year 9 months

Research Scientist, Radiogenomics

Eberhard Karls University Tübingen

Curation and Reporting of High-Dimensional Datasets

  • Collaborated across departments to collect and analyze high-dimensional datasets from an observational clinical trial in hepatocellular carcinoma patients.
  • Created interactive reports for treating physicians.

Technologies: Python, DICOM, TensorFlow, Seaborn

Clinical Reporting Strategy of Genetic Variations

  • Created genetic variation reports to support oncological decision-making.

Technologies: REST API, Docker

Using Parallel Coordinates to Visualize Neural Networks

  • Developed methods for neural network architecture visualization, aiding hyper-parameter estimation.

Technologies: TensorFlow

Mar 2010 - Mar 2016
6 years 1 month

Research Fellow, Oncology

Eberhard Karls University Tübingen

Development of Gaussian Mixture Modeling Pipeline

  • Developed a pipeline integrating PET and MRI data for tumor tissue segmentation.

Multi-parametric MRI Tumor Segmentation Study

  • Developed an unsupervised segmentation method validated with histological samples.

Spectral Clustering for PET Studies

  • Implemented spectral clustering techniques for tumor heterogeneity assessment.

Technologies: MATLAB, PET, MRI

Summary

I am a seasoned Data Scientist with 15+ years of experience in machine learning and data science, including seven years specializing in cloud-native development and the productionization of AI solutions. My expertise spans MLOps and DevOps, designing scalable, automated, and cost-efficient machine learning pipelines in cloud environments using tools like Terraform, Docker, and CI/CD frameworks.

With a strong background in AI strategy, design thinking facilitation, and user research, I bridge technical innovation with business impact—driving operational efficiencies and cost savings. I excel at coaching teams in MLOps best practices, fostering cross-functional collaboration, and delivering measurable results in fast-paced, agile environments.

Fluent in German and English, I am committed to leveraging cloud-native AI to solve complex challenges and accelerate digital transformation.

Languages

English
Native
German
Advanced

Education

Mar 2010 - Mar 2016

Werner Siemens Imaging Center, Eberhard Karls University Tübingen

Doctorate, Dr. sc. hum. · Molecular Imaging · Tübingen, Germany

Oct 2007 - Feb 2010

Rheinisch-Westfälische Technische Hochschule Aachen

Master of Science · Biomedical Engineering · Aachen, Germany

Sep 2002 - May 2007

Midwestern State University

Bachelor of Science · Physics and Mathematics · United States