AI Architect and Senior Data Scientist for multimodal LLM agent and AI co-pilot, Joule on top of the SAP Analytics Cloud
SAP
Design and development of new features in the Just Ask capability of Joule on top of the SAP Analytics Cloud. Joule is SAP’s natural-language, generative multimodal AI copilot and conversational AI agent that understands industry- and business-specific language. For example, it lets users visualize data in diagrams by querying in natural language instead of SQL.
Design and development of a validation and test system for the generative AI Just Ask capability.
Behaviour-Driven Development (BDD) and testing with Cucumber.
Performance optimization and testing with JMeter.
Automation of the release pipelines with Git, Jenkins and Groovy.
AI Architect and Senior ML Engineer for LLM assistants
Sanitas AG - Health Insurer
Designed, developed and productized several LLM assistants for different internal stakeholder groups including client advisors, health care advisors and the IT department at Sanitas - one of the largest health insurers in Switzerland.
Took over a PoC of a RAG-based LLM assistant for client advisors as a pilot project from the data science team, designed its production architecture and developed it further up to production.
Designed and developed a highly object-oriented base RAG-LLM-architecture which allows for an efficient implementation of specific RAG-based LLM assistants. Integrated the RAG-LLM-architecture into the existing micro-service-based IT architecture, partially on MS Azure and GCP.
Integration of various LLM models such as Azure’s OpenAI GPT-collection and Google’s Gemini collection.
Developed the necessary data fabric architecture including data APIs, data pipelines and vector-search database with PG Vector in GCP - achieved a considerable speed-up in embeddings retrieval compared to the PoC.
Extended and optimized the underlying FastAPI app, vector-search and overall performance of the LLM assistants.
Improved prompt engineering for LLM models such as Azure’s OpenAI GPT-collection and Google’s Gemini-collection.
Design of a common base CI/CD GitLab pipeline as well as specific GitLab pipelines for all GitLab projects related to LLM assistants and vector-search.
Integration of the LLM assistants in Teams, internal SanitasGPT as well as Slack.
Tested the LLM assistant for client advisors with a selected team of internal client advisors and integrated the results in the further development.
Supported the devops team during the roll-out of the LLM assistant for client advisors across different locations in Switzerland.
Developed and deployed a devops support LLM assistant for the IT department at Sanitas in Slack.
Developed a PoC for an LLM assistant to support the Sanitas’ health advisor team.
Technologies: Python, FastAPI, Pydantic, LangChain, Azure AI Services, Azure OpenAI GPT-collection, Google Gemini-collection, Google Vector Search, PG-vector, Vertex AI, further GCP, DBeaver, SQL, Microsoft Bot Restful API, Terraform, Kubernetes, Docker.
Sep 2023 - Dec 2023
4 months
Munich, Germany
Principal Data Scientist, AI Architect and Product Manager for Content Personalization with Reinforcement Learning
E.ON
Led the entire product management for several content personalization projects based on Reinforcement Learning.
Communicated with all involved stakeholders such as front end design, web analytics, data engineering, data protection, campaign management, data science, MLOps, etc.
Collected all relevant data across various data sources at E.ON and collaborated with the data protection team for approval.
Designed an overall data and AI architecture for Reinforcement Learning services at E.ON as well as its integration in the existing data and AI platform.
Designed the necessary data pipelines for online data with Apache Kafka and offline data with SQL.
Developed several PoCs and an MVP for personalized next best actions on the homepage for existing customers based on an RL Contextual Bandits approach.
Trained the data science team and handed the MVP over for productization in Microsoft Azure and A/B testing with Adobe Target.
AI & QC Tech Expert and Business Development Consultant for Extending the In-House Cluster Manager ParaStation Software Suite
ParTec AG
Led the business case study as project manager for extending the in-house cluster manager ParaStation Software Suite to hybrid multi-cloud data centers with HPC, AI, QC, IoT and business applications in industry. “ParaStation Software Suite” is a highly successful cluster manager with QC integration, which powers leading European supercomputing centers such as Jülich in Germany and Meluxina in Luxembourg.
Worked directly with the C-level suite and the board of ParTec AG.
Represented ParTec AG at the Supercomputing, ML and QC conference ISC 2023 in Hamburg.
Identified potential customers, their current HPC, AI, QC, IoT and business applications, cloud strategies and the resulting requirements for cluster management with ParaStation Software Suite.
Held meetings with potential customers such as the German Weather Services (DWD) and the German Climate Computing Center (DKRZ) as well as with cloud providers such as AWS.
Performed competitor analysis and derived USPs.
Scoped the technical requirements including changes, new feature development, interfaces, packaging and live demo of ParaStation Software Suite. Scoped the hardware and software requirements for the unique feature of ParaStation Software Suite for high-speed MPI-based HPC, AI and QC computing across on-premise and cloud data centers.
Selected the most suitable cloud services from MS Azure, Google Cloud and AWS, e.g. regarding data transfer, HPC, AI and QC services.
Wrote the technical specification for the extension of the existing cluster manager ParaStation Software Suite.
Defined related professional services for the extended ParaStation Software Suite.
Technologies: ParaStation Software Suite; various cluster managers including NVIDIA Bright, HPE, Atos, etc.; HPC schedulers such as Slurm, Altair PBSPro, etc.; Kubernetes, various services from MS Azure, GCP, AWS concerning data transfer, HPC, Ansible Playbooks, etc.; distributed communication libs including MPI, CUDA, Gloo and RPC; distributed versions of TensorFlow and PyTorch; AI-accelerated numerical simulation tools such as NVIDIA SimNet; commercial multi-physics solvers such as Comsol, Ansys, etc.; visualization software such as ParaView, Catalyst, VisIt, ADIOS2; various QC libs such as in-house HPC quantum integration software QBridge, Qiskit, etc.
Jun 2022 - Dec 2022
7 months
Berlin, Germany
Interim Data Science Team Lead & AI Architect at a European E-commerce Store
Flaconi
Led the Flaconi Data Science team with up to 7 team members including data scientists and machine learning engineers; hired one data scientist.
Oversaw a portfolio of more than 30 machine learning models in production used by internal stakeholders. The portfolio comprised state-of-the-art demand forecasts, scenario planning models, recommender systems, customer behaviour analytics such as CLV & churn models as well as NLP models including chatbots & NERs.
Held technical responsibility for the entire ML tech stack including MLflow, Databricks, Delta Lake, GitHub Workflows, Airflow, AWS, etc. Further introduced and improved software engineering best practices.
Responsible for the overall Big Data and AI architecture in AWS.
Agile communication and requirements engineering with all involved stakeholders and shareholders of Flaconi.
Reported directly to the co-CEO for several months until the new VP joined.
Development of a new overall recommender strategy across different page types and placements on the entire Flaconi website together with stakeholders and shareholders. Implemented the first milestone together with my team.
Further performance improvement and extensions of the entire demand forecast & scenario planning suite as well as customer analytics models.
Senior Data Scientist - Performance optimization of existing models and implementation of new models in the demand forecast and
Flaconi
Improved the performance of an existing scenario planning tool based on CatBoost and a selection of regression methods for extrapolation by more than 50 % to a MAPE of 5-15 %.
Improved the performance of an existing daily demand forecast for over 70’000 products based on AWS DeepAR as well as further forecast and extrapolation methods for its features by more than 40 %.
Improved the performance of an existing weekly demand forecast for over 70’000 products based on AWS DeepAR as well as further forecast and extrapolation methods for its features by more than 30 %.
Developed a new weekly demand forecast for over 70’000 products based on AWS DeepAR as well as further forecast and extrapolation methods for its features.
Introduced software engineering best practices such as object-oriented programming, logging, exception handling, testing and monitoring. Implemented best practices together with the ML Engineer.
Implementation of related data pipelines in Databricks.
Technologies: Python 3, AWS DeepAR, Google Fusion Transformer, statsmodels, CatBoost, TensorFlow, TensorFlow Recommenders, MLflow, PyTest, FastAPI, GitHub Workflows, Databricks, Delta Lakehouse, Terraform, AWS SageMaker and related AWS services, Apache Airflow, Snowflake, Exasol, Power BI.
Jan 2021 - Mar 2022
3 months
Munich, Germany
Senior Data Scientist and AI architect - Reinforcement and Deep Learning Recommender System PoCs for Next Best Actions & Next
Telefonica
Collected, researched and evaluated different approaches to recommender systems for Next Best Action & Next Best Offer.
The research included state-of-the-art Deep Learning, Reinforcement Learning including contextual bandits as well as combined approaches based on TensorFlow Recommenders, Microsoft Recommenders Collection as well as Reinforcement Learning libraries such as Vowpal Wabbit and TensorFlow Agents.
Design of base data & AI architecture for RL models at Telefonica including databases and data pipelines.
Implemented several PoCs in Python and deployed them in AWS with Docker & Kubernetes.
Technologies: Python 3, HuggingFace, Keras, TensorFlow, TensorFlow Recommenders, Microsoft Recommenders Collection, Spark ML, Reinforcement Learning libraries such as Vowpal Wabbit, TensorFlow Agents, Terraform, Docker, Kubernetes, AWS, PyTest, etc.
Jul 2021 - Oct 2021
4 months
Köniz, Switzerland
Evaluation project for an AI-component in the human resources software ”Peerdom”
Nothing AG
Design and evaluation of the business case, communication and workshops with all involved stakeholder groups
Design of individual machine learning use cases
Choice of suitable machine learning algorithms
Consulting on potential ethical issues as well as their avoidance or mitigation also from a technical perspective
Evaluation of suitability of Google Cloud and Google Vertex AI architectures for the Peerdom ”AI”-component
Technologies: Python 3, TensorFlow, Scikit Learn, XGBoost, Google Cloud, Google Vertex AI, etc.
May 2021 - Present
4 years 2 months
Klosters-Serneus, Switzerland
Business Mentor
Rolemodel Rebels
Mentor female students and professionals in advancing their careers, particularly as aspiring tech entrepreneurs.
Mar 2021 - Present
4 years 4 months
Zürich, Switzerland
Lecturer in Artificial Intelligence
HWZ Zurich University of Applied Sciences in Business Administration
Teach business and tech leaders in various MBA and CAS programs, covering topics such as machine learning and generative AI in finance and controlling, AI in digital ethics, AI-driven business models and operations, operationalizing AI ethics, AI regulations, and auditing AI systems. My focus is on equipping participants with the knowledge and tools needed to integrate AI solutions effectively and responsibly into their organizations.
Mar 2021 - Jun 2021
4 months
Marina del Rey, United States
Machine Learning Team Lead & Architect for the ‘privacy-by-design‘ AWS cloud architecture and the large-scale machine learning
Opensesame Media Inc.
Team lead of a team with 3 data scientists and data engineers
Led and co-developed a global ‘privacy-by-design‘ GDPR-compliant AWS cloud architecture for a large-scale machine learning backend and various recommender and generative audio models
Responsible for overall strategy, roadmap and planning
Communication with all involved stakeholder groups
Design and development lead for 3 different deep learning-based recommender systems and experimentation with various generative audio models
Co-design of a multi-account strategy together with the app backend lead implemented with AWS Control Tower
Design and development of custom MLOps templates for cross-account auto-scaling realtime and batch deployment of AWS SageMaker models based on AWS SageMaker Pipelines, AWS CodePipeline, AWS Service Catalog and AWS CloudFormation
Cross-account deployment of various AWS SageMaker models
Design and development of custom CodePipelines for cross-account deployment of Glue jobs based on AWS Glue, AWS CodePipeline, AWS Service Catalog and AWS CloudFormation
Cross-account deployment of various Glue jobs
Design of a GDPR-compliant data lakehouse (data warehouse and data lake) with AWS Redshift and AWS S3 and scripted deployment with AWS CloudFormation and AWS SAM
Scripted multi-account deployment of further machine learning-backend relevant AWS services with AWS SAM
Utilized TensorFlow Lite to deliver TinyML models for the app team, enabling efficient deployment on mobile devices
Contribute my expertise in AI algorithms, AI ethics and AI management to help develop an Independent Audit of AI Systems for ForHumanity - a crowd-sourced non-profit organization.
Jan 2020 - Aug 2024
3 years 8 months
Lübeck, Germany
Advisory Board Member for Responsible AI
Key2Be.Me - Berufsorientierung im Netzwerk
Consulted KEY2BE.ME on the use of responsible AI from a strategic, technical as well as ethical perspective for equal opportunities in career orientation.
Jan 2020 - Feb 2021
2 months
Mannheim, Germany
Evaluation and PoC of a Deep/Machine Learning-based trend analysis and forecasting solution to optimize the sales strategy of
Jonastone GmbH & Co. KG
Development of the business case together with the managing directors and Head of IT
Design and development of a Social Media API client and data collection PoC
Design and development of a deep/machine-learning based trend analysis and forecasting PoC
Development and evaluation of different image clustering algorithms and dimensionality reduction algorithms on top of the deep learning architecture VGG16 and further hand-crafted feature vectors
Development and evaluation of different indoor scene segmentation and floor detection approaches based on the latest object detection, semantic segmentation, instance segmentation and panoptic segmentation deep learning algorithms for 2D images.
Sentiment analysis of social media posts and comments in different European languages
Design of several alternative AWS cloud architectures based on AWS RDS with PostgreSQL, AWS CodePipeline, AWS Glue, AWS SageMaker and AWS Deep Learning AMIs
Cost estimation and technical design of the overall solution
Coaching of the Head of IT concerning machine-learning and AWS cloud architectures
Consulting on strategic, technical and ethical aspects of an AI-based hiring solution
Empact Consulting
Oct 2020 - Nov 2020
2 months
Düsseldorf, Germany
Deep/Machine Learning-Based Multi-Variate Time Series Analysis and Forecasting for Retail Price Optimization of Energy Solutions
Bitwatt Systems GmbH
Application of various sophisticated techniques to analyse and decompose retail price multi-variate time series for different markets and demand groups
Creation of various visuals to present the results of the analysis and insights for price optimization to the customer
One-step-ahead and multi-horizon forecasting PoC for multi-variate time series based on XGBoost models and several deep-learning architectures (stacked LSTMs, WaveNet, Transformers, ...) depending on performance for different markets
Consulting on Evaluation & Validation of an AI-Solution for Story Telling
Startup in Stealth Mode
Aug 2020 - Sep 2020
2 months
Maur, Switzerland
Data Transfer and Analytics Software for Avaya Communication Data
Stahlschmidt ICT Solutions
Design and development of Python software for exchanging data with Avaya (based on a specific protocol) and for analysing the communication data obtained from Avaya
Advisory Board Member of the Entrepreneurship Group
Healthcare Businesswomen’s Association
Advised various HBA members on entrepreneurship.
Oct 2018 - Aug 2019
11 months
Baden, Switzerland
Optimization and Parallelization of Two Autonomous Machine-Learning based Energy Trading Solutions
Axpo Holding AG
Pre-evaluated a C++ and a C# energy trading software for a combined OpenMP and multi-GPU parallelization
Consulted the team on which GPU-hardware to buy for desktop computers and on-premise servers
Coached the team in optimization and parallelization best practices of linear programming, optimization algorithms, machine learning algorithms and linear algebra routines for energy trading
Coached the team in designing architectures for highly optimized and parallel energy trading software
Introduced the design and planning to various stakeholders at Axpo and successfully secured internal funding
Complete refactoring, improved hyperparameterization and optimization of the single-threaded C++-code and the OpenMP-multithreaded C++-code
Design and implementation of a CUDA-parallelization of the C++-software for multiple GPUs
Design and implementation of a combined MultiGPU + OpenMP-multithreaded version of the C++-software
Design and implementation of a CUDA-parallelization of an optimization algorithm in the C#-energy trading software based on CUDAmanaged
Unit, integration and system tests including memory and speed tests
Speed evaluation for OpenMP, GPU as well as combined parallelization for various power plants
Lead an award-winning company that offers technical, ethical, and strategic consulting services at the intersection of AI, Quantum Computing, and High-Performance Computing.
Provide consulting services in AI architecture, Data Science, ML Engineering, AI strategy, AI management, training and educational workshops, designing AI algorithms and numerical methods, and developing highly scalable distributed software solutions.
Work with freelancers and tech partners to provide customers with full end-to-end software solutions.
AI Architect & Senior ML Engineer for the Just Ask capability of Joule - an AI copilot and LLM agent for the Analytics Cloud at SAP.
AI Architect & Senior ML Engineer for LLM agents supporting several internal stakeholder groups at the Swiss Health Insurer Sanitas.
Interim Data Science Team Lead & AI Architect at the European E-commerce store Flaconi, Berlin, and responsible for over 30 ML services.
AI Architect & Senior Data Scientist for a deep reinforcement learning service, Telefonica, München.
Data Science Team Lead for the privacy-by-design audio AI backend SyncStage in AWS, OpenSesame Media Inc., California.
Image processing algorithms for an AI-driven autonomous flight control software including GPU-parallelization, Daedalean, Zürich.
Optimization of two novel AI-based automatic energy trading solutions including MultiGPU + OpenMP parallelization, AXPO, Baden.
Aug 2017 - Present
7 years 11 months
Zürich, Switzerland
Workshops and Courses in Artificial Intelligence and Distributed Ledger Technologies
Sciform GmbH
Various workshops and courses for business executives, lay people, private customers, companies and other organisations in Artificial Intelligence and Distributed Ledger Technologies
MesoPhys - In-house Simulation Software for Microfluidics and Pharmaceutical Applications
Customers of Sciform
Development of a Finite Element Method for Stochastic Navier-Stokes-Equations for meso-scale fluid flow
Design and implementation of a large-scale massively parallel High-Performance Computing architecture (C++, CUDA, MPI)
Unit, memory and speed tests
Setting up a High-Performance Computing (HPC) Cluster (MPI+GPU parallelism) for large-scale simulations in AWS (EC2) and running simulations
Award (HPC resources worth 20000 EUR) received from Arctur HPC in Feb 2018 for the excellent optimization and parallelization of MesoPhys
Running simulations on the HPC clusters of Arctur
Validation of the Finite Element Method and the software against experiments and for real-life applications
Optimization and Parallelization of Image Processing and Optimization Algorithms in an AI-based Autonomous Flight Control
Daedalean AG
Preparing the installation process of the entire AI-based flight control software on NVIDIA Jetson (camera system with NVIDIA GPU for drones and planes) in Bazel including software changes
Optimization of non-parallelized and multi-threaded implementations of several image processing and optimization algorithms
GPU-Parallelization in CUDA of the same algorithms for real-time application
Unit, memory and speed tests
Coaching the responsible people in the team about how to maintain the CUDA-code on GPUs
Lead Data Scientist and Senior Software Engineer (Mission Owner)
Adnovum AG
Technical lead and architect for the first AI-based, adaptive context-aware, continuous authentication & risk-detection component “nevisDetect” for the NEVIS security suite. NEVIS including “nevisDetect” is deployed in many financial institutions and government offices in Switzerland and other countries to prevent sophisticated malware attacks in a highly automated way. (Since 2020, NEVIS has been continued by NEVIS Security AG.)
Design of the context-aware authentication tool of nevisDetect for the evaluation of client-side and network layer data based on rules.
Design of the continuous authentication tool of nevisDetect for the evaluation of client-side and network layer data based on several advanced machine/deep-learning and especially anomaly detection algorithms.
Communication, collaboration and project scoping with various stakeholders including the board, C-suite, customers, malware & network experts as well as IT security scientists from ETH Zürich.
Technologies: Python, TensorFlow, PySpark, Java, Apache Spark, Apache Kafka, Apache Storm, various other libraries from the Apache Spark Ecosystem, Couchbase, Cassandra, Git, REST-APIs, Linux, Docker.
Jul 2014 - Oct 2015
1 year 4 months
Winterthur, Switzerland
Computational Scientist and Senior Software Engineer
Fluxim AG, ZHAW
Technical lead for the design and development of LAOSS - a novel software for large-area semiconductor device simulation, which, following its successful launch, is being used worldwide in the OLED and solar cell industries.
Collaboration and project scoping with customers as well as OLED and solar cell scientists from academia and industry.
Numerical development of a Finite Element Method for the simulation of large-area OLEDs and solar cells.
Design of a parallel HPC architecture for LAOSS and implementation of the numerics core.
Design and development of a Mie Scattering Module in SETFOS.
Contributed to several successful KTI and EU grants.
Jan 2014 - Present
11 years 6 months
Zürich, Switzerland
Fine Art Photographer
Fine Arts - Ursula Maria Mayer
Exhibitions of my fine art photography in Switzerland and abroad.
Jun 2011 - Dec 2014
2 years 7 months
Zürich, Switzerland
Research Computational Scientist
Institute of Environmental Engineering (IFU) at ETH Zurich
Design, development and extension of the MPI & GPU-parallelized numerics core for the water resources management software “FreshWaterSupply” in collaboration with “4Dimensional GmbH” (see below). “FreshWaterSupply” is used by many decision makers and water authorities worldwide to improve availability and sustainability of drinking water.
Numerical extension of Finite Volume and Finite Element Methods for statistical modelling and multi-parameter optimization in groundwater flow, contaminant transport and multiphase flow for sustainable water resources management.
Contributed to several successful EU and World Bank grants to provide funding for the research project.
Dec 2007 - May 2011
3 years 6 months
Munich, Germany
Research Scientist and Software Engineer
International Graduate School of CSE, TU München
Numerical development of novel FSI-related Finite Element Methods for nano/micro-scale material simulations.
Implementation in a massively parallel multiphysics HPC-code.
Jul 2007 - Sep 2006
-1 years -9 months
Braunschweig, Germany
Research Assistant and Software Engineer
German Aerospace Center (DLR)
Numerical development of a novel semi-implicit time-integration scheme for CFD aerodynamics simulations.
Implementation in the massively parallel DLR tau-code, which is widely used in the European aerospace industry e.g. by Airbus.
Oct 2006 - Sep 2007
1 year
Zürich, Switzerland
Research Computational Scientist
Institute of Environmental Engineering (IFU) at ETH Zurich
Design, development and extension of the MPI & GPU-parallelized numerics core for the water resources management software “FreshWaterSupply” in collaboration with “4Dimensional GmbH” (see professional experience). “FreshWaterSupply” is used by many decision makers and water authorities worldwide to improve availability and sustainability of drinking water.
Numerical extension of Finite Volume and Finite Element Methods for statistical modelling and multi-parameter optimization in groundwater flow, contaminant transport and multiphase flow for sustainable water resources management.
Contributed to several successful EU and World Bank grants to provide funding for the research project.
Jul 2004 - Dec 2011
7 years 6 months
Zürich, Switzerland
Managing Director & Co-founder
4Dimensional GmbH
Design, development and productization of “FreshWaterSupply” - a decision support software for ecologically and economically sustainable water resources management, which was jointly developed with IfU, ETH Zurich (see above).
Summary
I support my clients in the development of production‑grade responsible AI solutions with advanced algorithms and state‑of‑the‑art AI engineering & infrastructure.
Languages
German
Native
English
Advanced
Latin
Advanced
French
Elementary
Italian
Elementary
Education
Oct 2005 - Oct 2007
TU München
M.Sc. (hons) · Computational Science and Engineering · Munich, Germany
Oct 2000 - Oct 2005
ETH Zürich
M.Sc. · Civil Engineering · Zürich, Switzerland
Oct 1989 - Oct 1998
Gymnasium bei St. Stephan
Abitur · Augsburg, Germany
Certifications & licenses
IBM Qiskit - Certificate of Quantum Excellence, Global Summer School 2021
IBM
Certificate in Quantum Machine Learning
NITheP