Stephan Sahm
Senior Data/ML Consultant & Technical Lead
Experience
Senior Data/ML Consultant & Technical Lead
Jolin.io
Role: Software Engineer & Applied Mathematician (Mathematical optimization for scheduling; duration: 1 month; team setting: Team of 2, remote; technologies: JuMP, Julia, Pluto, Svelte, JavaScript, TypeScript, JetBrains Space, Terraform, Nomad)
Role: Software & Cloud & Web Engineer (Building scalable data science compute cluster from scratch; duration: 11 months; team setting: Team of 1, on-site; technologies: Terraform, Kubernetes, k8s ingress, k8s services, k8s RBAC, k8s networking, k3s, etcd, S3, DNS, certificates, Julia, Pluto, JavaScript, Tailwind, Astro, npm, Parcel, Preact, MUI, JWT, AWS SQS, AWS RDS, Python, GitLab, GitHub)
Role: AI & Web Engineer (Custom ChatGPT service; duration: 1 month; team setting: Team of 2, remote; technologies: Python, Poetry, LangChain, Tailwind, ChatGPT API, Flask, FastAPI)
Role: Architect & Data Engineer (Central datalake setup and ingestion; duration: 9 months; team setting: Team of 5, remote; technologies: Infrastructure-as-code, AWS CDK, Python, Boto3, PySpark, AWS Glue, IAM, S3, ECS, Fargate, Lambda, Apache Hudi, DeltaLake, Databricks, GitHub, Jira, Miro)
Role: Software Engineer (PoC Julia migration of scikit-decide; duration: 1 month; team setting: Team of 2, remote; technologies: Python, Julia, GitHub)
Organiser
Meetup Julia User Group Munich
Organising events & speakers
Giving introductory courses on Julia topics & deep-dive sessions
Senior Data Science Consultant & Technical Lead
Machine Learning Reply
Role: Lead Developer & Architect (Supporting use case development on datalake; duration: 6 months; team setting: Team Lead, Team of 2, remote; technologies: Infrastructure-as-code, CloudFormation, Sceptre, Python, Boto3, PySpark, Scala, Spark, AWS Glue, AWS Secrets, AWS IAM, S3, SNS, Kubernetes, AWS VPC, AWS Networking, GitHub, Jira)
Role: Lead Developer & Architect (20 ETL pipelines on AWS; duration: 10 months; team setting: Team Lead, Team of 3, remote; technologies: AWS Glue, PySpark, Python, Boto3, Pandas, AWS SNS, AWS SQS, SQL, MySQL, PostgreSQL, MongoDB, AWS DocumentDB, Salesforce, AWS API Gateway, AWS Cognito, AWS Lambda, infrastructure-as-code, CloudFormation, Sceptre, GitHub, Jira)
Role: Lead Developer & Architect (Building multitenant datalake on AWS; duration: 5 months; team setting: Team Lead, Team of 2, remote with a few on-site workshops; technologies: Infrastructure-as-code, CloudFormation, Sceptre, Python, Boto3, PySpark, Scala, Spark, AWS SageMaker, AWS Glue, AWS Secrets, AWS IAM, S3, SNS, Lambda, Kubernetes, EKS, Kafka, MSK, AWS VPC, AWS Transit Gateway, AWS Networking, AWS EC2, AWS Session Manager, AWS CloudWatch, GitHub, Jira)
Senior Data Science & Engineering Consultant
Data Reply
Role: Core Developer (Unification of existing time series analytics; duration: 9 months; team setting: Team of 15, on-site, Scrum; technologies: Python, PySpark, (PL)SQL, Hive, HBase, Oracle, Tableau, NiFi, Kubernetes, Docker, Azure, GitLab)
Role: Data Science Developer (Recommender system; duration: 7 months; team setting: Team of 1, on-site, weekly reviews; technologies: R, Scala, SBT, Spark, Yarn, HDFS, Bitbucket, Jira, Grafana, Prometheus, Elastic Stack, Kibana)
Role: Quality Assurance & Adviser (Custom data science framework review; duration: 2 months; team setting: Team of 1, mixed remote & on-site; technologies: R, AWS)
Role: Teacher (Workshop: Developing with Apache Spark; duration: Four one-day workshops; setting: Group of 15 persons, sole presenter; technologies: R, Python, Spark)
Data Science Consultant
Data Reply
Role: Data Science Developer (Fraud detection; duration: 14 months; team setting: Team of 1, on-site, review once every three months; technologies: R, Scala, Spark, Yarn, Bitbucket, Jira, Elastic Stack, Kibana)
Role: Data Science Developer (Call center and web content optimization using speech analytics; duration: 6 months; team setting: Team of 3, on-site, reviews every week; technologies: Python, NLP, spaCy, GitHub, Elastic Stack, Kibana)
Student Employee
Trufflebit
Role: Web Developer (Web visualization; duration: 4 months; team setting: Team of 1, remote, steady exchange with CEO; technologies: Python, Django, Bokeh, GitLab)
Role: Python Developer (Data parsing; duration: 8 months; team setting: Team of 1, remote, steady exchange with CEO; technologies: Python, PyParsing, Cython, GitLab)
Computer vision & Object recognition (Study Project)
University of Osnabrück
- Programmed a robot with wheels and arms to collect a muffin from the receptionist on the first floor, take the elevator, and bring it to the robotics lab; team setting: Team of 14, on-site, Scrum; technologies: ROS, Gazebo, Python, C++, OpenCV, Git
Summary
Stephan Sahm is a full-stack lead data science consultant and cloud architect specializing in big data, high-performance computing, probabilistic computing, and scientific machine learning.
Stephan Sahm brings 10+ years of experience in data science, 7+ years in consulting, and 2 years leading small teams. He holds an outstanding master's degree in cognitive science, a master's degree in statistics, and a bachelor's degree in informatics. He has worked with big data systems since 2017 and with cloud platforms since 2018. He has architected cloud-based data solutions for several businesses with AWS, Azure, and Kubernetes, and has implemented several machine learning use cases in production using Python, R, Scala, and Julia.
Skills
Lead
Visionary
Team Player
Self Reflection
Caregiver
High Quality
Clean Code
Clean Setup
Clean Infrastructure
State-of-the-art
Julia
Python
R
MATLAB
Scala
Spark
Haskell
Nix
ROS
AWS
Cloud
DevOps
CI/CD
Automation
Statistics
AI
Deep Learning
Expert Systems
Time Series
Machine Learning
Probabilistic Modelling
Optimization
Recommendation
Fraud & Anomalies
NLP
Planning
Computer Vision
Julia
Python
R
MATLAB
Java
C
Nix
Bash
Zsh
Scala
CDK
SageMaker
Glue
EMR
Lambda
EC2
S3
ECS
Fargate
EKS
MSK
IAM
VPC
API Gateway
Systems Manager
Azure Machine Learning
Azure Data Lake Storage
Terraform
Active Directory
Web App Service
Spark
Delta Lake
Hudi
Databricks
Kubernetes
Kafka
Flink
PostgreSQL
MongoDB
Hive
Elasticsearch
Apache NiFi
Dataiku
DataRobot
Tableau
Kibana
GitLab CI/CD
GitHub Actions
DVC
Docker
Kubernetes
Nomad
Kubeflow
Airflow
MLflow
Prometheus
Grafana
Elastic Stack
JuMP.jl
Flux.jl
PyTorch
TensorFlow
Keras
TFP
Stan
PyMC3
Turing.jl
Pandas
NumPy
SciPy
Yellowbrick
D3
Bokeh
Plotly
Dash
Git
Bash
Vim
Tmux
Make
SSH
IntelliJ
PyCharm
RStudio
Jupyter
Pluto
Atom
VS Code
HTML
CSS
Tailwind
JavaScript
TypeScript
React
Preact
MUI
Svelte
Astro
Vite
Parcel
Miro
Jira
GitHub
GitLab
JetBrains Space
Kanban
Scrum
Languages
Education
Radboud University
M.Sc. Applied Stochastics · Nijmegen, Netherlands · very good
University of Osnabrück
M.Sc. Cognitive Science · Osnabrück, Germany · outstanding
University of Osnabrück
B.Sc. Mathematics/Computer Science · Osnabrück, Germany · outstanding
Certifications & licenses
AWS Certified Architect Professional
AWS Certified Architect Associate
AWS Certified Machine Learning Specialty
AWS Certified Big Data Specialty
AWS Certified Cloud Practitioner
Cloudera Certified CCA Spark and Hadoop Developer
Professional Scrum Master™ I (PSM I)