Martin Mauch

Freelance Data Architect

Mintraching, Germany

Experience

Jul 2023 - Aug 2023
2 months
Germany

Freelance Data Architect

Zeppelin

  • Evaluation and scoring of various technologies as future telematics platform: Kafka Streams, Spark, Splunk, Snowflake
  • Improve test framework and scalability of Telematics streaming service: Scala, Property‑Based Testing, Kafka, Kafka Streams, Kubernetes
Jan 2021 - Aug 2023
1 year 8 months

Scala Node-RED

Github

  • Scala.js library for writing custom Node-RED nodes.
  • Developed from scratch, main maintainer.
Apr 2021 - Jun 2023
2 years 3 months
Magdeburg, Germany

Big Data Architect

Ultra Tendency

  • Implementation of a Big Data Record Linkage Pipeline: Cloudera Public Cloud, Spark, Hadoop, Hive, Kafka, Splink
  • Implementation of a CI/CD pipeline for the developed Big Data applications: Ansible, Gitlab CI, Docker, Kubernetes
  • Creation of a PoC for automatic validation and correction of data, as well as record linkage: Spark, Drools, Splink, Apache NiFi, Scala, Python
  • Set up of a new Big Data cluster for a European capital: Cloudera Private Cloud, Ansible, Kerberos
  • Migration of Big Data applications to the new cluster: Docker, Spark, Hive, Python
Aug 2020 - Aug 2023
3 years 1 month
Berlin, Germany

Software Architect

Xencura GmbH

  • Design of the architecture and implementation of a prototype for the use of Industry 4.0 processes for individualized cancer therapy: Petri Nets, Industry 4.0, Digital Twins, Scala
Apr 2018 - Oct 2020
2 years 7 months
Munich, Germany

Digital Innovation Architect

Wacker Neuson SE

  • Design and implementation of an algorithm to optimize the planning of production sequences subject to constraints: Scala, Constraint Programming, Constraint Based Local Search, SAP
  • Research, design and development of a platform for processing telematics data: Spark, Hadoop, Azure, Kafka
  • Analysis and improvement of telematics data quality: Python, Pandas, scikit‑learn, Spark, Time Series Analysis, Active Learning
  • Design and establishment of a data science workflow: R, RStudio, Jupyter, DVC
  • Research, design and development of a central API Gateway which hides the complex system landscape of WN behind a simple interface: Scala, GraphQL, REST
  • Conception and assistance in the establishment of agile processes in the IT department: Scrum, Kanban, Pair Programming, Retrospectives, Root‑Cause Analysis, Hypothesis‑Driven Development
Jan 2015 - Aug 2023
7 years 8 months

Spark Excel

Github

  • Spark library for reading and writing Microsoft Excel files.
  • Developed from scratch, main maintainer.
Apr 2015 - Apr 2015
1 month
Warsaw, Poland

Probabilistic modelling with Scala

Scalar Conference

  • Introduction to Bayesian networks. Live coding example on how to do learning and inference using a Scala library.
Jan 2015 - May 2015
5 months

D3 Bayesian Network

Github

  • D3 extension for drawing Bayesian networks along with their conditional probability tables.
Aug 2013 - Aug 2013
1 month
Berlin, Germany

JRubymeets Scala

JRubyConf

  • Overview and best practices of how to use JRuby and Scala in the same project.
Nov 2008 - Apr 2018
9 years 6 months
Passau, Germany

Innovation Manager and Copartner, Team Lead Innovation Hub

crealytics GmbH

  • Implementation of an algorithm for automatically generating, evaluating and selecting ad copies: Ruby, Branch and Bound
  • Design and implementation of a method for creating statistical estimations of conversion rates: R, RStudio, Knime, Rapidminer, Regression Trees, Bayesianmodels, Support Vector Regression
  • Implementation of an algorithm for creating statistical models of search engine auctions and maximizing profit given additional constraints: Scala, Bayesian linear regression, spline models, Computational Algebra, quasi‑Newton optimization
  • Prototyping of various algorithms. Coordination between data science and engineering teams: MinHash, Bayesian Vector Auto‑Regression, ARIMA, Jupyter, RStudio, Spark
  • Creation of an ontology for products, brands and relevant search terms: Natural Language Processing, Neo4j
  • Design and implementation of an algorithm for optimal matching of products to search queries: Scala, Branch and Bound, Graph DB, ontologies
  • Implementation of a data processing pipeline: Scala, Spark
  • Conception, submission and execution of a government‑funded research cooperation project (ZIM Koop) with the University of Kassel: Java, Exponential Smoothing, Bayesian Models, Support Vector Machines, Graph DB, Ontologies
  • Conception and establishment of agile processes in the entire company: Scrum, Kanban, Pair Programming, Retrospectives, Root‑Cause Analysis, Hypothesis‑Driven Development

Languages

German
Native
English
Advanced
Spanish
Intermediate

Education

Sep 2001 - Sep 2008

University of Passau

Diploma Computer Science, focus on Machine Learning · Computer Science · Passau, Germany · 3.3 / 4.0

Certifications & licenses

Microsoft Certified: Azure AI Engineer Associate

Microsoft

Microsoft Certified: Azure AI Fundamentals

Microsoft

CKA: Certified Kubernetes Administrator

The Linux Foundation

CKAD: Certified Kubernetes Application Developer

The Linux Foundation

Databricks Certified Associate Developer for Apache Spark 3.0

Databricks

Enterprise Architecture

St. Peter Polytechnic University @ Coursera

Exam 533: Implementing Microsoft Azure Infrastructure Solutions

Microsoft

Medical Neuroscience

Duke University @ Coursera

Computational Neuroscience

University of Washington @ Coursera

Functional Programming Principles in Scala

EPFL @ Coursera