Philipp Brunenberg

Senior Data & Machine Learning Engineer

Munich, Germany

Experience

Sep 2023 - Dec 2023
4 months

Implement LLM-Based Tax-Automation Tool

Self-owned

  • Implement a command-line tool for tax automation and PDF processing
  • Fine-tune the Llama 2 LLM on custom-generated training data to enable custom information retrieval from PDF documents
  • Use a student-teacher approach to generate training data from ChatGPT results
  • Technologies: Python, PyTorch, Pandas, Google Colab, Llama, ChatGPT, LangChain
Sep 2022 - Sep 2023
1 year 1 month

Re-Design of a Large-Scale Streaming Application

Deutsche Börse

  • Conceptualize functional & non-functional requirements, consult with business requirements and technical specification teams, and conduct technical reviews of specifications
  • Develop the technical design for a real-time streaming application processing multiple billions of messages per day
  • Implement multiple Spark Structured Streaming applications, including custom outer-join operations, HBase access and complex data structures
  • Conceptualize and implement data migration routines from classical DWH to event stream-based representation
  • Technologies: Spark (SQL, Structured Streaming), Kafka (Avro), HBase, Oracle DB, Informatica, HDFS, Zeppelin, YARN, Apache Hive
Dec 2021 - Jul 2022
8 months
Germany

Advanced Analytics in a Large-Scale Knowledge Graph

DB Systel

  • Help the client resolve stability and performance challenges with a large-scale single-node Neo4j database
  • Implement a concurrent ETL process to load a time-sorted two-dimensional grid of linked nodes into Neo4j
  • Set up a monitoring system for Neo4j
  • Implement an automated process that backs up the Neo4j database nightly and stores the archives
  • Technologies: Neo4j, Python, AWS, Grafana, Graphite, PostgreSQL, HDFS, Kafka
Sep 2019 - Apr 2021
1 year 8 months

Advanced Graph Analytics on a Multi-Billion Node Knowledge Graph

Ippen Digital GmbH

  • Design, implementation & operation of a large-scale, stream-fed, multi-billion node knowledge graph (TigerGraph)
  • Requirement analysis, design, implementation and operation of a graph-based real-time recommendation engine for news articles
  • Schema modeling & validation for a knowledge graph built from multiple data streams
  • Technologies: TigerGraph, Kafka, AWS, Docker, Kubernetes, Java, Scala, Go, Grafana, Prometheus, Terraform, Helm
Mar 2019 - Jul 2019
5 months
Germany

Implementation of Real-Time Analytics Applications

Telefonica Germany GmbH

  • Design and implement real-time streaming applications
  • Give an internal workshop on Apache Kafka
  • Technologies: Spark, Kafka, Kafka Streams, Scala, Docker, Kubernetes, AWS, Akka Streams
Sep 2018 - Jan 2019
5 months

Design & Implementation of an Advanced Analytics Streaming Application

Allianz SE

  • Designed and implemented a proof-of-concept for an event-driven data analytics application
  • Developed an ingestion pipeline to transform and feed relational data into a graph database
  • Technologies: Spark, Kafka, Java, Scala, Docker, Neo4j, Git
Jan 2018 - Jul 2018
7 months

Feature Implementation & Performance Optimizations of Document Mining Application

Ayfie GmbH

  • Optimized a large-scale text analytics pipeline for scalability and performance
  • Implemented knowledge discovery use-cases utilizing state-of-the-art NLP & ML approaches
  • Researched and applied efficient algorithms for analyzing large data sets
  • Educated in-house development team on big data software development and data mining
  • Technologies: Spark, Java, Scala, Docker, AWS, CI, Spring, Elasticsearch, SQL, Sonar, Git, Grafana, Graphite
Jun 2017 - Sep 2017
4 months

Machine Learning for Knowledge Discovery in Food Recipes

EatSmarter GmbH

  • Project: Automatic calculation of nutritional values for food recipes
  • Consultation on planning of the project and the solution approach
  • Solution design: POC for a calculation pipeline based on multiple heterogeneous data sources and various machine learning approaches
  • Implementation of a data processing pipeline to calculate nutritional values
  • Deployment of the pipeline on client infrastructure (AWS)
  • Technologies: Spark, Scala, AWS, Docker, Python, Git, MySQL, CouchDB
Nov 2016 - May 2017
7 months
Munich, Germany

Master's Thesis: Knowledge Discovery in Unstructured Data

Technical University Munich (TUM)

  • Title: Knowledge Discovery in textual Databases for enhancing the automatic Calculation of nutritional Values for online-based Food Recipes
  • The work utilizes various machine learning and NLP approaches to extract information from unstructured text to determine the nutritional content of food recipes
  • Theoretical approaches: Tokenization, Part-of-Speech Tagging, Stemming, Neural Networks, Logistic Regression, Word Embeddings
  • Technologies: Apache Spark, Scala, Docker, Python, Stanford-NLP
Nov 2016 - May 2017
7 months
Munich, Germany

Team Lead of Software Development at TUM Student Group "Roboy"

Technical University Munich (TUM)

  • As an interdisciplinary team of TUM students, we developed a child-sized humanoid robot and promoted the work at various events around the world
  • As team lead of the software development group, I was responsible for the group's design, implementation and software engineering processes
  • Technologies: C++, ROS (Robot Operating System), CMake, Unix
Jun 2016 - Sep 2016
4 months

Machine Learning for Automatic Classification of Food Recipes

EatSmarter GmbH

  • Project: Provide an indicator for the healthiness of food recipes based on expert ratings
  • Consulting in management and design thinking to derive useful, innovative use cases from the client's existing data
  • Proof of concept: supervision of training data set generation and analysis of data quality
  • Implementation of a supervised classification system based on selected features and various approaches
  • Implementation and validation of final solution
  • Technologies: Spark, Scala, AWS, SQL, Git, Docker, Python
Apr 2016 - Present
8 years 11 months
Germany

Freelance Data Engineering & Machine Learning Consultant

Self-employed

As a passionate freelance big data consultant, my expertise lies in crafting high-quality, scalable data-driven applications that meet the unique requirements of my clients. I am dedicated to creating production-ready software solutions, leveraging cutting-edge machine learning techniques and robust big data architectures. Collaborating with in-house software development teams as well as non-technical stakeholders, I thrive on practical problem-solving approaches and finding clear perspectives. Furthermore, I take great pleasure in empowering teams through customized workshops, equipping them with the knowledge and skills to develop scalable big data applications independently.

Oct 2014 - Mar 2015
6 months
Fürstenfeldbruck, Germany

Research & Development: Unmanned Aerial Vehicle

Elektroniksystem- und Logistik-GmbH (ESG)

  • Bachelor's Thesis: Design and prototypical implementation of a dynamic mission planner for integration into the mission planning software of an unmanned aerial vehicle in C++
  • Throughout the thesis I researched multiple route-planning approaches that allow an unmanned helicopter (UMAT) to explore a predefined area for hazardous gas
  • I implemented the routing algorithm in C++ to be used by the mission planning software developed by ESG
  • Technologies: C++, Qt, CMake, Unix
May 2013 - Sep 2014
1 year 5 months
Fürstenfeldbruck, Germany

Software Engineering & Development: Aerosystems Avionics

Elektroniksystem- und Logistik-GmbH (ESG)

  • I was part of the software development team building a time- and safety-critical distributed middleware in C
  • The middleware was deployed on a highly modular avionics platform developed by ESG
  • The platform was composed of multiple Unix-like modules, which were interchangeable on the fly
  • Technologies: C, CMake, Unix

Summary

I am dedicated to helping my clients build scalable, high-quality big data & machine learning solutions.

Languages

German
Native
Spanish
Advanced
English
Elementary

Education

Oct 2014 - Jun 2017

Technical University Munich

M. Sc. · Computer Science · Munich, Germany

Oct 2011 - Jun 2014

University of Applied Sciences Munich

B. Sc. · Computer Science · Munich, Germany

Kwantlen Polytechnic University

Cloud Computing, Mobile Programming · Vancouver, Canada