Philipp Brunenberg

Research for Master's Thesis

Munich, Germany

Experience

Sep 2023 - Dec 2023
4 months

  • Implement a command-line tax automation and PDF processing tool.
  • Fine-tune the Llama 2 LLM with custom training data for information retrieval from PDFs.
  • Employ a student-teacher approach for data generation, utilizing ChatGPT.
  • Technologies: Python, PyTorch, Pandas, Google Colab, Llama, ChatGPT, Git, LangChain.
Sep 2022 - Sep 2023
1 year 1 month

Deutsche Börse

  • Develop concepts and consult on functional and non-functional requirements to meet business needs.
  • Create and implement real-time streaming application designs that process billions of messages daily.
  • Use Spark Structured Streaming for custom operations and complex data management.
  • Migrate traditional DWH to event stream-based data representation.
  • Technologies: Spark (SQL, Structured Streaming), Kafka (Avro), HBase, Oracle DB, Informatica, HDFS, Zeppelin, YARN, Apache Hive.
Dec 2021 - Jul 2022
8 months

DB Systel

  • Address stability and performance challenges with a Neo4j database.
  • Create a concurrent ETL process for sorted node grid loading in Neo4j.
  • Develop Neo4j monitoring and automated backup routines.
  • Technologies: Neo4j, Python, AWS, Grafana, Graphite, PostgreSQL, HDFS, Kafka, Apache Spark.
Sep 2019 - Apr 2021
1 year 8 months

Ippen Digital GmbH

  • Design and operate a large-scale, stream-fed, multi-billion-node TigerGraph knowledge graph.
  • Analyze requirements for and build a graph-based article recommendation engine.
  • Design and validate multi-stream schema models.
  • Technologies: TigerGraph, Kafka, AWS, Docker, Kubernetes, Java, Scala, Go, Grafana, Prometheus, Terraform, Helm.
Mar 2019 - Jul 2019
5 months

Telefonica Germany GmbH

  • Design and implement real-time streaming applications.
  • Conduct workshops on Apache Kafka.
  • Technologies: Spark, Kafka, Kafka Streams, Scala, Docker, Kubernetes, AWS, Akka Streams.
Sep 2018 - Jan 2019
5 months

Allianz SE

  • Develop a proof-of-concept for event-driven data analytics.
  • Build an ingestion pipeline transferring relational data to a graph database.
  • Technologies: Spark, Kafka, Java, Scala, Docker, Neo4j, Git.
Jan 2018 - Jul 2018
7 months

Ayfie GmbH

  • Optimize a text analytics pipeline for scalability and performance.
  • Utilize NLP and machine learning for knowledge discovery use cases.
  • Research and apply scalable algorithmic solutions for data analysis.
  • Educate teams on modern big data approaches.
  • Technologies: Spark, Java, Scala, Docker, AWS, CI, Spring, Elasticsearch, SQL, Sonar, Git, Grafana, Graphite.
Jun 2017 - Sep 2017
4 months

EatSmarter GmbH

  • Calculate nutritional values for food recipe databases.
  • Design solution pipelines integrating multi-source machine learning-based approaches.
  • Deploy solutions to AWS infrastructures.
  • Technologies: Spark, Scala, AWS, Docker, Python, Git, MySQL, CouchDB.
Nov 2016 - May 2017
7 months

Research for Master's Thesis

Technical University Munich (TUM)

  • Topic: Knowledge Discovery in textual Databases for calculating nutritional food recipe values.
  • Machine learning and NLP methods employed: Tokenization, Part-of-Speech Tagging, Stemming, Logistic Regression.
  • Technologies: Spark, Scala, Docker, Python, Stanford-NLP.
Nov 2016 - May 2017
7 months

Team Lead of Software Development (Student Group 'Roboy')

Technical University Munich (TUM)

  • Develop a child-sized humanoid robot as part of an interdisciplinary team effort.
  • Lead the software development team through the design and engineering processes.
  • Technologies: C++, ROS, CMake, Unix.
Jun 2016 - Sep 2016
4 months

EatSmarter GmbH

  • Provide expert-rated indicators for recipe healthiness via supervised classification approaches.
  • Consult on and design a solution for innovative problem-solving in management datasets.
  • Technologies: Spark, Scala, AWS, SQL, Git, Docker, Python.
Apr 2016 - Present
9 years 3 months

Freelance Data Engineering & Machine Learning Consultant

Self-employed

  • Craft high-quality and scalable data-driven applications tailored to client needs.
  • Create production-ready software leveraging machine learning techniques and big data architectures.
  • Collaborate with in-house software teams and non-technical stakeholders to solve problems and reach a shared understanding.
  • Conduct workshops that enable teams to develop scalable big data applications independently.
Oct 2014 - Mar 2015
6 months

R&D - UAV Mission Planner

Elektroniksystem- und Logistik-GmbH (ESG)

  • Bachelor’s Thesis: Design a dynamic planner that integrates into UAV mission software.
  • Researched route planning algorithms for gas hazard mission zones.
  • Technologies: C++, Qt, CMake, Unix.
May 2013 - Sep 2014
1 year 5 months

Software Engineer - Aerosystems Avionics

Elektroniksystem- und Logistik-GmbH (ESG)

  • Developed distributed middleware for critical safety modules in aeronautics.
  • Adapted modular avionics architectures for Unix subsystems.
  • Technologies: C, CMake, Unix.

Summary

I am dedicated to helping my clients build scalable, high-quality big data & machine learning solutions.

Senior Freelance Data Engineering & AI Consultant with eight years of experience.

Expert-level knowledge and experience in big data technologies Apache Spark & Apache Kafka.

Content creator on self-hosted blog, YouTube and social media.

Founder of the Spark Rockstars Academy: Teaching Data Engineers to Pro-Level.

Co-Founder of DayCaptain: The fastest time planning tool for effective developers.

Languages

German
Native
English
Advanced
Spanish
Advanced

Education

Oct 2015 - Jun 2017

Technical University Munich

Master of Science, major fields of study: Machine Learning, Artificial Intelligence, Data Analytics, Entrepreneurship · Computer Science · Munich, Germany

Jan 2014 - Jun 2014

Kwantlen Polytechnic University

Cloud Computing, Mobile Programming · Vancouver, Canada

Oct 2011 - Jun 2015

University of Applied Sciences Munich

Bachelor of Science, major fields of study: Mathematics, Algorithms, Data Structures, Software Engineering, Software · Computer Science · Munich, Germany