Philipp Brunenberg
Instructor
Experience
Instructor
Spark Rockstars Academy
- Help developers with individual live coaching to become pro-level Apache Spark engineers
- Organize and host multi-day, tailored Apache Spark workshops for development teams
- Create educational technical content on a self-hosted blog, YouTube, and social media
Implement LLM-Based Tax-Automation Tool
Self-owned
- Implement a command line-based tax automation and PDF processing tool
- Fine-tune the Llama 2 LLM with custom-generated training data for information retrieval from PDF documents
- Utilize a student–teacher approach to generate training data with ChatGPT results
- Technologies: Python, PyTorch, Pandas, Google Colab, Llama, ChatGPT, LangChain, Git, Docker, Kubernetes, AWS, Jenkins
Re-Design of a Large-Scale Streaming Application
Deutsche Börse
- Conceptualize functional and non-functional requirements, consult business and technical specification teams, and conduct technical reviews
- Develop technical design for a real-time streaming application processing several billion messages per day
- Implement Spark Structured Streaming applications with custom outer-join operations, HBase access, and complex data structures
- Conceptualize and implement data migration routines from classical data warehouse to event stream–based representation
- Technologies: Spark (SQL, Structured Streaming), Kafka (Avro), HBase, Oracle DB, Informatica, HDFS, Zeppelin, YARN, Apache Hive
Advanced Analytics in a Large-Scale Knowledge Graph
DB Systel
- Implement and operate a SaaS tool for personal time management with full-stack development including a Neo4j backend
- Address client stability and performance challenges with a large-scale single-node Neo4j database
- Implement a real-time recommendation engine in Neo4j based on Naive Bayesian classification
- Develop a concurrent ETL process to load a time-sorted two-dimensional grid of linked nodes into Neo4j
- Perform frontend development of landing and app pages (HTML, CSS, JavaScript, Selenium)
- Set up a monitoring system for Neo4j and automate nightly backups
- Technologies: Neo4j, Python, AWS, Grafana, Graphite, PostgreSQL, HDFS, Kafka
Co-Founder
DayCaptain
- Co-founded the fastest personal time planning tool for effective developers
Advanced Graph Analytics on a Multi-Billion Node Knowledge Graph
Ippen Digital GmbH
- Design, implement, and operate a large-scale, stream-fed multi-billion node knowledge graph using TigerGraph
- Create a graph-based real-time recommendation engine for news articles
- Perform requirement analysis, schema modeling, and validation of multiple data streams in a knowledge graph
- Technologies: TigerGraph, Kafka, AWS, Docker, Kubernetes, Java, Scala, Go, Grafana, Prometheus, Terraform, Helm
Implementation of Real-Time Analytics Applications
Telefonica Germany GmbH
- Design and implement real-time streaming applications
- Deliver internal workshops on Apache Kafka
- Technologies: Spark, Kafka, Kafka Streams, Scala, Docker, Kubernetes, AWS, Akka Streams
Design & Implementation of an Advanced Analytics Streaming Application
Allianz SE
- Design and implement a proof-of-concept for an event-driven data analytics application
- Develop an ingestion pipeline to transform and feed relational data into a graph database
- Technologies: Spark, Kafka, Java, Scala, Docker, Neo4j, Git
Feature Implementation & Performance Optimizations of German Document Mining Application
Ayfie GmbH
- Optimize a large-scale text analytics pipeline for scalability and performance
- Implement knowledge discovery use cases utilizing state-of-the-art NLP and ML approaches
- Research and apply efficient algorithms for analyzing large data sets
- Educate in-house development team on big data software development and data mining
- Technologies: Spark, Java, Scala, Docker, AWS, CI, Spring, Elasticsearch, SQL, Sonar, Git, Grafana, Graphite
Machine Learning for Knowledge Discovery in Food Recipes
EatSmarter GmbH
- Design proof-of-concept for a calculation pipeline based on multiple heterogeneous data sources and machine learning approaches
- Implement data processing pipeline to calculate nutritional values for food recipes
- Deploy pipeline on client AWS infrastructure
- Technologies: Spark, Scala, AWS, Docker, Python, Git, MySQL, CouchDB
Team Lead of Software Development
TUM Student Group “Roboy”
- Lead interdisciplinary team to develop a child-sized humanoid robot
- Responsible for design, implementation, and software engineering processes
- Technologies: C++, ROS, CMake, Unix
Machine Learning for Automatic Classification of Food Recipes
EatSmarter GmbH
- Provide an indicator for the healthiness of food recipes based on expert ratings
- Consult on management and design thinking to derive innovative use cases from client data
- Supervise generation of training data set and analyze data quality
- Implement supervised classification system based on selected features and validate final solution
- Technologies: Spark, Scala, AWS, SQL, Git, Docker, Python
Freelance Data Engineering & Machine Learning Consultant
Self-employed
- Craft high-quality, scalable data-driven applications tailored to client requirements
- Create production-ready software solutions leveraging cutting-edge machine learning techniques and robust big data architectures
- Collaborate with in-house software development teams and non-technical stakeholders to solve practical problems
- Deliver customized workshops to empower teams to develop scalable big data applications independently
Research & Development Engineer: Unmanned Aerial Vehicle
Elektroniksystem- und Logistik-GmbH (ESG)
- Design and prototype a dynamic mission planner for integration into UAV mission planning software in C++
- Research route planning approaches for unmanned helicopters to explore predefined areas for hazardous gas
- Implement routing algorithm for integration into mission planning software
- Technologies: C++, Qt, CMake, Unix
Software Engineer & Developer: Aerosystems Avionics
Elektroniksystem- und Logistik-GmbH (ESG)
- Develop time- and safety-critical distributed middleware in C
- Deploy middleware on a modular avionics platform with interchangeable Unix-like modules
- Technologies: C, CMake, Unix
Industries Experience
Experienced in Information Technology (10 years), Education (3 years), Aerospace and Defense (2 years), Banking and Finance (1.5 years), Media and Entertainment (1.5 years), and Professional Services (0.5 years).
Business Areas Experience
Experienced in Product Development (11.5 years), Information Technology (11 years), Business Intelligence (10 years), Research and Development (2 years), and Accounting (0.5 years).
Summary
I am dedicated to helping my clients build scalable, high-quality big data & machine learning solutions.
- Senior Freelance Data Engineering & AI Consultant with eight years of experience
- Expert-level knowledge and experience in big data technologies Apache Spark & Apache Kafka
- Content creator on self-hosted blog, YouTube and social media
- Founder of the Spark Rockstars Academy: Teaching Data Engineers to Reach Pro Level
- Co-Founder of DayCaptain: The fastest time planning tool for effective developers
Skills
Big Data Technologies
- Apache Spark
- Apache Kafka
- Neo4j
- TigerGraph
- Big Data Architectures
- Kafka Streams
- Apache Airflow
- Apache Iceberg
- Delta Lake
- Databricks
- Snowflake
- Hadoop
- HDFS
- MapReduce
- Apache Hive
- Apache HBase
- SQL
- MySQL
- Oracle
- PostgreSQL
- Informatica
- YARN
- Elasticsearch
- Apache Cassandra
Software Development
- Scala
- Python
- Java
- C/C++
- Clean Code
- Software Engineering
- Software Architectures
- Test-driven Development
Artificial Intelligence
- PyTorch
- LLMs (Transformers)
- TensorFlow
- Scikit-learn
- NLP
- Machine Learning Algorithms
- Statistics
Cloud & Deployment
- Docker
- Kubernetes
- AWS
- Jenkins
- LangChain
Communication
- Requirements Analysis
- Non-technical Consulting
- Training for Developers
Content Creation
- Visualization for Learning
- Video Creation
- Blog Posts
Education
Kwantlen Polytechnic University Vancouver
Semester abroad · Computer Science · Vancouver, Canada
Technical University Munich
M. Sc. Computer Science · Munich, Germany
University of Applied Sciences Munich
B. Sc. Computer Science · Munich, Germany