Philipp B.
Instructor
Experience
Sep 2023 - Present
2 years 4 monthsInstructor
Spark Rockstars Academy
- Help developers with individual live coaching to become pro-level Apache Spark engineers
- Organize and host multi-day, tailored Apache Spark workshops for development teams
- Create educational technical content on a self-hosted blog, YouTube, and social media
Sep 2023 - Dec 2023
4 monthsImplement LLM-Based Tax-Automation Tool
Self-owned
- Implement a command line-based tax automation and PDF processing tool
- Fine-tune Llama2 LLM with custom generated training data for information retrieval from PDF documents
- Utilize a student–teacher approach to generate training data with ChatGPT results
- Technologies: Python, PyTorch, Pandas, Google Colab, Llama, ChatGPT, langchain, Git, Docker, Kubernetes, AWS, Jenkins
Sep 2022 - Sep 2023
1 year 1 monthGermany
Re-Design of a Large-Scale Streaming Application
Deutsche Börse
- Conceptualize functional and non-functional requirements, consult business and technical specification teams, and conduct technical reviews
- Develop technical design for real-time streaming application processing multi-billion messages per day
- Implement Spark Structured Streaming applications with custom outer-join operations, HBase access, and complex data structures
- Conceptualize and implement data migration routines from classical data warehouse to event stream–based representation
- Technologies: Spark (SQL, Structured Streaming), Kafka (Avro), HBase, Oracle DB, Informatica, HDFS, Zeppelin, YARN, Apache Hive
Dec 2021 - Jul 2022
8 monthsGermany
Advanced Analytics in a Large-Scale Knowledge Graph
DB Systel
- Implement and operate a SaaS tool for personal time management with full-stack development including a Neo4j backend
- Address client stability and performance challenges with a large-scale single-node Neo4j database
- Implement a real-time recommendation engine in Neo4j based on Naive Bayesian classification
- Develop a concurrent ETL process to load a timely-sorted two-dimensional grid of linked nodes in Neo4j
- Perform frontend development of landing and app pages (HTML, CSS, JavaScript, Selenium)
- Setup monitoring system for Neo4j and automate nightly backups
- Technologies: Neo4j, Python, AWS, Grafana, Graphite, PostgreSQL, HDFS, Kafka
Jan 2021 - Present
5 yearsCo-Founder
DayCaptain
- Co-founded the fastest personal time planning tool for effective developers
Sep 2019 - Apr 2021
1 year 8 monthsGermany
Advanced Graph Analytics on a Multi-Billion Node Knowledge Graph
Ippen Digital GmbH
- Design, implement, and operate a large-scale, stream-fed multi-billion node knowledge graph using TigerGraph
- Create a forms graph-based real-time recommendation engine for news articles
- Perform requirement analysis, schema modeling, and validation of multiple data streams in a knowledge graph
- Technologies: TigerGraph, Kafka, AWS, Docker, Kubernetes, Java, Scala, Go, Grafana, Prometheus, Terraform, Helm
Mar 2019 - Jul 2019
5 monthsGermany
Implementation of Real-Time Analytics Applications
Telefonica Germany GmbH
- Design and implement real-time streaming applications
- Deliver internal workshops on Apache Kafka
- Technologies: Spark, Kafka, Kafka Streams, Scala, Docker, Kubernetes, AWS, Akka Streams
Sep 2018 - Jan 2019
5 monthsGermany
Design & Implementation of an Advanced Analytics Streaming Application
Allianz SE
- Design and implement a proof-of-concept for an event-driven data analytics application
- Develop an ingestion pipeline to transform and feed relational data into a graph database
- Technologies: Spark, Kafka, Java, Scala, Docker, Neo4j, Git
Jan 2018 - Jul 2018
7 monthsGermany
Feature Implementation & Performance Optimizations of German Document Mining Application
Ayfie GmbH
- Optimize a large-scale text analytics pipeline for scalability and performance
- Implement knowledge discovery use cases utilizing state-of-the-art NLP and ML approaches
- Research and apply efficient algorithms for analyzing large data sets
- Educate in-house development team on big data software development and data mining
- Technologies: Spark, Java, Scala, Docker, AWS, CI, Spring, Elasticsearch, SQL, Sonar, Git, Grafana, Graphite
Jun 2017 - Sep 2017
4 monthsGermany
Machine Learning for Knowledge Discovery in Food Recipes
EatSmarter GmbH
- Design proof-of-concept for a calculation pipeline based on multiple heterogeneous data sources and machine learning approaches
- Implement data processing pipeline to calculate nutritional values for food recipes
- Deploy pipeline on client AWS infrastructure
- Technologies: Spark, Scala, AWS, Docker, Python, Git, MySQL, CouchDB
Nov 2016 - May 2017
7 monthsMunich, Germany
Team Lead of Software Development
TUM Student Group “Roboy”
- Lead interdisciplinary team to develop a child-sized humanoid robot
- Responsible for design, implementation, and software engineering processes
- Technologies: C++, ROS, CMake, Unix
Jun 2016 - Sep 2016
4 monthsGermany
Machine Learning for Automatic Classification of Food Recipes
EatSmarter GmbH
- Provide an indicator for the healthiness of food recipes based on expert ratings
- Consult on management and design thinking to derive innovative use cases from client data
- Supervise generation of training data set and analyze data quality
- Implement supervised classification system based on selected features and validate final solution
- Technologies: Spark, Scala, AWS, SQL, Git, Docker, Python
Apr 2016 - Present
9 years 9 monthsGermany
Freelance Data Engineering & Machine Learning Consultant
Self-employed
- Craft high-quality, scalable data-driven applications tailored to client requirements
- Create production-ready software solutions leveraging cutting-edge machine learning techniques and robust big data architectures
- Collaborate with in-house software development teams and non-technical stakeholders to solve practical problems
- Deliver customized workshops to empower teams to develop scalable big data applications independently
Oct 2014 - Mar 2015
6 monthsFürstenfeldbruck, Germany
Research & Development Engineer: Unmanned Aerial Vehicle
Elektroniksystem- und Logistik-GmbH (ESG)
- Design and prototype a dynamic mission planner for integration into UAV mission planning software in C++
- Research route planning approaches for unmanned helicopters to explore predefined areas for hazardous gas
- Implement routing algorithm for integration into mission planning software
- Technologies: C++, Qt, CMake, Unix
May 2013 - Sep 2014
1 year 5 monthsFürstenfeldbruck, Germany
Software Engineer & Developer: Aerosystems Avionics
Elektroniksystem- und Logistik-GmbH (ESG)
- Develop time- and safety-critical distributed middleware in C
- Deploy middleware on a modular avionics platform with interchangeable Unix-like modules
- Technologies: C, CMake, Unix
Summary
I am dedicated to help my clients build scalable, high-quality big data & machine learning solutions.
- Senior Freelance Data Engineering & AI Consultant with eight years of experience
- Expert-level knowledge and experience in big data technologies Apache Spark & Apache Kafka
- Content creator on self-hosted blog, YouTube and social media
- Founder of the Spark Rockstars Academy: Teaching Data Engineers to Pro-Level
- Co-Founder of DayCaptain: The fastest time planning tool for effective developers
Skills
Big Data Technologies
- Apache Spark
- Apache Kafka
- Neo4j
- Tigergraph
- Big Data Architectures
- Kafka Streams
- Apache Airflow
- Apache Iceberg
- Delta Lake
- Databricks
- Snowflake
- Hadoop
- Hdfs
- Mapreduce
- Apache Hive
- Apache Hbase
- Sql
- Mysql
- Oracle
- Postgresql
- Informatica
- Yarn
- Elasticsearch
- Apache Cassandra
Software Development
- Scala
- Python
- Java
- C/c++
- Clean Code
- Software Engineering
- Software Architectures
- Test-driven Development
Artificial Intelligence
- Pytorch
- Llms (Transformers)
- Tensorflow
- Scikit-learn
- Nlp
- Machine Learning Algorithms
- Statistics
Cloud & Deployment
- Docker
- Kubernetes
- Aws
- Jenkins
- Langchain
Communication
- Requirements Analysis
- Non-technical Consulting
- Training For Developers
Content Creation
- Visualization For Learning
- Video Creation
- Blog Posts
Languages
German
NativeEnglish
AdvancedSpanish
AdvancedEducation
Jan 2016 - Apr 2016
Kwantlen Polytechnic University Vancouver
Semester abroad · Computer Science · Vancouver, Canada
Oct 2015 - May 2017
Technical University Munich
M. Sc. Computer Science · Munich, Germany
Oct 2011 - Mar 2015
University of Applied Sciences Munich
B. Sc. Computer Science · Munich, Germany
Need a freelancer? Find your match in seconds.
Try FRATCH GPT More actions
Similar Freelancers
Discover other experts with similar qualifications and experience