Philipp B.

Instructor

Avatar placeholder
Munich, Germany

Experience

Sep 2023 - Present
2 years 4 months

Instructor

Spark Rockstars Academy

  • Help developers with individual live coaching to become pro-level Apache Spark engineers
  • Organize and host multi-day, tailored Apache Spark workshops for development teams
  • Create educational technical content on a self-hosted blog, YouTube, and social media
Sep 2023 - Dec 2023
4 months

Implement LLM-Based Tax-Automation Tool

Self-owned

  • Implement a command line-based tax automation and PDF processing tool
  • Fine-tune Llama2 LLM with custom generated training data for information retrieval from PDF documents
  • Utilize a student–teacher approach to generate training data with ChatGPT results
  • Technologies: Python, PyTorch, Pandas, Google Colab, Llama, ChatGPT, langchain, Git, Docker, Kubernetes, AWS, Jenkins
Sep 2022 - Sep 2023
1 year 1 month
Germany

Re-Design of a Large-Scale Streaming Application

Deutsche Börse

  • Conceptualize functional and non-functional requirements, consult business and technical specification teams, and conduct technical reviews
  • Develop technical design for real-time streaming application processing multi-billion messages per day
  • Implement Spark Structured Streaming applications with custom outer-join operations, HBase access, and complex data structures
  • Conceptualize and implement data migration routines from classical data warehouse to event stream–based representation
  • Technologies: Spark (SQL, Structured Streaming), Kafka (Avro), HBase, Oracle DB, Informatica, HDFS, Zeppelin, YARN, Apache Hive
Dec 2021 - Jul 2022
8 months
Germany

Advanced Analytics in a Large-Scale Knowledge Graph

DB Systel

  • Implement and operate a SaaS tool for personal time management with full-stack development including a Neo4j backend
  • Address client stability and performance challenges with a large-scale single-node Neo4j database
  • Implement a real-time recommendation engine in Neo4j based on Naive Bayesian classification
  • Develop a concurrent ETL process to load a timely-sorted two-dimensional grid of linked nodes in Neo4j
  • Perform frontend development of landing and app pages (HTML, CSS, JavaScript, Selenium)
  • Setup monitoring system for Neo4j and automate nightly backups
  • Technologies: Neo4j, Python, AWS, Grafana, Graphite, PostgreSQL, HDFS, Kafka
Jan 2021 - Present
5 years

Co-Founder

DayCaptain

  • Co-founded the fastest personal time planning tool for effective developers
Sep 2019 - Apr 2021
1 year 8 months
Germany

Advanced Graph Analytics on a Multi-Billion Node Knowledge Graph

Ippen Digital GmbH

  • Design, implement, and operate a large-scale, stream-fed multi-billion node knowledge graph using TigerGraph
  • Create a forms graph-based real-time recommendation engine for news articles
  • Perform requirement analysis, schema modeling, and validation of multiple data streams in a knowledge graph
  • Technologies: TigerGraph, Kafka, AWS, Docker, Kubernetes, Java, Scala, Go, Grafana, Prometheus, Terraform, Helm
Mar 2019 - Jul 2019
5 months
Germany

Implementation of Real-Time Analytics Applications

Telefonica Germany GmbH

  • Design and implement real-time streaming applications
  • Deliver internal workshops on Apache Kafka
  • Technologies: Spark, Kafka, Kafka Streams, Scala, Docker, Kubernetes, AWS, Akka Streams
Sep 2018 - Jan 2019
5 months
Germany

Design & Implementation of an Advanced Analytics Streaming Application

Allianz SE

  • Design and implement a proof-of-concept for an event-driven data analytics application
  • Develop an ingestion pipeline to transform and feed relational data into a graph database
  • Technologies: Spark, Kafka, Java, Scala, Docker, Neo4j, Git
Jan 2018 - Jul 2018
7 months
Germany

Feature Implementation & Performance Optimizations of German Document Mining Application

Ayfie GmbH

  • Optimize a large-scale text analytics pipeline for scalability and performance
  • Implement knowledge discovery use cases utilizing state-of-the-art NLP and ML approaches
  • Research and apply efficient algorithms for analyzing large data sets
  • Educate in-house development team on big data software development and data mining
  • Technologies: Spark, Java, Scala, Docker, AWS, CI, Spring, Elasticsearch, SQL, Sonar, Git, Grafana, Graphite
Jun 2017 - Sep 2017
4 months
Germany

Machine Learning for Knowledge Discovery in Food Recipes

EatSmarter GmbH

  • Design proof-of-concept for a calculation pipeline based on multiple heterogeneous data sources and machine learning approaches
  • Implement data processing pipeline to calculate nutritional values for food recipes
  • Deploy pipeline on client AWS infrastructure
  • Technologies: Spark, Scala, AWS, Docker, Python, Git, MySQL, CouchDB
Nov 2016 - May 2017
7 months
Munich, Germany

Team Lead of Software Development

TUM Student Group “Roboy”

  • Lead interdisciplinary team to develop a child-sized humanoid robot
  • Responsible for design, implementation, and software engineering processes
  • Technologies: C++, ROS, CMake, Unix
Jun 2016 - Sep 2016
4 months
Germany

Machine Learning for Automatic Classification of Food Recipes

EatSmarter GmbH

  • Provide an indicator for the healthiness of food recipes based on expert ratings
  • Consult on management and design thinking to derive innovative use cases from client data
  • Supervise generation of training data set and analyze data quality
  • Implement supervised classification system based on selected features and validate final solution
  • Technologies: Spark, Scala, AWS, SQL, Git, Docker, Python
Apr 2016 - Present
9 years 9 months
Germany

Freelance Data Engineering & Machine Learning Consultant

Self-employed

  • Craft high-quality, scalable data-driven applications tailored to client requirements
  • Create production-ready software solutions leveraging cutting-edge machine learning techniques and robust big data architectures
  • Collaborate with in-house software development teams and non-technical stakeholders to solve practical problems
  • Deliver customized workshops to empower teams to develop scalable big data applications independently
Oct 2014 - Mar 2015
6 months
Fürstenfeldbruck, Germany

Research & Development Engineer: Unmanned Aerial Vehicle

Elektroniksystem- und Logistik-GmbH (ESG)

  • Design and prototype a dynamic mission planner for integration into UAV mission planning software in C++
  • Research route planning approaches for unmanned helicopters to explore predefined areas for hazardous gas
  • Implement routing algorithm for integration into mission planning software
  • Technologies: C++, Qt, CMake, Unix
May 2013 - Sep 2014
1 year 5 months
Fürstenfeldbruck, Germany

Software Engineer & Developer: Aerosystems Avionics

Elektroniksystem- und Logistik-GmbH (ESG)

  • Develop time- and safety-critical distributed middleware in C
  • Deploy middleware on a modular avionics platform with interchangeable Unix-like modules
  • Technologies: C, CMake, Unix

Summary

I am dedicated to help my clients build scalable, high-quality big data & machine learning solutions.

  • Senior Freelance Data Engineering & AI Consultant with eight years of experience
  • Expert-level knowledge and experience in big data technologies Apache Spark & Apache Kafka
  • Content creator on self-hosted blog, YouTube and social media
  • Founder of the Spark Rockstars Academy: Teaching Data Engineers to Pro-Level
  • Co-Founder of DayCaptain: The fastest time planning tool for effective developers

Skills

Big Data Technologies

  • Apache Spark
  • Apache Kafka
  • Neo4j
  • Tigergraph
  • Big Data Architectures
  • Kafka Streams
  • Apache Airflow
  • Apache Iceberg
  • Delta Lake
  • Databricks
  • Snowflake
  • Hadoop
  • Hdfs
  • Mapreduce
  • Apache Hive
  • Apache Hbase
  • Sql
  • Mysql
  • Oracle
  • Postgresql
  • Informatica
  • Yarn
  • Elasticsearch
  • Apache Cassandra

Software Development

  • Scala
  • Python
  • Java
  • C/c++
  • Clean Code
  • Software Engineering
  • Software Architectures
  • Test-driven Development

Artificial Intelligence

  • Pytorch
  • Llms (Transformers)
  • Tensorflow
  • Scikit-learn
  • Nlp
  • Machine Learning Algorithms
  • Statistics

Cloud & Deployment

  • Docker
  • Kubernetes
  • Aws
  • Jenkins
  • Langchain

Communication

  • Requirements Analysis
  • Non-technical Consulting
  • Training For Developers

Content Creation

  • Visualization For Learning
  • Video Creation
  • Blog Posts

Languages

German
Native
English
Advanced
Spanish
Advanced

Education

Jan 2016 - Apr 2016

Kwantlen Polytechnic University Vancouver

Semester abroad · Computer Science · Vancouver, Canada

Oct 2015 - May 2017

Technical University Munich

M. Sc. Computer Science · Munich, Germany

Oct 2011 - Mar 2015

University of Applied Sciences Munich

B. Sc. Computer Science · Munich, Germany

Need a freelancer? Find your match in seconds.
Try FRATCH GPT
More actions

Similar Freelancers

Discover other experts with similar qualifications and experience

Ursula M.
Ursula M.

Business Mentor

View Profile
Marleen K.
Marleen K.

Atlassian Specialist E2E Processes

View Profile
Felix M.
Felix M.

AI Trainer

View Profile
Hernani C.
Hernani C.

AI CTO & Co-Founder

View Profile
Eduard V.
Eduard V.

Workshop Leader 'Introduction to AI Development Tools'

View Profile
Jonas K.
Jonas K.

Project Lead

View Profile
Christian G.
Christian G.

Product Owner – Redesign of an iOS and Android SaaS Application Platform

View Profile
Alexander L.
Alexander L.

Guest lecturer in Artificial Intelligence (Master’s Level)

View Profile
Stephan S.
Stephan S.

Project and Product Manager, Consultant and Trainer

View Profile
Christian M.
Christian M.

Senior Feature Engineer - Banking

View Profile
Andreas E.
Andreas E.

AI Coach and Consultant

View Profile
Louis G.
Louis G.

Freelance Solutions Architect and Machine Learning Engineer

View Profile
Stefan M.
Stefan M.

Project Manager

View Profile
Marc S.
Marc S.

Consultant / Interim / Freelance

View Profile
Annette E.
Annette E.

Product Director

View Profile
Sascha J.
Sascha J.

Agile Coach & Partner

View Profile
Stephan S.
Stephan S.

Senior Data/ML Consultant & Technical Lead

View Profile
Fabian D.
Fabian D.

Volunteer Examiner for IT Apprenticeships

View Profile
Kai K.
Kai K.

Chatbots for Tax and Legal Texts

View Profile
Thomas B.
Thomas B.

Agile Coach & Senior Scrum Master

View Profile
Albert F.
Albert F.

Lead Product Owner

View Profile
René P.
René P.

Founder and CEO

View Profile
Marco T.
Marco T.

Scrum Master

View Profile
Christian W.
Christian W.

Project Manager for e-Invoice Implementation

View Profile
Andreas D.
Andreas D.

Interim Chief Technology & Product Officer

View Profile
Martina P.
Martina P.

CEO / Producer

View Profile
Roland J.
Roland J.

Business Analyst Generative AI solutions

View Profile
Katrin M.
Katrin M.

CEO

View Profile
Jens H.
Jens H.

#52 Test Manager in the Field of Safety-Critical Infrastructure

View Profile
Martin M.
Martin M.

Product Owner AI Learning Platform

View Profile