Jorge Machado

Data Architect

Würzburg, Germany

Experience

Mar 2025 - Present
11 months

Data Architect

Deutsche Bahn

  • Design and provide best practices on data modeling for dbt, including slowly changing dimensions, late-arriving data handling, and testing
  • Design the ingestion flow from other systems into S3 and Redshift
  • Design and implement new partitions for Dagster and incremental loading with dbt
  • Map business requirements to technical architectures
  • Instruct junior team members
Sep 2024 - Mar 2025
7 months

Data Architect Expert

SAP AG

  • Led the architectural design and implementation of Kafka Tiered Storage rollout across 30+ Kubernetes clusters in multi-cloud environments (Azure, AWS, GCP)
  • Defined and implemented infrastructure provisioning using Crossplane for declarative and consistent deployment across cloud providers
  • Developed a custom Golang-based Kafka Operator to standardize tiered storage activation for data pipelines
  • Designed and automated GitOps-based deployment strategies using Flux and Helm for safe and repeatable rollouts
  • Optimized Gardener shoot configurations to align cluster resources with Kafka workload and cost efficiency requirements
May 2024 - Nov 2024
7 months

Data Architect Expert

s.Oliver GmbH

  • Designed a medallion architecture on Databricks for scalable, modular data ingestion, transformation, and consumption
  • Implemented incremental ETL pipelines using PySpark to efficiently extract and process SAP data
  • Architected and implemented dbt-based semantic layers with dimensional modeling for fact and dimension tables
  • Established Dev-to-Prod CI/CD pipelines to standardize deployment and enforce governance
  • Defined role-based access control and security concepts aligned with enterprise Azure standards
  • Enabled real-time data integration by connecting Kafka streams to Databricks for enriched analytics
  • Introduced AI/ML use cases, including FP-Growth for basket analysis and time series forecasting models
  • Mentored junior developers on Databricks best practices to ensure long-term platform adoption
Jan 2023 - Aug 2023
8 months

Data Architect Expert

ias Gruppe

  • Architected an end-to-end Azure Data Lakehouse solution leveraging Azure Synapse, Delta Lake, and Azure Data Lake Storage Gen2 for scalable storage and query performance
  • Designed and implemented streaming ingestion pipelines using Azure IoT Hub, Event Hub, and Service Bus for real-time telemetry data capture from thousands of IoT devices
  • Developed data integration and transformation flows using Airbyte for ELT and dbt for business logic modeling, dimensional design, and lineage tracking
  • Orchestrated complex data workflows using Azure Data Factory, integrating batch and streaming processes
  • Implemented Delta Lake-based time travel and ACID transactions for data reliability and traceability
  • Designed RBAC, resource tagging strategies, and monitoring with Azure Monitor and Log Analytics for operational transparency and security
  • Enabled Power BI integration for near real-time business dashboards and collaborated with product and operations teams on requirements translation
Sep 2022 - May 2024
1 year 9 months
Frankfurt, Germany

Data Architect Expert

Deutsche Bahn

  • Designed and implemented real-time streaming architectures using AWS Kinesis, Lambda, and Apache Spark for time-sensitive analytics use cases
  • Architected delta ingestion pipelines on AWS Glue and Apache Hudi for efficient small-file compaction and time travel analytics
  • Delivered business-critical KPIs and dashboards with end-to-end data lineage and auditability across S3, PostgreSQL, and CloudWatch
  • Defined and enforced infrastructure-as-code principles using AWS CDK for scalable, replicable environments
  • Introduced and rolled out dbt for semantic modeling and reusable business logic integrated into GitLab CI/CD workflows
  • Conducted architectural evaluations of Databricks, Snowflake, and AWS Athena to support future platform strategy decisions
  • Mentored a team of developers, optimizing development cycles and ensuring cloud data engineering best practices
  • Implemented IoT 4.0 pipelines for ingesting telemetry data and supporting predictive analytics initiatives
Sep 2021 - Sep 2022
1 year 1 month
Rottendorf, Germany

Kafka Expert

s.Oliver GmbH

  • Developed Spring Boot Kafka Streams applications
  • Created custom Kafka source connectors for SAP systems and custom sink connectors to write back to SAP
  • Deployed Kafka Connect connectors with monitoring on Azure Kubernetes Service
  • Developed data pipelines using Airflow and Azure Cloud
  • Architected data pipelines between on-premise and Azure Cloud
  • Wrote Spark jobs to clean and aggregate data
Feb 2021 - Aug 2022
1 year 7 months
Germany

Software Developer

RTL Deutschland

  • Designed and implemented a Lakehouse architecture combining Azure Databricks, Delta Lake, and Azure Synapse for batch and real-time workloads with ACID compliance
  • Built RESTful data APIs using FastAPI and deployed them via Azure App Services as a controlled access layer
  • Developed incremental ETL pipelines using PySpark and dbt, implementing star schema models for semantic consistency and historical tracking
  • Enabled interactive reporting and visual analytics using Power BI integrated into Azure
  • Implemented strong data access controls, audit logging, and resource monitoring for GDPR compliance and governance
  • Established automated CI/CD pipelines for data infrastructure using Azure-native tooling
Sep 2020 - Jun 2021
10 months
Munich, Germany

Cloud Solution Architect

Allianz Technology

  • Migrated data lakes into Azure Cloud with high automation using ArgoCD, Jenkins, Helm charts, and Terraform
  • Developed Spark jobs for data lake migration
  • Created Helm charts for Azure AKS automation
  • Refactored application designs to be cloud-native and onboarded internal customers to Azure
  • Implemented Spring Boot Kafka Streams applications and Argo workflow pipelines
Mar 2020 - May 2020
3 months
Munich, Germany

Big Data Architect, Data Architect

BMW AG

  • Developed data pipelines using Spark and Airflow for self-driving car data
  • Generated metrics for geospatial applications
  • Ingested data into Elasticsearch using Apache Spark
  • Applied functional programming principles with Scala
Jan 2020 - May 2020
5 months
Stuttgart, Germany

Big Data Developer

DXC

  • Automated Azure Kubernetes cluster deployments
  • Created and deployed deep learning Spark jobs with PyTorch and GPUs on Kubernetes
  • Performed GPU inferencing on terabytes of data
Sep 2017 - Jun 2018
10 months
Nuremberg, Germany

Big Data Developer, Spark / Kafka Developer, Data Architect

GfK

  • Wrote Kafka Connectors to ingest data into Accumulo in a Kerberized environment
  • Kerberized applications for Hadoop, Kafka, and Kafka Connect
  • Created statistic plans for RDF4J queries over Accumulo
  • Developed Apache NiFi workflows
  • Introduced Git flow, CI/CD, and Docker automation
  • Set up Kafka Connect with Kerberos on Google Kubernetes
  • Wrote Java applications based on RDF and Semantic Web technologies
Apr 2017 - Sep 2017
6 months
Frankfurt, Germany

Big Data Architect

Deutsche Bahn

  • Sized and configured Hadoop clusters with Kerberos and Active Directory
  • Migrated data using Sqoop and managed workflows with Oozie
  • Implemented data pipelines using Kylo, Apache NiFi, and Talend
  • Deployed Hortonworks Cloudbreak on AWS and Apache Storm streaming applications
  • Supported internal clients with streaming and data cleaning operations
Oct 2016 - Mar 2017
6 months
Dresden, Germany

Big Data Developer and Architect

Kiwigrid

  • Created Spark jobs for historical data reporting
  • Developed custom Spark data sources for HBase and aggregation for data exploration
  • Architected an alerting and computing framework based on Spark Streaming
  • Deployed applications using Docker

Skills

General Skills:

  • Apache Spark
  • Java MapReduce
  • Scala
  • Java
  • Python
  • Perl
  • Tornado
  • REST APIs
  • Jira
  • ETL
  • Docker
  • Maven
  • Gradle
  • Kubernetes
  • Jenkins
  • Cloud Build
  • Azure Cosmos DB
  • S3
  • Neo4j
  • Azure Kubernetes Service
  • AKS
  • Flask
  • Spring Boot
  • Data Vault 2.0
  • PyTorch
  • TensorFlow
  • Azure IoT
  • Modbus
  • MQTT
  • OPC
  • PLC
  • Azure Data Factory
  • Azure Synapse
  • LLM

Operating System Skills:

  • AIX
  • Ubuntu
  • CentOS
  • Mac OS X
  • Windows Server 2008 R2
  • FlexFrame
  • Routing
  • Git
  • IBM HADR
  • IBM TSM
  • AWS S3
  • Apache Mesos

SAP Skills:

  • RFC
  • SNC
  • ChaRM
  • Kernel Upgrades
  • EHP Upgrade
  • SSFS
  • SSO
  • HANA

Databases:

  • Oracle 11
  • Db2
  • SAP MaxDB
  • MySQL
  • AWS Redshift
  • PostgreSQL

Cloud Technologies:

  • AWS EMR
  • AWS Glue
  • AWS ECS
  • AWS S3
  • Google App Engine
  • Azure Kubernetes
  • Azure Containers

Languages

German
Advanced
English
Advanced

Certifications & licenses

Databricks Lakehouse Platform Accreditation

Confluent Certified Developer for Apache Kafka

Generative AI with Large Language Models (LLM)

CKAD: Certified Kubernetes Application Developer

Microsoft Certified: Azure Fundamentals

Data Engineering Nanodegree

Functional Programming Principles in Scala on Coursera

Big Data Analytics Fraunhofer IAIS

Big Data Analytics by University of California, San Diego on Coursera

Databricks Developer Training for Apache Spark

Hadoop Platform and Application Framework by University of California on Coursera

Machine Learning with Big Data by University of California, San Diego on Coursera

SAP OS and DB Migration (TADM70)

SAP Database Administration I (Oracle) (ADM505)

SAP Database Administration II (Oracle) (ADM506)

SAP NetWeaver AS Implementation and Operation I (TADM10)

SAP NetWeaver Portal - Implementation and Operation (TEP10)

ITIL Foundation V4
