Jan Krol

Data Expert

Avatar placeholder
Berlin, Germany

Experience

Jun 2024 - Present
1 year 8 months

Data Expert

Manufacturing

Mar 2023 - May 2024
1 year 3 months

Data Expert

Intralogistics

  • Provided consulting and implementation of AWS infrastructure to support global process operations in Transport & Logistics
  • Provisioned and operated servers, OS environments, and databases in AWS
  • Identified and presented optimization potentials in commercial and technical terms
  • Administered and maintained provided systems
  • Developed maintenance and monitoring concepts
  • Advised development projects on system use, configuration, and optimization
  • Consulted on architectures and operational concepts using AWS Cloud
  • Trained internal employees on new AWS services and working methods

Services: AWS Glue, Redshift, EMR, SageMaker, Python

Jan 2022 - Feb 2023
1 year 2 months

Data Expert

Logistics

  • Developed and implemented a standardized big data architecture for group-wide platform services in the Transport & Logistics sector on Azure
  • Automated solutions using Infrastructure as Code (Terraform, Ansible)
  • Presented and discussed sub-project architectures on Azure
  • Implemented real-time data streaming with Apache Kafka and monitoring solutions
  • Advised on Azure platform strategy and reference architectures
  • Developed mechanisms for proactive elimination of vulnerabilities in Azure and Kubernetes clusters
  • Conceptualized container orchestration platforms with Kubernetes CI/CD
  • Created user and authorization concepts according to group specifications
  • Managed operational services within an agile team

Services: Azure Purview, Azure Synapse Analytics, Azure Data Factory, Azure Databricks, Terraform, GitLab Runner, Azure DevOps

Sep 2021 - Jan 2022
5 months

Data Expert

E-Commerce

  • Strategically developed and migrated analytics data pipelines into a Data Lakehouse architecture on AWS
  • Enhanced the Big Data Lake environment and ensured stringent data quality and GDPR compliance
  • Performed exploratory analysis and algorithm development through data provisioning and preparation (AWS Glue, Spark, Lambda)
  • Developed ETL jobs and data pipelines to provide ready-to-consume data sources (AWS Glue, Redshift, Spark, PySpark)
  • Conducted regression testing and quality checks in data pipelines and the data lake
  • Implemented high-performance streaming data processing with Kinesis, Kafka, and Lambda
  • Orchestrated and connected multiple data sources
  • Automated deployments using DevOps best practices (CodeBuild, CodePipeline, GitHub Actions)
  • Built infrastructure with IaC (AWS CDK)
  • Monitored data quality, compliance, and costs

Services: AWS Glue, Kinesis, Kafka, Apache Spark, Data Catalog, S3, Athena, Redshift, Lambda, ECS, Step Functions

Apr 2020 - Sep 2021
1 year 6 months

Data Expert

E-Commerce

  • Guided internal e-commerce product teams in developing, implementing, and maintaining high-performance data processing and integration systems
  • Migrated existing data services, pipelines, and assets to a new event-based serverless architecture
  • Developed and executed Lambda functions and PySpark jobs
  • Designed architecture and integration with Kafka for real-time processing and analysis of event data
  • Implemented PySpark transformations, filtering, and aggregations
  • Ensured efficient and reliable connection with Kafka, configured security settings, and integrated with other components
  • Established extensive testing and monitoring mechanisms
  • Delivered a high-performance, scalable event system enabling data-driven decision-making

Services: AWS Glue, Apache Spark, Data Catalog, S3, Athena, Redshift, Lambda, ECS, Step Functions

Feb 2019 - Apr 2020
1 year 3 months

Data Expert

Transport & Logistics

  • Integrated logistics data streams with Event Hub and Kafka using PySpark Structured Streaming
  • Designed and implemented a pipeline for capturing, processing, and forwarding data streams
  • Utilized PySpark Structured Streaming for efficient real-time data processing
  • Configured and initialized PySpark streaming jobs and defined necessary data structures
  • Conducted comprehensive testing and monitoring to ensure smooth data transmission and high data quality
  • Enabled robust and efficient integration of logistics data streams with Event Hubs
  • Delivered real-time utilization of logistics data for analysis and further processing

Services: Azure Synapse Analytics, Purview Data Catalog, Apache Spark, Event Hub, Structured Streaming, GraphFrame, Azure Storage v2, Power BI

Sep 2018 - Feb 2019
6 months

Data Expert

Transport & Logistics

  • Spearheaded development of a robust data strategy and governance framework to streamline and enhance data handling capabilities
  • Constructed a sophisticated data management platform on Databricks
  • Designed and implemented an efficient data hub ingestion platform
  • Led the design and establishment of an organization-wide data strategy aligned with business goals
  • Developed a comprehensive data governance framework ensuring data accuracy, privacy, and compliance
  • Oversaw deployment and customization of the data management platform on Databricks
  • Enhanced data processing, analysis, and reporting capabilities with Power BI
  • Engineered a robust data hub with advanced ingestion pipelines based on AWS EventBridge
  • Optimized data flow from diverse sources to centralized storage systems (Data Lake House on Azure)
  • Collaborated with cross-functional teams to integrate the data management platform with existing IT infrastructure
  • Conducted training sessions and workshops to foster a data-driven culture and enhance data literacy

Services: Azure Databricks, Databricks Data Catalog, AWS EventBridge, Kinesis, Event Hub, Structured Streaming, Apache Spark

Data Expert

Transport & Logistics

  • Served as the technical lead managing a team of 3 offshore developers while implementing scalable and robust data solutions in Azure Databricks
  • Introduced Databricks Live Tables for schema and table management
  • Implemented Databricks Asset Bundle following an Infrastructure as Code mindset
  • Designed and refined the medallion data architecture to optimize data processing workflows
  • Collaborated closely with multiple business units to ensure data solutions met their specific requirements
  • Established coding standards and best practices for the development team
  • Conducted code reviews and provided technical guidance
  • Facilitated knowledge transfer and technical upskilling sessions
  • Developed scalable ETL pipelines in Azure Databricks
  • Created optimized data storage solutions with future scalability in mind
  • Established a complete IaC workflow for data platform components
  • Integrated version control and CI/CD for Databricks Asset Bundles
  • Automated deployment of table schemas, jobs, and notebooks
  • Implemented environment promotion strategies (Dev/Test/Prod)
  • Managed configuration for cross-environment consistency

Services: Azure Databricks, Databricks Live Tables, Databricks Asset Bundle, Azure Data Factory, Delta Lake, Spark SQL, Azure Key Vault, Azure Storage, Power BI

Summary

Big Data Specialist Focus: Big Data, Cloud Architecture, Data Management Platforms

Skills

  • Big Data Platform Specialist With Focus On Amazon Web Services & Microsoft Azure

  • Etl Processes/pipelines & Data Engineering

  • Architecture Of Data Management Plaftorm In Enterprises

  • Build Up Of Data Lakes & Data Lakehouses

  • Application Migrations Using Cloud Services

  • Consulting & Implementation Of Automation Concepts Especially Devops

  • Integration Of Active Directory, Security Concepts And Compliance Requirements

  • Monitoring And Logging

  • Confident In Python, Sql, Typescript, Golang

  • Big Data Cloud Architecture (Aws & Microsoft Azure)

  • Data Engineering (Databricks, Synapse Analytics, Fabric, Apache Spark, Aws Glue, Athena, Redshift & Emr)

  • Infrastructure As Code (Terraform, Pulumi, Aws Cdk, Arm)

Languages

German
Native
English
Advanced
Polish
Advanced

Certifications & licenses

AWS Business Professional

AWS Certified Cloud Practitioner

AWS Certified Machine Learning – Specialty

AWS Certified Solutions Architect – Associate

AWS Technical Professional

Azure Solutions Architect Expert: AZ-300: Microsoft Azure Architect Technologies AZ-301: Microsoft Azure Architect Design

Databricks Certified Associate Developer For Apache Spark 3.0

HashiCorp Certified: Terraform Associate

Need a freelancer? Find your match in seconds.
Try FRATCH GPT
More actions

Similar Freelancers

Discover other experts with similar qualifications and experience

Serge Kalinin
Serge Kalinin

MLOps (machine learning operations)

View Profile
Umar Maqsud
Umar Maqsud

Senior AI Architect & Engineer

View Profile
Kai Held
Kai Held

Backend Python Engineer

View Profile
Benito Exner
Benito Exner

Cloud DevOps Engineer

View Profile
Qaiser Abbasi
Qaiser Abbasi

Freelance Lead DevOps Engineer

View Profile
Max Ritter
Max Ritter

Cloud (AWS) | AI | DevOps | Data

View Profile
Michal Budzyn
Michal Budzyn

Senior Golang Engineer

View Profile
Robert Raźniewski
Robert Raźniewski

Software Developer

View Profile
Niko Schmuck
Niko Schmuck

Developing Architect, Technical Lead "gridlytics"

View Profile
Martin Musiol
Martin Musiol

Product Owner AI Learning Platform

View Profile
Yannick Schuchmann
Yannick Schuchmann

Freelance IT Consultant/Advisor

View Profile
Marcel Meyer
Marcel Meyer

Cloud-Architect, Senior Solution Architect, Senior Software-Engineer

View Profile
Fady Kuzman
Fady Kuzman

Senior Software Developer / Tech Lead

View Profile
Stephan Baier
Stephan Baier

Freelance Data Scientist

View Profile
Michael Fecher
Michael Fecher

Freelancer, Solution Architect

View Profile
Thomas Hoefkens
Thomas Hoefkens

Senior MLOps, DevOps Engineer

View Profile
Stephan Sahm
Stephan Sahm

Senior Data/ML Consultant & Technical Lead

View Profile
Matthias Isler
Matthias Isler

Fractional CTO (Principal Engineer / Technical Architect)

View Profile
Manuel Pasieka
Manuel Pasieka

AI Engineer

View Profile
Stephan Rudolph
Stephan Rudolph

ICT Architect/Programmer, DevOps, Design, Implementation, Test, Documentation

View Profile
Pappu Prasad
Pappu Prasad

Senior Cloud Consultant (AWS Services and Consulting)

View Profile
Domenik Jones
Domenik Jones

Python Engineer and Cloud Migration Consultant

View Profile
Kiriakos Krastillis
Kiriakos Krastillis

Tech Lead: API Experience Platform

View Profile
Alexander Zhirov
Alexander Zhirov

Senior Data Architect & Data Engineer

View Profile
Prasad Tilloo
Prasad Tilloo

Solution Architect / Senior Manager – DTC E-Commerce Platform

View Profile
Shamaila Mahmood
Shamaila Mahmood

Senior Software Architect

View Profile
Christian Schulz
Christian Schulz

Data-Scientist/AI Engineer

View Profile
Arne Hendricks
Arne Hendricks

Embedded Fullstack Developer

View Profile
Enis Spahi
Enis Spahi

Software Developer

View Profile
Pierre Bernard
Pierre Bernard

Director Engineering & Technology

View Profile