Advising on the use of analytics and BI tools and services in the Microsoft Azure stack (e.g. MS Fabric, Synapse Workspaces and dedicated SQL pools, SQL Database, PostgreSQL, Snowflake, Databricks, Data Factory, SSIS, Analysis Services, Function Apps, Power BI, ML ...)
Independently designing analytics solutions with Python, SQL, etc.
Designing and implementing ETLs and data pipelines
Creating and maintaining APIs
Applying CI/CD, testing and version control independently
Data modeling
Model development and model optimization
Anomaly detection with AI
Predictive analytics
Technologies used: Snowflake, Fabric, Azure Synapse Analytics, Azure Data Factory, Azure Data Lake, Azure DevOps, Databricks, Spark, CI/CD, SQL Database, Python
Jan 2021 - Dec 2022
1 year
Germany
Senior Data Engineer and Data Governance Manager
Statistisches Bundesamt
Project conception
Designing a big data architecture for processing very large data volumes
Designing and implementing ETLs and data pipelines
Data governance on the Cloudera Data Platform (CDP)
Data classification and cataloging (e.g. PII detection)
Metadata management and data inventory
Access control and permission management
Supporting the department in planning and carrying out new projects
Onboarding and training department staff on the Cloudera Data Platform