Recommended expert

Davide Imperati

Consultant – Lead Data Engineer

Davide Imperati
Msida, United Kingdom

Experience

Aug 2022 - Present
3 years 7 months
United Kingdom

Consultant – Lead Data Engineer

Awaze

  • Federated data across three major brands and a few minor ones
  • Defined a target data model and guided the migration from legacy systems to the Snowflake data warehouse
  • Implemented master data management in the SaaS vendor tool (Step by Stibo)
  • Extracted data from legacy systems, transformed it for compatibility with the new data model, and loaded it into the data warehouse preserving lineage
  • Enabled the frontend team to fetch data from the unified system to support cross-selling across brands
  • Centralized data governance by enabling data stewards to perform governance according to business functions
  • Implemented logging, monitoring, and enhanced aggregate data visualization (basic analytics) under dual-run constraints over two years

Technologies: Python, Pandas, SciPy, FastAPI, Flask, Django, Git, GitHub, Jenkins, Jira, ClickMe, CI/CD, TDD, DevOps, Terraform, Docker, Snowflake, PostgreSQL, AS400, Matillion, AWS (S3, ECS, ELB, Route53, ECR), sFTP, dbt, API, Step (Stibo), Linux, Bash scripting.

May 2021 - Aug 2022
1 year 4 months
United Kingdom

Consultant – Tech Lead and Machine Learning Engineer

Many Pets (Bought By Many)

  • Onboarded internal and external datasets to support customer service and marketing
  • Automated import of PureCloud data, reformatted to enable advanced call center monitoring, delivering 15% performance improvement internally and 42% for a third-party center
  • Automated import of Mention-Me data for marketing analytics, replacing manual processing and saving time
  • Set up Airflow to run database manipulations with dbt and analytics tools in a containerized environment to improve performance and decouple dependencies

Technologies: Python, Pandas, SciPy, FastAPI, Flask, Django, Git, GitHub, Jenkins, Jira, ClickMe, CI/CD, TDD, DevOps, Terraform, Docker, Fivetran, BigQuery, Composer-Airflow, GCP, dbt, API, sFTP, New Relic.

Apr 2020 - Oct 2020
7 months
United Kingdom

Consultant – Core Data Engineering Lead, Solution Architect (Neuron Program)

Vodafone

  • Led migration of Vodafone’s Big Data platform to Google Cloud for all European markets handling terabytes per day
  • Refurbished core data engineering squad capabilities after IR35 impact, assessing and mitigating technical debt
  • Negotiated scope reduction with stakeholders to meet timelines and budget under COVID-19 constraints
  • Delivered migration with minor delay despite loss of knowledge and missing documentation

Technologies: Java EE, Scala, Python, PySpark, GitHub, Jenkins, Jira, CI/CD, TDD/BDD, DevOps, test automation, load/stress testing, cost optimization, GCP (Dataflow, Composer, Dataproc, Cloud Storage, BigQuery, Bigtable, Spanner, Flink), Kubernetes, Docker, Terraform.

Jul 2019 - Feb 2020
8 months
United Kingdom

Consultant – Quant Research and Solution Architect

Lloyds Banking Group

  • Revamped automated trade surveillance platform to meet auditor criteria
  • Mediated among stakeholders to standardize approaches across asset classes and ensure developer alignment
  • Defined templates for efficient, standardized analytics implementation
  • Implemented high-end analytics using NLP, machine learning, and advanced quantitative methods
  • Achieved audit passing with significant cost reduction and 67% spam reduction in alerts

Asset classes: FX spot/options, rates futures, bonds, swaps, repo, bespoke OTC Technologies: Java EE, Python, Pandas, NLTK, SciPy, NumPy, PySpark, Dask, Kafka, Bitbucket, Jenkins, Jira, CI/CD, TDD, DevOps, SVN, Confluence.

Jan 2019 - Apr 2019
4 months
United Kingdom

Consultant – Principal Data Scientist

News UK – The Times

  • Delivered “Project James,” a reinforcement learning AI for direct marketing optimization under a Google innovation grant
  • Assessed a partially implemented platform, rebuilt the core using state-of-the-art tools, and tuned for production viability within schedule
  • Overcame time pressure, partial documentation, and lack of prior case studies
  • Revolutionized churn reduction and supported award-winning contact center

Technologies: Python, pandas, SciPy, NumPy, TensorFlow, Django, Flask, GitHub, Jenkins, Jira, GitOps, CI/CD, DevOps, Kubernetes, Docker, Terraform, microservices, Confluence.

Jul 2018 - Dec 2018
6 months
United Kingdom

Consultant – Principal Data Scientist

News UK – The Times

  • Developed an online propensity model and API to improve conversion and personalize user experience for The Times Digital
  • Implemented real-time predictions at 1000+ predictions/sec with <250ms latency
  • Increased subscriptions by 5% and cross-sales by 9% and piloted high-throughput API deployment on Kubernetes

Technologies: Python, Pandas, NLTK, SciPy, NumPy, Django, Nginx, Docker, Kubernetes, Terraform, TensorFlow, GitHub, Jenkins, Jira, CI/CD, DevOps, New Relic.

Mar 2017 - Aug 2018
1 year 6 months
United Kingdom

Vice President of Technology

JP Morgan Chase

  • Managed delivery of a cloud logging and monitoring platform across 20-person, 3-site team for AWS public cloud adoption
  • Reviewed architecture post-PoC and scaled the platform to handle 5TB/day (5 billion messages, 1.3 billion peak)
  • Met strict cyber-security, availability, disaster recovery, and SLA/SLO requirements under constrained approved services
  • Enabled monitoring of 5 mission critical cloud applications, pioneering new patterns and scalable architecture

Technologies: AWS (API Gateway, Route53, S3, DynamoDB, Kinesis, Elastic Beanstalk, Lambda, ELB, IAM, CloudWatch, CloudTrail), MySQL, Boto, Terraform, FluentD, Flink, Kafka, Kafka Streams, Kinesis Firehose, NiFi, Cassandra, CQL, Elasticsearch, Logstash, Kibana, Java EE, Python, Bitbucket, Jenkins, Jira, CI/CD, TDD, BDD, DevOps, Docker, Kubernetes, Datadog.

Mar 2016 - Feb 2017
1 year
United Kingdom

Vice President of Data Architecture

JP Morgan Chase

  • Established standardized regulatory reporting across all businesses following regulatory change
  • Created controlled vocabularies and automated metadata management procedures
  • Served dictionaries and reference data via REST APIs within a microservice architecture
  • Delivered tools to mitigate regulatory risk and provide corporate insight

Asset classes: FX spot/options, rates futures, bonds, swaps, derivatives, OTC Technologies: Java EE, Spring, Python, RDF, OWL, SPARQL, semantic web standards, ontologies, semantic wiki, knowledge graphs, Neo4j, BigQuery (Blazegraph), ISO20022, Bitbucket, Jenkins, Jira, CI/CD, TDD, BDD, DevOps, Docker.

Nov 2014 - Feb 2016
1 year 4 months
United Kingdom

Vice President of Data Architecture

JP Morgan Chase

  • Developed meta-analytics for the Corporate and Investment Bank, labeling and scoring all data repositories and software products
  • Defined data quality metrics and formal ontologies for logical data models
  • Scanned metadata to infer physical models and linked them via heuristics, with manual refinement by information architects
  • Increased productivity of information architects by 4.7× through semi-automated processes

Technologies: Java, Spring, Python, RDF, OWL, semantic web standards, ontologies, knowledge graphs, BigQuery, ISO11179, Bitbucket, Jenkins, Jira, CI/CD, TDD, DevOps.

Summary

Davide Imperati's background builds on two decades of academia and corporate experience in quant research, data strategy, and large scale cloud migration. His technical experience is compounded with robust soft skills and deep understanding of business domain in finance, telecom, media, logistics, and digital marketing. He operates during the initial phases of green field data-driven projects (PoC – Pilot). He also has a proven experience intervening in under-performing data related projects and deliver them controlling for budget, time, and resource constraint.

Skills

  • Data Modeling And Database Design
  • Data Integration And Transformation
  • Data Governance And Security
  • Big Data Technologies (E.g., Hadoop, Spark, Nosql)
  • Data Warehousing
  • Cloud Computing
  • Sql Programming
  • Etl (Extract, Transform, Load)
  • Streaming (Spark, Kafka, Flink, Kinesis)
  • Business Intelligence Tools
  • Analytics And Reporting
  • Data Visualization
  • Machine Learning And Ai Techniques
  • Statistical Analysis
  • Data Quality Management
  • Data Profiling
  • Metadata Management
  • Schema Design And Optimization
  • Capacity Planning And Performance Tuning
  • Database Administration
  • Backup And Recovery
  • Disaster Recovery Planning
  • Nosql Databases (E.g., Mongodb, Cassandra)
  • Relational Databases (E.g., Oracle, Mysql, Sql Server)
  • Data Architecture Frameworks (E.g., Togaf)
  • Data Governance Frameworks (E.g., Dama-dmbok)
  • Agile Software Development Methodologies
  • Project Management
  • Data Privacy And Compliance (E.g., Gdpr, Ccpa)
  • Data Storage Technologies (E.g., San, Nas)
  • Data Access And Authentication
  • Cloud Storage (E.g., S3, Azure Blob)
  • Data Virtualization
  • Data Federation
  • Api Design And Development
  • Web Services
  • Distributed Systems
  • Performance Testing And Optimization
  • Systems Integration
  • Data Architecture Governance
  • Data Flow Analysis
  • Conceptual, Logical, And Physical Data Models
  • Multi-dimensional Data Modeling
  • Master Data Management
  • Reference Data Management
  • Data Lineage And Traceability
  • Data Migration
  • Data Transformation
  • Data Enrichment
  • Data Classification And Categorization
  • Agile
  • Scrum
  • Kanban
  • Xp
  • Extreme Programming
  • Ttd
  • Bdd
  • Listening To People
  • Project Delivery
  • Stakeholder Management
  • Product Owner
  • Influence
  • Leadership
  • Fixing Processes
  • Waterfall
  • Book Of Work
  • Milestones
  • Backlog
  • Jira
  • Trello
  • Continuous Delivery
  • Continuous Integration
  • Jenkins
  • Versioning Systems
  • Git
  • Bitbucket
  • Svn
  • Java
  • Python
  • Pandas
  • Numpy
  • Scikit
  • Scipy
  • Nltk
  • Statistics
  • Analytic
  • Machine Learning
  • Artificial Intelligence
  • Regression
  • Decision Trees
  • Random Forests
  • Support Vector Machines
  • Tensorflow
  • Neural Networks
  • Reinforcement Learning
  • Multiarmed Bandits
  • Expert Advise
  • Object Oriented Programming
  • Oop
  • Solid Principles
  • Cloud
  • Aws
  • Google Cloud
  • Gcp
  • Reporting

Languages

Italian
Native
German
Advanced
English
Advanced

Education

PhD · Computational Statistics

MSc · Computer Science

New York University

PostDoc · New York, United States

...and 1 more

Certifications & licenses

Certified AWS Cloud Practitioner

Certified PADI Instructor

PADI

Certified Scrum Product Owner

Profile

Created
Last Update
Need a freelancer? Find your match in seconds.
Try FRATCH GPT
More actions

Frequently asked questions

Do you have questions? Here you can find further information.

Where is Davide based?

Davide is based in Msida, United Kingdom and can operate in on-site, hybrid, and remote work models.

What languages does Davide speak?

Davide speaks the following languages: Italian (Native), German (Advanced), English (Advanced).

How many years of experience does Davide have?

Davide has at least 10 years of experience. During this time, Davide has worked in at least 7 different roles and for 6 different companies. The average length of individual experience is 1 year and 1 month. Note that Davide may not have shared all experience and actually has more experience.

What roles would Davide be best suited for?

Based on recent experience, Davide would be well-suited for roles such as: Consultant – Lead Data Engineer, Consultant – Tech Lead and Machine Learning Engineer, Consultant – Core Data Engineering Lead, Solution Architect (Neuron Program).

What is Davide's latest experience?

Davide's most recent position is Consultant – Lead Data Engineer at Awaze.

What companies has Davide worked for in recent years?

In recent years, Davide has worked for Awaze and Many Pets (Bought By Many).

Which industries is Davide most experienced in?

Davide is most experienced in industries like Banking and Finance, Tourism and Hospitality, and Insurance. Davide also has some experience in Media, Entertainment and Publishing and Telecommunication.

Which business areas is Davide most experienced in?

Davide is most experienced in business areas like Information Technology (IT), Business Intelligence, and Marketing. Davide also has some experience in Customer Service, Project Management, and Research and Development (R&D).

Which industries has Davide worked in recently?

Davide has recently worked in industries like Tourism and Hospitality and Insurance.

Which business areas has Davide worked in recently?

Davide has recently worked in business areas like Business Intelligence, Information Technology (IT), and Customer Service.

What is Davide's education?

Davide holds a Doctorate in Computational Statistics and a Master in Computer Science.

Does Davide have any certificates?

Davide has 3 certificates. These include: Certified AWS Cloud Practitioner, Certified PADI Instructor, and Certified Scrum Product Owner.

What is the availability of Davide?

Davide is immediately available full-time for suitable projects.

What is the rate of Davide?

Davide's rate depends on the specific project requirements. Please use the Meet button on the profile to schedule a meeting and discuss the details.

How to hire Davide?

To hire Davide, click the Meet button on the profile to request a meeting and discuss your project needs.

Average rates for similar positions

Rates are based on recent contracts and do not include FRATCH margin.

1000
750
500
250
Market avg: 650-810 €
The rates shown represent the typical market range for freelancers in this position based on recent contracts on our platform.
Actual rates may vary depending on seniority level, experience, skill specialization, project complexity, and engagement length.