Davide Imperati
Consultant – Lead Data Engineer
Experience
Consultant – Lead Data Engineer
Awaze
- Federated data across three major brands and a few minor ones
- Defined a target data model and guided the migration from legacy systems to the Snowflake data warehouse
- Implemented master data management in the SaaS vendor tool (Step by Stibo)
- Extracted data from legacy systems, transformed it for compatibility with the new data model, and loaded it into the data warehouse preserving lineage
- Enabled the frontend team to fetch data from the unified system to support cross-selling across brands
- Centralized data governance by enabling data stewards to perform governance according to business functions
- Implemented logging, monitoring, and enhanced aggregate data visualization (basic analytics) under dual-run constraints over two years
Technologies: Python, Pandas, SciPy, FastAPI, Flask, Django, Git, GitHub, Jenkins, Jira, ClickMe, CI/CD, TDD, DevOps, Terraform, Docker, Snowflake, PostgreSQL, AS400, Matillion, AWS (S3, ECS, ELB, Route53, ECR), sFTP, dbt, API, Step (Stibo), Linux, Bash scripting.
Consultant – Tech Lead and Machine Learning Engineer
Many Pets (Bought By Many)
- Onboarded internal and external datasets to support customer service and marketing
- Automated import of PureCloud data, reformatted to enable advanced call center monitoring, delivering 15% performance improvement internally and 42% for a third-party center
- Automated import of Mention-Me data for marketing analytics, replacing manual processing and saving time
- Set up Airflow to run database manipulations with dbt and analytics tools in a containerized environment to improve performance and decouple dependencies
Technologies: Python, Pandas, SciPy, FastAPI, Flask, Django, Git, GitHub, Jenkins, Jira, ClickMe, CI/CD, TDD, DevOps, Terraform, Docker, Fivetran, BigQuery, Composer-Airflow, GCP, dbt, API, sFTP, New Relic.
Consultant – Core Data Engineering Lead, Solution Architect (Neuron Program)
Vodafone
- Led migration of Vodafone’s Big Data platform to Google Cloud for all European markets handling terabytes per day
- Refurbished core data engineering squad capabilities after IR35 impact, assessing and mitigating technical debt
- Negotiated scope reduction with stakeholders to meet timelines and budget under COVID-19 constraints
- Delivered migration with minor delay despite loss of knowledge and missing documentation
Technologies: Java EE, Scala, Python, PySpark, GitHub, Jenkins, Jira, CI/CD, TDD/BDD, DevOps, test automation, load/stress testing, cost optimization, GCP (Dataflow, Composer, Dataproc, Cloud Storage, BigQuery, Bigtable, Spanner, Flink), Kubernetes, Docker, Terraform.
Consultant – Quant Research and Solution Architect
Lloyds Banking Group
- Revamped automated trade surveillance platform to meet auditor criteria
- Mediated among stakeholders to standardize approaches across asset classes and ensure developer alignment
- Defined templates for efficient, standardized analytics implementation
- Implemented high-end analytics using NLP, machine learning, and advanced quantitative methods
- Achieved audit passing with significant cost reduction and 67% spam reduction in alerts
Asset classes: FX spot/options, rates futures, bonds, swaps, repo, bespoke OTC Technologies: Java EE, Python, Pandas, NLTK, SciPy, NumPy, PySpark, Dask, Kafka, Bitbucket, Jenkins, Jira, CI/CD, TDD, DevOps, SVN, Confluence.
Consultant – Principal Data Scientist
News UK – The Times
- Delivered “Project James,” a reinforcement learning AI for direct marketing optimization under a Google innovation grant
- Assessed a partially implemented platform, rebuilt the core using state-of-the-art tools, and tuned for production viability within schedule
- Overcame time pressure, partial documentation, and lack of prior case studies
- Revolutionized churn reduction and supported award-winning contact center
Technologies: Python, pandas, SciPy, NumPy, TensorFlow, Django, Flask, GitHub, Jenkins, Jira, GitOps, CI/CD, DevOps, Kubernetes, Docker, Terraform, microservices, Confluence.
Consultant – Principal Data Scientist
News UK – The Times
- Developed an online propensity model and API to improve conversion and personalize user experience for The Times Digital
- Implemented real-time predictions at 1000+ predictions/sec with <250ms latency
- Increased subscriptions by 5% and cross-sales by 9% and piloted high-throughput API deployment on Kubernetes
Technologies: Python, Pandas, NLTK, SciPy, NumPy, Django, Nginx, Docker, Kubernetes, Terraform, TensorFlow, GitHub, Jenkins, Jira, CI/CD, DevOps, New Relic.
Vice President of Technology
JP Morgan Chase
- Managed delivery of a cloud logging and monitoring platform across 20-person, 3-site team for AWS public cloud adoption
- Reviewed architecture post-PoC and scaled the platform to handle 5TB/day (5 billion messages, 1.3 billion peak)
- Met strict cyber-security, availability, disaster recovery, and SLA/SLO requirements under constrained approved services
- Enabled monitoring of 5 mission critical cloud applications, pioneering new patterns and scalable architecture
Technologies: AWS (API Gateway, Route53, S3, DynamoDB, Kinesis, Elastic Beanstalk, Lambda, ELB, IAM, CloudWatch, CloudTrail), MySQL, Boto, Terraform, FluentD, Flink, Kafka, Kafka Streams, Kinesis Firehose, NiFi, Cassandra, CQL, Elasticsearch, Logstash, Kibana, Java EE, Python, Bitbucket, Jenkins, Jira, CI/CD, TDD, BDD, DevOps, Docker, Kubernetes, Datadog.
Vice President of Data Architecture
JP Morgan Chase
- Established standardized regulatory reporting across all businesses following regulatory change
- Created controlled vocabularies and automated metadata management procedures
- Served dictionaries and reference data via REST APIs within a microservice architecture
- Delivered tools to mitigate regulatory risk and provide corporate insight
Asset classes: FX spot/options, rates futures, bonds, swaps, derivatives, OTC Technologies: Java EE, Spring, Python, RDF, OWL, SPARQL, semantic web standards, ontologies, semantic wiki, knowledge graphs, Neo4j, BigQuery (Blazegraph), ISO20022, Bitbucket, Jenkins, Jira, CI/CD, TDD, BDD, DevOps, Docker.
Vice President of Data Architecture
JP Morgan Chase
- Developed meta-analytics for the Corporate and Investment Bank, labeling and scoring all data repositories and software products
- Defined data quality metrics and formal ontologies for logical data models
- Scanned metadata to infer physical models and linked them via heuristics, with manual refinement by information architects
- Increased productivity of information architects by 4.7× through semi-automated processes
Technologies: Java, Spring, Python, RDF, OWL, semantic web standards, ontologies, knowledge graphs, BigQuery, ISO11179, Bitbucket, Jenkins, Jira, CI/CD, TDD, DevOps.
Summary
Davide Imperati's background builds on two decades of academia and corporate experience in quant research, data strategy, and large scale cloud migration. His technical experience is compounded with robust soft skills and deep understanding of business domain in finance, telecom, media, logistics, and digital marketing. He operates during the initial phases of green field data-driven projects (PoC – Pilot). He also has a proven experience intervening in under-performing data related projects and deliver them controlling for budget, time, and resource constraint.
Skills
- Data Modeling And Database Design
- Data Integration And Transformation
- Data Governance And Security
- Big Data Technologies (E.g., Hadoop, Spark, Nosql)
- Data Warehousing
- Cloud Computing
- Sql Programming
- Etl (Extract, Transform, Load)
- Streaming (Spark, Kafka, Flink, Kinesis)
- Business Intelligence Tools
- Analytics And Reporting
- Data Visualization
- Machine Learning And Ai Techniques
- Statistical Analysis
- Data Quality Management
- Data Profiling
- Metadata Management
- Schema Design And Optimization
- Capacity Planning And Performance Tuning
- Database Administration
- Backup And Recovery
- Disaster Recovery Planning
- Nosql Databases (E.g., Mongodb, Cassandra)
- Relational Databases (E.g., Oracle, Mysql, Sql Server)
- Data Architecture Frameworks (E.g., Togaf)
- Data Governance Frameworks (E.g., Dama-dmbok)
- Agile Software Development Methodologies
- Project Management
- Data Privacy And Compliance (E.g., Gdpr, Ccpa)
- Data Storage Technologies (E.g., San, Nas)
- Data Access And Authentication
- Cloud Storage (E.g., S3, Azure Blob)
- Data Virtualization
- Data Federation
- Api Design And Development
- Web Services
- Distributed Systems
- Performance Testing And Optimization
- Systems Integration
- Data Architecture Governance
- Data Flow Analysis
- Conceptual, Logical, And Physical Data Models
- Multi-dimensional Data Modeling
- Master Data Management
- Reference Data Management
- Data Lineage And Traceability
- Data Migration
- Data Transformation
- Data Enrichment
- Data Classification And Categorization
- Agile
- Scrum
- Kanban
- Xp
- Extreme Programming
- Ttd
- Bdd
- Listening To People
- Project Delivery
- Stakeholder Management
- Product Owner
- Influence
- Leadership
- Fixing Processes
- Waterfall
- Book Of Work
- Milestones
- Backlog
- Jira
- Trello
- Continuous Delivery
- Continuous Integration
- Jenkins
- Versioning Systems
- Git
- Bitbucket
- Svn
- Java
- Python
- Pandas
- Numpy
- Scikit
- Scipy
- Nltk
- Statistics
- Analytic
- Machine Learning
- Artificial Intelligence
- Regression
- Decision Trees
- Random Forests
- Support Vector Machines
- Tensorflow
- Neural Networks
- Reinforcement Learning
- Multiarmed Bandits
- Expert Advise
- Object Oriented Programming
- Oop
- Solid Principles
- Cloud
- Aws
- Google Cloud
- Gcp
- Reporting
Languages
Education
PhD · Computational Statistics
MSc · Computer Science
New York University
PostDoc · New York, United States
Certifications & licenses
Certified AWS Cloud Practitioner
Certified PADI Instructor
PADI
Certified Scrum Product Owner
Profile
Frequently asked questions
Do you have questions? Here you can find further information.
Where is Davide based?
What languages does Davide speak?
How many years of experience does Davide have?
What roles would Davide be best suited for?
What is Davide's latest experience?
What companies has Davide worked for in recent years?
Which industries is Davide most experienced in?
Which business areas is Davide most experienced in?
Which industries has Davide worked in recently?
Which business areas has Davide worked in recently?
What is Davide's education?
Does Davide have any certificates?
What is the availability of Davide?
What is the rate of Davide?
How to hire Davide?
Average rates for similar positions
Rates are based on recent contracts and do not include FRATCH margin.
Similar Freelancers
Discover other experts with similar qualifications and experience
Experts recently working on similar projects
Freelancers with hands-on experience in comparable project as a Consultant – Lead Data Engineer
Nearby freelancers
Professionals working in or nearby Msida, United Kingdom