Paul M.

Data Engineer

Warsaw, Poland

Experience

Mar 2023 - Nov 2025
2 years 9 months

Data Engineer

Luxoft

  • Built and deployed an end-to-end enterprise data integration platform (CloverDX ETL, Python, PostgreSQL, AWS) that ingests, validates, and structures raw analytical datasets, supporting AI-powered automation for financial operations and digital banking workflows.
  • Designed data extraction connectors that collect fragmented structured and semi-structured sources and align them to the unified schema definitions required by downstream analytics workflows.
  • Built automated data loading and distribution jobs targeting multi-region storage across S3, RDS, and Redshift, ensuring secure data availability for risk analysis, fraud detection models, credit scoring, and scalable financial reporting.
  • Worked closely with business product leaders to evaluate new integration paths and prototype rapid connectors for high-priority data partners.
  • Provided on-call operational support to troubleshoot ingestion failures, data latency bottlenecks, and corrupted financial files, performing root-cause analysis through transaction-level replay and controlled environment replication.
  • Maintained automated data quality profiling, validation rulesets, and error-handling flows, ensuring consistency and reducing manual reconciliation across systems.
  • Created internal technical documentation including data lineage, field definitions, reconciliation rules, financial lifecycle diagrams, and mapping specifications used across engineering, compliance, and support teams.
Oct 2021 - Feb 2023
1 year 5 months

Data Engineer

Unicage

  • Built a cloud-based ETL ingestion framework using Airflow, Python, Aurora PostgreSQL, and AWS Lambda to integrate multiple partner data providers into financial-grade web applications.
  • Developed custom SQL transformation scripts with field-level validation logic to handle malformed input and edge-case behavior from third-party interfaces.
  • Integrated data warehousing concepts including dimensional modeling and incremental loading patterns to support scalable insights tooling.
  • Collaborated with security teams to align data access flows with regulatory controls and auditing documentation.
  • Introduced automated regression data tests, enabling detection of mapping drift before deployment to production systems.
Apr 2019 - Oct 2021
2 years 7 months

Data Engineer

Biobot Analytics

  • Built large-scale COVID-19 public health data processing pipelines (Databricks, Apache Spark, Snowflake, AWS) that ingest real-time case-reporting feeds from hospitals, diagnostic labs, and national open-data programs, supporting public health intelligence platforms.
  • Integrated disparate raw datasets including vaccination progress tracking, ICU bed utilization, mortality curves, and population density metrics into curated warehouse models designed for advanced epidemiological and operational analysis.
  • Designed automated data validation rules and quality-scoring frameworks using anomaly detection and threshold-based alerting tied to pipeline health metrics.
  • Built operational observability dashboards in Grafana and Cloud Monitoring, visualizing pipeline latency, throughput, and schema-change impact to support proactive issue detection.
  • Provided rapid response support during emergency reporting intervals, verifying the correctness of published datasets prior to high-visibility distribution.
Feb 2018 - Mar 2019
1 year 2 months

Data Developer Intern

Amazon

  • Modernized legacy ETL workflows by migrating to modular, service-based pipelines, reducing operational maintenance and improving reliability across data systems.
  • Built automated ingestion frameworks for partner data feeds with cleansing and normalization, reducing processing time and improving data accuracy.
  • Partnered with security & compliance teams to integrate regulated access controls and audit mechanisms, ensuring alignment with enterprise governance and regulatory standards.

Summary

Cloud-focused Data Engineer with nearly eight years of hands-on experience designing and delivering high-reliability data processing systems, enterprise ETL pipelines, and distributed integration platforms across financial and AI-driven environments. Deep background in integrating complex data sources, optimizing large-scale pipelines, and ensuring data integrity for mission-critical applications. Collaborates closely with cross-functional teams, including analysts, architects, and business stakeholders, in fast-paced environments.

Languages

English
Advanced

Education

The University of Tokyo

B.S. · Computer Science · Japan
