Shyam S.

Data Scientist

Berlin, Germany

Experience

Jun 2024 - Jul 2025
1 year 2 months
Lyon, France

Data Scientist

Université Gustave Eiffel

  • Developed an automated pipeline to evaluate ML models for predicting lane changes and future trajectories in autonomous driving.

  • Trained and optimized Transformer models, integrated large-scale trajectory datasets, and deployed the pipeline as a FastAPI service.

  • Achieved 15% higher accuracy, 18% lower error rates, and 25% faster training.

  • Technologies: Python, PyTorch, scikit-learn, MySQL, MLflow, GitHub Actions, S3.

  • Built an NLP pipeline using BERT and Llama 2 to extract and summarize critical information from disaster reports.

  • Applied supervised fine-tuning and prompt engineering to adapt models for structured summarization and entity extraction.

  • Produced concise, real-time summaries, enhancing disaster data workflows and decision-making speed.

  • Technologies: BERT, Llama 2, LoRA, PyTorch, Transformers, AWS, MLflow, FastAPI, Docker, GitHub Actions, CI/CD Pipelines, S3.

Jan 2024 - May 2024
5 months

Data Scientist

Freelance

  • Built an end-to-end ETL and clickstream A/B testing framework to optimize checkout flows, reducing analysis time by 30%.
  • Processed ~200k–300k events for funnel/drop-off analysis and trained Random Forest & XGBoost models, achieving ROC-AUC 0.82 and 15% higher precision in conversion prediction.
  • Technologies: Python, Pandas, scikit-learn, XGBoost, Git, AWS Lambda, S3, FastAPI, GitHub Actions, PostgreSQL, Docker.
Nov 2022 - Jan 2024
1 year 3 months
Bengaluru, India

Data Scientist

Bosch Limited

  • Built a cloud-based AI pipeline using YOLOv6 on SageMaker to automatically detect traffic violations from city video feeds, improving detection accuracy by 30% and vehicle identification by 21%.
  • Automated video processing: Lambda triggers inference on new S3 uploads and aggregates results in real time.
  • Added a GDPR-compliant anonymization plugin that masks faces and license plates using YOLOv6 and OpenCV, ensuring privacy while maintaining analytics accuracy.
  • Streamlined model updates and deployment with CI/CD, ensuring consistent and scalable operations.
  • Technologies: Python, YOLOv6, PyTorch, OpenCV, AWS Lambda, S3, SageMaker, CI/CD, GitHub Actions, Docker.
Sep 2021 - Oct 2022
1 year 2 months
Singapore

Data Scientist / Analyst

TUM Asia

  • Developed real-time traffic dashboards using city-wide camera feeds, enabling planners to monitor traffic flow and congestion effectively.
  • Processed data from 11 urban locations and generated actionable insights for traffic management.
  • Applied unsupervised clustering on GPS data to identify travel patterns, contributing to 15% cost savings in public transit planning.
  • Technologies: Python, Matplotlib, Plotly, PostgreSQL.
Oct 2019 - Sep 2021
2 years
Singapore

Research Associate / Data Analyst

Nanyang Technological University

  • Analyzed GPS data from a bike-sharing service to improve user satisfaction and operational efficiency.
  • Increased bike-sharing usage time by 18% through pattern recognition and optimized service allocations.
  • Designed interactive dashboards using Plotly for real-time performance tracking.
  • Technologies: Python, SQL, Plotly, Data Analysis.

Languages

English
Advanced
German
Intermediate

Education

Jul 2017 - Jun 2019

Technical University of Munich

Master of Science · Munich, Germany

Certifications & licenses

IBM Certified Data Scientist

IBM
