Jayana Shah
Implementation of Data Management Tool for LLM & Speech Technologies
Experience
Implementation of Data Management Tool for LLM & Speech Technologies
Fraunhofer IIS
- Developed a PoC for data processing and integration of an open-source TTS/LLM data management platform
- Automated metadata pipelines, boosting efficiency by 30%
- Implemented IAM, data quality checks, and versioning; conducted gap analyses with UI/UX and HMI designers
- Built KPI dashboards in Power BI to monitor performance, conducted usability tests, and ensured GDPR and AI privacy compliance
- Tools used: Openmetadata, Airflow, Python, SQL, MySQL, Elasticsearch, Docker, RBAC, Power BI, S3, GCP, LLMs into workflows
AI Data Engineer Intern
Fraunhofer IIS
- Built metadata pipelines with Python, SQL, dbt, Airflow, Snowflake, ETL workflows
- Evaluated open-source tools for LLM data management, lineage, catalog, and governance in TTS workflows
- Used Docker for deployments (AWS/GCP), conducted interviews, integration tests, and documented findings
AI Research Assistant - Machine learning and Deep learning
AIBE Lab, Friedrich-Alexander-Universität
- Applied transfer learning on CNNs for cochlear implant EEG data, fine-tuned and optimized models to 74% accuracy
- Leveraged normal-hearing EEG data, visualized performance, and generated insights for neuro-steered hearing aids
Working Student - Data Engineering & Software Support
Fraunhofer IIS
- Designed, monitored, and diagnosed ELT/ETL pipelines within EasyDCP software workflows
- Automated testing and QA processes with CI/CD integration, improving system reliability by 25%
- Provided L2/L3 technical support, managed Jira tickets, collaborated with developers for RCA and bug fixing
- Researched digital twin in education, analyzed applications and identified key research gaps
Software Engineer
Newgen Software Technologies
- Led Agile SDLC for BPMN product software, aligned business and IT teams for seamless delivery, and mentored juniors
- Developed and optimized secure RESTful APIs and backend systems using Java, Python, JSP, and MERN stack
- Conducted code and design reviews, wrote clean, well-tested, performant code, performed unit and A/B testing, and managed production deployments
- Liaised with Indian banks including Axis Bank to deploy payment and loans workflow management solutions
Python Developer Intern
Kubix Square
- Built a Chrome extension using JavaScript, HTML, PHP, and CSS to highlight webpage elements on hover
- Automated web scraping and browser tasks with Selenium, delivering a functional Google plugin
Clinical Medical Text Analysis Dashboard
- Built a Power BI dashboard to analyze HL7 medical text and evaluate clinical NLP data quality for disease types
- Tools used: Tableau, Data Cleaning, Visualization & Analytics, Microsoft Office, Google Colab
E-Commerce Data Warehouse Development
- Developed a scalable e-commerce Data Vault (Hubs, Links, Satellites) following Star Schema Mart
- Tools: Microsoft Fabric, Power BI & Query, Data Visualization, Analytics & Modeling, Data Vault 2.0
News Recommender System
- Built a personalized news recommendation engine using TF-IDF and cosine similarity to suggest relevant articles
- Tools used: Pickle, TF-IDF, cosine similarity, Jupyter Notebook, EDA
Impact of Emotional Intelligence for Professional Growth
St. Francis Institute of Technology
- Analyzed emotional intelligence and academic performance, integrated ML and UI/UX with 89% accuracy
- Tools used: Python, Pandas, Scikit-learn, OpenCV, Matplotlib, Seaborn, Jupyter Notebook, EDA, ML frameworks, UI/UX
Summary
Detail-oriented Engineer with 3+ years of hands-on experience developing automated data pipelines, software solutions using Python, SQL, Airflow. Skilled in delivering scalable data architectures, IT systems, optimizing workflows, and improving system resilience within production environments.
Adept at cross-functional collaboration, CI/CD automation and translating complex business requirements into efficient data products that drive actionable insights.
Skills
Programming: Python, R, Java, Javascript (React, Node.js), C/c++, Object Oriented (Oop), Sql, Linux, Windows.
Ml & Deep Learning: Tensorflow, Keras, Hugging Face, Pytorch, Fastapi Numpy, Json, Csv, Xml, Rdf, Hdf5.
Databases & Vector Databases: Postgresql, Mysql, Mssql, Nosql, Mongodb, Snowflake.
Ticketing, Testing: Confluence, Kanban, Git/github, Jenkins, Kubernetes, N8n.
Data Engineering & Cloud Ml Platforms: Aws (S3), Bigquery, Azure, Spark, Devsecops/devops.
Analytics & Bi: Power Bi, Kpi Dashboards, Tableau.
Other Tools: Collibra, Figma, Condens, Agile/scrum, Microsoft Office Package (M365), Excel.
Soft Skills: Problem-solving, Fast Learner, Active Listener, Team Player, Collective Effort, Self-directed, Stakeholder Management And Customer Orientation, Analytical Thinking And Conceptual Thinking Skills, Solution-oriented, Structured Work And Strong Documentation Skills.
Languages
Education
Friedrich-Alexander-Universität Erlangen-Nuremberg
Master of Data Science · Data Science · Nuremberg, Germany
St. Francis Institute of Technology, Mumbai University
Bachelor of Information Technology · Information Technology · Mumbai, India
Certifications & licenses
Getting Started With Data Analytics On AWS
Certificate On Machine Learning Models And AI Using Python
ATS Solutions
Similar Freelancers
Discover other experts with similar qualifications and experience