Machine status detection in industrial 3D printing based on infrared image data
Baseline: Client has infrared cameras installed inside production machines for manual monitoring. Goal: Automate the process monitoring to identify irregularities from live images. Activities: I contributed to this project as a machine learning developer and project coordinator:
Outcome: Provided a production-ready system to inform operators of potential faults in real time.
Technologies: PyTorch, Convolutional neural networks, docker, git, OpenCV, FastAPI, computer vision, pytorch-lightning, ipywidgets
Quality prediction based on time series data in a manufacturing context
Baseline: Multitude of sensors collects time series data from production machines at high frequency. Goal: Identify patterns in the data linked to final product quality. Activities: I worked in this project as a machine learning developer and project coordinator:
Outcome: Derived qualitative insights to guide process engineers in optimizing production.
Technologies: PyTorch, CNNs / ROCKET model, data wrangling, git, time series classification, explainable AI
Automated question answering via retrieval of internal documents
Baseline: Company employees manually search internal documentation for information. Goal: Integrate chatbot for question answering (QA) into company communication platform. Activities: I supported this project as a machine learning scientist:
Outcome: Made responsive chatbot for document-based extractive QA available to all employees.
Technologies: transformers, haystack, LLMs, document retrieval, semantic search, question answering, BERT, git, beautifulsoup4
Estimation and analysis of variance spillover networks for academic research
Baseline: Existing network estimation methodology only works reliably on small datasets. Goal: Apply machine learning methods to analyze large datasets of economic and financial data. Activities: I realized this project as an econometrics researcher:
Outcome: Published acclaimed paper in Quantitative Economics, follow-up papers under review.
Technologies: pandas / numpy, time series forecasting, scikit-learn, python, SQL, glmnet, econometrics, vector auto-regression, networkx