Thomas Hoefkens
Senior MLOps, DevOps Engineer
Experience
Senior MLOps, DevOps Engineer
Trianel Energy
- Setting up and operating an end-to-end MLOps platform on Azure ML and Kubernetes (Kubeflow) for automated deployment, monitoring, and scaling of forecasting models (e.g. Temporal Fusion Transformer, Informer, Autoformer).
- Implementing CI/CD pipelines in Azure DevOps for the complete ML lifecycle – from resource provisioning (Terraform), data transformation (Hugging Face Datasets, Pandas, PyTorch, CUDA cluster) through training and evaluation to model registry and endpoint deployment.
- Integrating MLflow for experiment tracking, model versioning, performance monitoring, and automated registration in the Azure Model Registry.
- Developing and containerizing PyTorch training jobs (Azure Notebook, Jupyter Notebooks) for price and time series forecasts (PFC models) with automatic rollout via Azure ML endpoints and REST/gRPC interfaces, Docker containerization, and securing with OAuth 2.0.
- Setting up monitoring and alerting mechanisms (Prometheus, MLflow metrics), centralized logging, and cost tracking.
- Automating infrastructure provisioning and model deployment using Terraform, Helm, and Azure CLI; connecting to existing market data systems and event pipelines.
- Migrating existing workloads and databases (IONOS → Azure, MongoDB) with integration into central MLOps workflows and internal networks.
- Extending the platform with LLM-based tools (LangChain, LangServe) to integrate GPT-based analysis modules into existing Spring Boot services for market anomaly detection and automated reports.
- Designing and architecting a software solution to efficiently process mass data (>3000 messages/sec) (market data store).
- Spring Boot / Java 21 container development with RabbitMQ to distribute market data via MongoDB (Kubernetes) with fast data storage in Redis RMaps, deduplication, and forwarding messages to read model queues. Building read models for UI display in MongoDB.
- Integrating RESTHeart to create a REST API for MongoDB.
- Building an Angular frontend to simplify data querying and master data management.
- Agentic coding with remote and local LLMs (Claude Sonnet, Ollama Qwen) and MCP servers.
- Developing Python scripts for transforming and cleaning incoming market data (Pandas, scikit-learn).
Senior DevOps Engineer and Platform Architect
Mynaric Laser Communications AG
- Architecting and designing a DevOps and deployment platform.
- Setting up a DevOps and software deployment platform based on Azure AKS and AWS ECS/EKS, GitLab Enterprise, and Ansible.
- Migrating VMware workloads to Azure AKS and AWS EKS.
- Configuring and deploying applications with Microsoft Entra (app registrations, app roles, published web APIs with OAuth authentication and authorization).
- Developing serverless backend services in TypeScript.
- Building GraphQL APIs against a Neo4j database.
- Writing Ansible playbooks and setting up inventory for VMware-managed virtual machines.
- Automating playbook deployment via Ansible Control Tower (open source AWX) for Unix instances.
- Implementing GitOps with GitLab and creating GitLab pipelines.
- Deploying infrastructure on Azure using Pulumi and Terraform.
- Agile teamwork, SCRUM.
Senior MLOps, DevOps Engineer and Platform Architect
Dyrisk GmbH / MunichRE
- Building and operating a company-wide MLOps infrastructure on Azure Databricks to automate the full ML lifecycle from data processing to model deployment.
- Implementing scalable ETL/ELT pipelines with Airflow and Databricks to continuously feed training and inference pipelines with cleaned risk data.
- Automating training, evaluation, and deployment pipelines with MLflow, Azure DevOps, and Databricks Workflows, including automatic model registration, versioning, and promotion across stages.
- Containerizing and deploying trained models as scalable REST/gRPC services via Azure ML endpoints and Kubernetes; integrating them into production risk and security platforms.
- Setting up infrastructure-as-code provisioning with Terraform, Helm, and Kustomize for reproducible deployments, including monitoring, logging, and security components.
- Implementing end-to-end monitoring with Prometheus, Grafana, and the Elastic Stack (Beats, Logstash) to track pipeline performance, model metrics, and system health.
- Building a secure authentication and access infrastructure (Keycloak, OAuth 2.0, OIDC, JWT) for APIs and internal MLOps services.
- Developing and integrating .NET Core microservices to provide model APIs and orchestrate services in production.
- Designing and implementing CI/CD pipelines (Azure DevOps, GitLab CI) for automated testing, container builds, release management, and multi-cloud deployment.
- Monitoring, logging, and incident handling for production machine learning workloads in on-premises and cloud environments.
- Managing central SSH and access control via Teleport to ensure compliance and auditability.
- Agile teamwork in a SCRUM process with close coordination between data engineering, MLOps, and DevOps.
Senior AWS Cloud Expert, Digital Transformation Architect
EnBW Energie Baden-Württemberg
- Setting up and configuring Red Hat OpenShift, deployment monitoring, alerting, DB operator, and Nginx Ingress Controller.
- Developing Java (Spring Boot / Spring Cloud) AWS Lambda microservices with an API-first approach using OpenAPI.
- Containerizing microservices with Docker and creating Docker Compose definitions for local development and testing.
- Writing automated unit tests using AWS LocalStack.
- Building an Angular 12/TypeScript frontend to display automated energy trading with options for manual intervention and correction.
- Creating streaming connectors for AWS MSK (managed Kafka) to automate processing of marketplace trading streaming messages.
- Setting up a base AWS resource pipeline and per-microservice pipelines to automate infrastructure creation with AWS CDK, EKS cluster setup with external DNS, AWS Load Balancer Controller for automatic load balancing, and Route53 configuration.
- Provisioning EC2 instances with Terraform and accessing them via AWS SSM.
- Integrating with AWS IAM and Cognito (single sign-on); working with AWS Control Tower and VPC networking via Transit Gateway attachments.
- Developing Helm charts for automated OpenShift deployments.
- OpenAPI-first approach for backend services and Swagger UI integration with backend endpoints.
- Agile teamwork, SCRUM ceremonies, bi-weekly sprints.
Senior Cloud Architect and FullStack Engineer
Bayer AG, Digital Farming
- Developing a field and crop management solution for large agricultural enterprises, consisting of AWS Lambda-based .NET Core (C#) microservices, supported by domain-driven design and event sourcing, plus two frontend solutions (Angular for web, Xamarin (now MAUI) as a cross-platform mobile management solution).
- Frontend TypeScript development (Angular) and .NET Standard development (Xamarin) as part of a fully cross-functional team responsible for delivering all technical components of a requirement (e.g. frontend changes, backend development including Terraform and CI/CD setup, automated unit tests, and Sonar quality checks).
- Implementing a fully decoupled architecture using SQS, DynamoDB, API Gateway, Route53, AWS Lambda, and .NET IDP with Azure AD federation and JWT-based authentication/authorization.
- Building GraphQL APIs with schema stitching across multiple backend farming data sources (weather forecasts, spraying recommendations, pest data).
- Creating a .NET Core CLI tool for the technical management of the digital farming platform.
- Developing cross-platform iOS/Android features (Xamarin), push notifications, and mapping using Carto maps, including an extension plugin for VS Code that links JavaScript with C# via the V8 engine.
- Developing GitLab CI pipelines.
- Integrating Raygun for centralized logging.
- Agile teamwork, SCRUM ceremonies, bi-weekly sprints.
Senior Cloud Developer
Otis France
- Development of field service applications (Kony platform for iOS and Android) and backend endpoints (Java Spring Boot, Spring Cloud).
- Integration of internal APIs (Asset Management, Field Service Management).
- Development of Azure Functions and Function Apps (C#).
- Development of ASP.NET MVC admin interfaces.
- Azure DevOps (Team Foundation Server) pipeline development.
- Oracle 12 PL/SQL development, database design and maintenance.
Technical Project Lead
Mobility Media-Saturn E-Business GmbH
- Technical lead for evaluating, selecting and implementing a mobility platform (MDM and MAM), setting up an internal enterprise app store and BYOD policies.
- Kony platform development of various apps based on the Kony platform / cross-platform development in JavaScript for Android, iOS and Windows tablets.
- MC@POS → Kony app for use in stores, product comparison, inventory, pricing, online in-store orders.
Technical Team Lead, Senior Developer, Enterprise Architect for the CRM domain
Telefonica 02 Germany GmbH & Co KG
- Team lead for CRM and order management applications, CRM domain enterprise architect, developer and liaison to business and operations stakeholders and senior management.
- Development of service requests, integration of over 40 systems across the entire provisioning and billing landscape.
- Middleware integrations via Tuxedo, MQSeries, WebLogic and WebSphere.
- UI development, backend development (Oracle-based and Java server-based), integration with middleware systems such as RabbitMQ, Tibco and Tuxedo (REST, SOAP-based systems and database connectors).
- Led a complex fat-client upgrade that enabled VBA customizations but encountered 32-bit limitations; upgraded to the Microsoft VSTA engine (a unique project worldwide!).
Technical Team Leader, Senior Developer
Deutsche Bahn AG
- Development of an HR platform and an e-recruiting mirror on the "Internet" side based on PeopleSoft HCMS.
- Development of a ticketing system based on JBoss (backend) and Apache MyFaces UI.
- Development of the Deutsche Bahn enterprise portal (employee-focused).
- Broker messaging development, asynchronous message delivery.
- Development of batch-mode application runs.
Application Developer
Telefonica 02 Germany GmbH & Co KG
- WebLogic 5.1 Java development, EJB development.
- Development of a JSP-based frontend.
- PoC setup of the Oracle 8i jServer.
- Migration of web applications to WebLogic 6.0.
- Win32 API development.
- Vantive 8.2 frontend development support.
Summary
Passionate Senior MLOps, DevOps, and Cloud/Platform Engineer with extensive experience in developing and operating scalable platforms and AI-powered solutions. Expertise in building end-to-end MLOps workflows: data preparation (ETL/ELT pipelines), developing ETL/ELT pipelines in Python (Pandas, PySpark, Airflow, Dask), experiment tracking and model versioning with MLflow, using Azure Databricks Data Lake for data engineering and training, feature stores (Feast), and automated deployment of trained models as services.
Skilled in implementing continuous training and delivery pipelines (CI/CD) for ML and software components using GitLab CI/CD, Azure DevOps, and Kubeflow.
Strong knowledge of virtualization and container orchestration (Kubernetes: OpenShift, EKS, AKS, OVH Kubernetes, Docker) and infrastructure as code with Terraform and Ansible. Proficient in observability, classic and model monitoring with Prometheus, Grafana, and traditional logging stacks (Loki, ELK, DataDog). Experienced in backend development with C#, Java, TypeScript, Python, and Bash scripting, as well as SQL and NoSQL databases.
Skills
- Programming & Frameworks: C# .Net Core, Java (Spring Boot, Spring Cloud, J2ee), Golang, Python (Pandas, Numpy, Scikit-learn, Tensorflow, Pytorch), Bash Scripting
- Frontend & Cross-platform: Typescript, Angular, React, Macos, Xamarin, Maui
- Cloud & Container Platforms: Aws Cloud, Azure Cloud, Ovh Cloud, Gcp, Docker, Kubernetes, Aks, Eks, Red Hat Openshift, Vmware Tanzu, Cluster Controller, Cluster Operator
- Infrastructure As Code & Ci/cd: Terraform, Ansible, Pulumi, Kustomize, Helm Charts, Github Actions, Gitlab Ci, Azure Devops, Argocd, Flux, Jenkins Pipelines
- Messaging, Data & Databases: Rabbitmq, Redis, Mongodb, Neo4j, Sql, Oracle, Aws Msk (Managed Kafka)
- Observability & Logging: Prometheus, Grafana, Elasticsearch, Kibana, Beats, Loki, Datadog
- Mlops & Data Engineering: Mlflow, Azure Databricks, Apache Spark, Feast, Dvc
- Machine Learning & Ai: Xgboost, Scikit-learn, Hugging Face Transformers
Languages
Similar Freelancers
Discover other experts with similar qualifications and experience