Recommended expert
Surya (Vara prasad) Alla
AI Software Engineer
Experience
Jan 2024 - Present
2 years 1 monthHamm, Germany
AI Software Engineer
Fraunhofer FIT
- Developed LLM-based automation utilities including structured reasoning pipelines, LLM-as-a-Judge evaluation tools, and multi-model comparison frameworks.
- Built RAG pipelines for internal research workflows using LangChain, ChromaDB, and FastAPI, enabling semantic retrieval and multi-step reasoning.
- Integrated LLM microservices into existing ML systems using Docker, FastAPI, and GitLab CI/CD with reproducible deployment workflows.
- Designed inference APIs combining vision models and LLM reasoning for multimodal analytics and decision-making.
- Optimized embedding-based retrieval using vector store pruning, improved chunking logic, and dynamic retriever selection.
- Performed prompt engineering and system instruction tuning for consistency, robustness, and reasoning quality.
- Built benchmarking suites to evaluate LLM latency, reasoning quality, retrieval accuracy, and robustness under different prompt templates.
Jul 2022 - Dec 2023
1 year 6 monthsSiegen, Germany
Machine Learning Engineer & Technical Co-Founder
InnoSaddle GbR
- Developed ML-driven geometry extraction and analysis workflows, integrating preprocessing, measurement logic, and prototype inference services.
- Collaborated with technical and non-technical partners to translate ML insights into product features and user-facing tools.
- Built reproducible experimentation pipelines and provided explainability reports for ML outputs.
Feb 2018 - Aug 2019
1 year 7 monthsHyderabad, India
Software Engineer (Data)
Wipro Technologies
- Developed structured ETL workflows supporting downstream AI/analytics use-cases across Azure Data Lake, Databricks, and SQL systems.
- Optimized data pipelines and improved monitoring reliability for production ingestion systems.
Summary
GenAI Engineer with experience in designing LLM-based systems, RAG pipelines, multimodal AI workflows, and scalable inference services. Skilled in embeddings, vector databases, LLM orchestration, prompt engineering, API design, and optimizing model execution on GPU/edge devices. Blend of software engineering, ML engineering, and MLOps enables end-to-end ownership of GenAI products from prototype to production.
Skills
- Languages: Python, C++, Bash
- Llm Tooling: Langchain, Langgraph, Openai/anthropic Apis, Vllm, Huggingface, Transformers
- Rag: Embeddings (Openai, Mpnet), Chromadb, Faiss, Vector Stores, Retrievers, Chunking
- Genai: Prompt Engineering, Llm Evaluation, Multimodal Models, Tool-calling, Text+vision Pipelines
- Mlops: Dvc, Mlflow, Gitlab Ci/cd, Docker, Kubernetes (Familiar)
- Deployment: Fastapi Microservices, Onnx Runtime, Tensorrt, Gpu/jetson Systems
- Cloud: Aws (S3, Ec2, Ecr), Container Deployment
- Other: Numpy, Opencv, Open3d, Scipy, Linux
Languages
German
AdvancedEnglish
AdvancedEducation
Oct 2019 - Sep 2023
University of Siegen
M.Sc. Mechatronics, Specialization: Computer Vision, Deep Learning, C++ · Mechatronics · Siegen, Germany
Sep 2013 - Mar 2017
Osmania University
B.E. Mechanical Engineering · Mechanical Engineering · Hyderabad, India
Need a freelancer? Find your match in seconds.
Try FRATCH GPT More actions
Similar Freelancers
Discover other experts with similar qualifications and experience