I am a Senior AI/ML Engineer with 12+ years designing, building, and scaling AI-driven products across healthcare, edtech, and fintech domains. I specialize in LLM systems, Retrieval-Augmented Generation (RAG), and multi-agent orchestration using LangChain, LangGraph, AutoGen and MCP. I also have hands-on experience with full-stack and backend software engineering (Python, FastAPI, Django) and modern MLOps/DevOps on AWS, Azure, and Kubernetes. I focus on reliability, safety, and scalability through LLMOps automation and continuous evaluation that aligns AI capabilities with user experience, privacy and safety requirements. I enjoy mentoring engineers and collaborating with cross-functional teams to deliver impactful, compliant AI solutions. I am passionate about turning research into production-ready AI systems that solve real problems, continuously improving performance and safety, and helping teams ship responsibly with measurable impact.

Robert Lungu

I am a Senior AI/ML Engineer with 12+ years designing, building, and scaling AI-driven products across healthcare, edtech, and fintech domains. I specialize in LLM systems, Retrieval-Augmented Generation (RAG), and multi-agent orchestration using LangChain, LangGraph, AutoGen and MCP. I also have hands-on experience with full-stack and backend software engineering (Python, FastAPI, Django) and modern MLOps/DevOps on AWS, Azure, and Kubernetes. I focus on reliability, safety, and scalability through LLMOps automation and continuous evaluation that aligns AI capabilities with user experience, privacy and safety requirements. I enjoy mentoring engineers and collaborating with cross-functional teams to deliver impactful, compliant AI solutions. I am passionate about turning research into production-ready AI systems that solve real problems, continuously improving performance and safety, and helping teams ship responsibly with measurable impact.

Available to hire

I am a Senior AI/ML Engineer with 12+ years designing, building, and scaling AI-driven products across healthcare, edtech, and fintech domains. I specialize in LLM systems, Retrieval-Augmented Generation (RAG), and multi-agent orchestration using LangChain, LangGraph, AutoGen and MCP. I also have hands-on experience with full-stack and backend software engineering (Python, FastAPI, Django) and modern MLOps/DevOps on AWS, Azure, and Kubernetes. I focus on reliability, safety, and scalability through LLMOps automation and continuous evaluation that aligns AI capabilities with user experience, privacy and safety requirements. I enjoy mentoring engineers and collaborating with cross-functional teams to deliver impactful, compliant AI solutions.

I am passionate about turning research into production-ready AI systems that solve real problems, continuously improving performance and safety, and helping teams ship responsibly with measurable impact.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
See more

Language

English
Fluent

Work Experience

Senior AI/ML Engineer at Mantracare
June 1, 2022 - September 1, 2025
Architected and deployed a multi-agent, domain-specific healthcare LLM assistant using LangGraph, LangChain, and ReAct/Chain-of-Thought reasoning with LiveKit, improving clinician workflow efficiency by 35%. Designed a Hybrid RAG pipeline with HNSW indexing, metadata-aware retrieval, and TinyBERT reranking, achieving a 45% improvement in factual accuracy and 3× query throughput. Implemented fine-tuning and adapter-based customization for LLaMA and GPT-4 models using QLoRA and Instruction Tuning on AWS SageMaker, reducing training cost by 38% and cycle time by 50%. Integrated MCP (Model Context Protocol) for cross-agent data interoperability between LLM agents and external healthcare APIs. Engineered LLMOps infrastructure on AWS Bedrock, SageMaker, and EKS with CI/CD automation (GitHub Actions, Terraform), cutting deployment turnaround from weeks to days. Optimized inference performance with vLLM, TensorRT-LLM, and FlashAttention, achieving 50% lower latency and 35% higher GPU utilizat
Senior AI Software Engineer at Cognii
April 1, 2019 - March 1, 2022
Developed cloud-native backend systems for Cognii’s AI tutoring platform, leveraging Python, FastAPI, and containerized microservices. Trained and deployed NLP models with Transformer architectures (BERT, GPT, RoBERTa) to enhance automated grading and tutoring accuracy by 30%. Integrated generative conversational models via Hugging Face and OpenAI APIs to enable adaptive, context-aware learning experiences. Optimized data and inference pipelines with NumPy and Scikit-learn, achieving 35% latency reduction. Deployed end-to-end ML workflows on AWS SageMaker and Kubernetes, ensuring high availability and reliability. Partnered with product and research teams to drive AI roadmap alignment with learning science goals, improving user engagement by 25%.
Software Engineer at Lemonway
August 1, 2016 - January 1, 2019
Developed full-stack financial systems using the MERN stack (MongoDB, Express, React, Node.js) and Python (Flask, Django). Built secure RESTful and GraphQL APIs reducing partner integration time by 40%. Implemented real-time fraud detection pipelines with Pandas, Scikit-learn, and Kafka, improving anomaly detection accuracy by 30%. Designed interactive analytics dashboards with React, D3.js, and WebSockets, increasing operational visibility by 45%. Containerized and orchestrated microservices with Docker and AWS ECS/Kubernetes, maintaining 99.9% uptime and faster CI/CD cycles. Worked in Agile, compliance-heavy fintech environments, delivering scalable, auditable systems.
Python Developer at GitLab
November 1, 2014 - May 1, 2016
Engineered backend analytics and personalization services using Python (Flask, Django). Developed predictive ML models for user behavior forecasting, improving engagement predictions by 35%. Containerized ML experiments with Docker and AWS EC2, cutting deployment cycles by 50%. Tuned PostgreSQL and MySQL queries for large-scale data pipelines, achieving 40% faster performance. Collaborated with data and product teams to launch A/B experimentation frameworks that increased feature rollout validation efficiency by 30%.

Education

Master of Science (MS) in Computer Science at University of Bucharest
July 1, 2012 - September 1, 2014
Bachelor of Science (BS) in Computer Science at University of Bucharest
October 1, 2009 - June 30, 2012

Qualifications

Add your qualifications or awards here.

Industry Experience

Healthcare, Education, Financial Services, Software & Internet, Professional Services