I'm a results-driven Senior Machine Learning Engineer with deep expertise in NLP and large language models, blending solid research fundamentals with hands-on production engineering. I design, fine-tune, and deploy scalable language applications that power real-world business impact. I thrive on building cost-efficient, reliable AI systems, mentoring teams, and turning complex requirements into robust ML workflows. My track record includes leading retrieval-augmented platforms, improving model quality, safety, and observability across enterprise-scale deployments.

Adrian Hoang

I'm a results-driven Senior Machine Learning Engineer with deep expertise in NLP and large language models, blending solid research fundamentals with hands-on production engineering. I design, fine-tune, and deploy scalable language applications that power real-world business impact. I thrive on building cost-efficient, reliable AI systems, mentoring teams, and turning complex requirements into robust ML workflows. My track record includes leading retrieval-augmented platforms, improving model quality, safety, and observability across enterprise-scale deployments.

Available to hire

I’m a results-driven Senior Machine Learning Engineer with deep expertise in NLP and large language models, blending solid research fundamentals with hands-on production engineering. I design, fine-tune, and deploy scalable language applications that power real-world business impact.

I thrive on building cost-efficient, reliable AI systems, mentoring teams, and turning complex requirements into robust ML workflows. My track record includes leading retrieval-augmented platforms, improving model quality, safety, and observability across enterprise-scale deployments.

See more

Experience Level

Expert
Expert
Expert
Expert

Language

English
Fluent

Work Experience

Senior Machine Learning Engineer at Artefact
October 1, 2023 - November 7, 2025
Led design and deployment of enterprise-scale NLP and LLM systems, including a retrieval-augmented LLM platform with vector search, reducing inference cost by 38% while maintaining accuracy. Designed prompt orchestration with evaluation, filtering, and observability to improve reliability of customer-facing AI assistants. Guided fine-tuning of open LLMs using LoRA and adapters for specialized domains, improving output alignment and relevance. Collaborated with infrastructure and data teams to enhance monitoring, benchmarking, and model versioning. Mentored engineers in advanced NLP and deployment practices, promoting robust, reproducible ML workflows.
Machine Learning Engineer at Miquido
September 1, 2023 - September 1, 2023
Developed and deployed retrieval-augmented generation (RAG) systems combining vector databases and LLM APIs, boosting domain-specific answer accuracy by 25% and cutting latency by 40%. Led model compression via quantization and distillation, reducing inference costs by 30% while preserving performance. Built summarization, classification, and semantic search services to automate document processing. Integrated experiment tracking, model registry, and CI/CD workflows to accelerate iterations and improve deployment reliability. Partnered with product teams to integrate ML models into user-facing products with real-time feedback and evaluation. Designed hybrid architectures combining LLMs with structured retrieval for safety and compliance-critical workflows. Authored documentation and reusable ML templates to standardize experimentation and deployment.
Software Engineer at PELTARION
February 1, 2020 - February 1, 2020
Designed, trained, and deployed NLP systems; implemented scalable APIs and model-serving solutions. Delivered NLP microservices for sentiment analysis, entity recognition, and question answering, improving content relevance and user engagement. Migrated ML components to PyTorch-based pipelines, improving training speed and model iteration cycles. Developed REST and gRPC interfaces for model inference, enabling integration across multiple product lines. Established containerized environments with Docker and Kubernetes to streamline deployments and ensure reproducibility. Collaborated with researchers and engineers to operationalize transformer-based NLP models for search and recommendation systems.
Python Developer at Seldon
March 1, 2017 - March 1, 2017
Implemented and maintained Python-based data and ML systems, supporting early-stage NLP product initiatives. Built NLP prototypes for intent classification and keyword extraction using traditional ML methods. Refactored and modularized legacy scripts into maintainable Python packages to enhance reliability and reusability.

Education

Master of Science in Computer Science at University of Essex
September 1, 2020 - August 1, 2022
Bachelor of Science in Computer Science at Yanshan University
September 1, 2011 - August 1, 2015

Qualifications

Add your qualifications or awards here.

Industry Experience

Software & Internet, Computers & Electronics, Professional Services