AI Engineer specializing in NLP and agentic AI with 8+ years of experience building production language systems from data collection, preprocessing, and model training through evaluation, deployment, and post-launch analytics. Strong background in transformer fine-tuning, retrieval-augmented generation, workflow orchestration, experimentation, and model-quality measurement across enterprise software, SaaS, consulting, and digital-health environments.

Henry Dong

AI Engineer specializing in NLP and agentic AI with 8+ years of experience building production language systems from data collection, preprocessing, and model training through evaluation, deployment, and post-launch analytics. Strong background in transformer fine-tuning, retrieval-augmented generation, workflow orchestration, experimentation, and model-quality measurement across enterprise software, SaaS, consulting, and digital-health environments.

Available to hire

AI Engineer specializing in NLP and agentic AI with 8+ years of experience building production language systems from data collection,
preprocessing, and model training through evaluation, deployment, and post-launch analytics. Strong background in transformer
fine-tuning, retrieval-augmented generation, workflow orchestration, experimentation, and model-quality measurement across
enterprise software, SaaS, consulting, and digital-health environments.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert

Work Experience

Senior AI Engineer at HealthTap
March 1, 2025 - Present
Developed agentic clinical-intake workflows using LangGraph and Pydantic to gather symptoms and summarize chart context, utilizing function calling to surface structured visit briefs for physicians. Built HIPAA-aware retrieval and grounding pipelines using Amazon Kendra for semantic search and Amazon S3 for secure document storage, improving consistency through reranker optimization. Trained and evaluated medical text classifiers on Amazon SageMaker, implementing threshold calibration and safety review to manage model abstention on ambiguous triage cases. Implemented rigorous LLM evaluation harnesses using agentic LLM-as-a-judge patterns to detect hallucinations and tool-use errors, creating release gates within AWS CodePipeline. Partnered with clinicians and security teams to translate care-delivery requirements into measurable model objectives and audit-ready AWS deployment practices. Optimized conversational intake flows by leveraging LangSmith for tracing and observability, compari
AI Consultant, Manager at Accenture Applied Intelligence
January 1, 2023 - February 28, 2025
Architected retrieval-augmented assistants using Azure OpenAI Service and Azure AI Search, combining prompt orchestration with custom guardrails to shorten analyst research workflows. Led data engineering for document ingestion and vector indexing using PySpark on Azure Databricks, improving recall and freshness across multimodal knowledge bases. Evaluated LLM applications using task-specific golden sets and rubric-based review, establishing measurable acceptance criteria before production deployment. Built generation services with FastAPI and Azure Kubernetes Service AKS, supporting secure deployment patterns for open-source models and client-controlled infrastructure. Introduced multi-agent orchestration in 2024 using LangGraph and AutoGen for planner-executor patterns and human-in-the-loop escalation, reducing brittle prompt chains in complex enterprise processes. Developed Semantic Kernel implementations for enterprise clients to map operational constraints and convert prototype fi
Senior NLP Engineer at Grammarly
August 1, 2020 - December 31, 2022
Led transformer-based ranking models (BERT-family) for rewrite features, utilizing product telemetry and linguistic signals to increase suggestion acceptance rates across high-volume writing surfaces. Designed large-scale text cleaning and weak-labeling pipelines using Spark and Airflow, generating training corpora from anonymized edits while maintaining privacy. Trained multilingual quality-estimation models with PyTorch and XLM-R, improving intent-preserving rewrite detection for global users. Built offline evaluation frameworks in Python using statsmodels and scikit-learn to measure precision, acceptance lift, latency, and model calibration. Collaborated with product analysts to segment usage, diagnose failure modes, and link NLP model quality directly to user retention.
Machine Learning Engineer at IBM Watson
June 1, 2017 - July 31, 2020
Built NER and intent-labeling pipelines over support logs using spaCy, NLTK, and scikit-learn, reducing manual triage for customer success teams. Developed distributed preprocessing jobs in Spark to normalize contracts and articles for Watson Discovery indexing and model training. Fine-tuned early BERT classifiers using PyTorch and tracked experimental metadata and versioning via MLflow. Partnered with search teams on Smart Document Understanding workflows, improving extraction quality across insurance and compliance documents. Shipped monitoring dashboards to track latency, feature drift, and inter-annotator agreement for enterprise NLP releases.

Education

M.S. in Analytics at Georgia Institute of Technology
January 11, 2030 - May 27, 2026

Qualifications

Add your qualifications or awards here.

Industry Experience

Healthcare, Software & Internet, Professional Services