ATHARVA BHAIRAM

Available to hire

I’m an AI data trainer and annotator who thrives on turning messy data into high-quality labeled datasets that power reliable AI models. I specialize in annotation workflows, taxonomy design, and quality control protocols that measure precision, recall, and inter-annotator agreement, and I’m comfortable working in remote-first environments to keep projects on track.

Throughout my career I’ve built robust labeling systems from scratch, led QA reviews to minimize bias, and collaborated with cross-functional teams to boost model performance through better data. I’m passionate about continuous improvement, bias detection, and delivering value through clean, well-documented processes.

Language

English: Advanced
Hindi: Advanced

Work Experience

AI Intern at Mxpert
October 1, 2025 - November 26, 2025
Built AI automation tools using TypeScript, Express, Next.js, and LLMs (Gemini, Groq, OpenAI). Created a web scraper that extracts structured service data from any business website given its URL. Developed automated S3 pipelines for call recording downloads, LLM transcription, and Google Sheets logging. Enhanced an AI voice/chat assistant for auto-repair shops using GPT and Twilio for call automation.
AI Intern at Genesis
July 1, 2025 - July 1, 2025
Engineered secure authentication using Firebase Admin SDK and JWT. Developed real-time chat with Socket.IO for scalable communication. Built RAG pipeline with Qdrant, Sentence-Transformers, and Groq API for intelligent retrieval.
AI/ML Developer Intern at Codelissian
December 1, 2023 - December 1, 2023
Developed ML models for prediction and classification using scikit-learn and PyTorch. Built NLP chatbot with HuggingFace Transformers and Dialogflow. Deployed ML services using FastAPI and Docker for scalable inference.
Machine Learning Intern at Bharat Intern
April 1, 2023 - April 1, 2023
Implemented classification and clustering models with supervised/unsupervised learning. Developed CV pipelines with CNNs in TensorFlow. Evaluated models using accuracy, precision, recall, and F1-score.
Full-Stack Software Engineer at Mxpert
October 1, 2025 - Present
Designed and developed full-stack web applications, ensuring intuitive user interfaces and optimized performance. Developed scalable Node.js backend services and implemented real-time features using Socket.IO.
Full-Stack Web Developer at Genesis
May 1, 2025 - July 1, 2025
Built a production-ready web application implementing real-time communication via Socket.IO. Optimized PostgreSQL database queries and schema design to enhance data retrieval efficiency.
Web Development Engineer at Codelissian
October 1, 2023 - November 1, 2024
Developed full-stack web features to improve user engagement and implemented comprehensive form validation and submission handling. Collaborated with product teams to deliver high-quality features.
Web Developer Intern at Bharat Intern
September 1, 2022 - September 1, 2023
Developed responsive web applications and RESTful APIs, focusing on user authentication and data visualization. Optimized performance through code splitting and lazy loading.
Full Stack Engineer (AI Focus) at Mxpert
October 1, 2025 - Present
Architected and deployed scalable full-stack web applications using TypeScript, Express, and Next.js with end-to-end ownership. Built an intelligent web scraper that extracts structured business data from any website URL and has processed 500+ sites. Designed automated S3-to-database pipelines handling call recordings, LLM transcription, and Google Sheets integration. Developed an AI-powered call analysis engine with automated categorization across 29 service types and statistical reporting dashboards. Enhanced a conversational AI assistant integrating GPT, Twilio, and custom backend APIs for auto-repair shop automation. Collaborated with stakeholders to define requirements, break down deliverables, and deliver on iterative timelines.
Full Stack Developer at Genesis
May 1, 2025 - July 1, 2025
Designed and implemented secure authentication system using Firebase Admin SDK and JWT for scalable user management. Developed real-time chat platform with Socket.IO backend and React frontend, supporting 1000+ concurrent connections. Built intelligent RAG pipeline with Qdrant vector database, Sentence-Transformers embeddings, and Groq API for context-aware retrieval. Created robust file ingestion system supporting PDFs and Word documents using PDFPlumber, PyPDF2, and python-docx. Optimized PostgreSQL database schema and queries, reducing response times by 40% for high-traffic endpoints.
Full Stack Developer Intern at Codelissian
June 1, 2023 - December 1, 2023
Developed end-to-end web applications with React frontends, Node.js/Express backends, and MongoDB databases. Built RESTful APIs and microservices architecture with FastAPI for scalable, production-ready ML inference. Integrated AI capabilities including NLP chatbot with HuggingFace Transformers and intelligent search with Pinecone + LangChain. Deployed containerized applications using Docker, ensuring consistent environments across development and production. Collaborated with cross-functional teams using agile methodologies to deliver features on tight deadlines.
Software Development Intern at Bharat Intern
September 1, 2022 - April 1, 2023
Built full stack web applications integrating machine learning models for classification and sentiment analysis. Developed responsive frontend interfaces with React and backend APIs with Express for seamless user experiences. Implemented data processing pipelines with Pandas and NumPy for efficient dataset handling and transformation. Created interactive dashboards and visualizations to present model insights to non-technical stakeholders. Participated in code reviews and maintained clear documentation for knowledge transfer across team members.
Generative AI Associate at Meta via Innodata
January 1, 2026 - Present
Evaluated and validated AI-generated responses across Factuality, Freshness, Locality, and On-Topic & Completeness workflows by following detailed annotation guidelines and quality standards. Executed claim extraction and verification using structured web search, source evaluation, and evidence-based validation to assess the accuracy and reliability of model outputs. Delivered response-level quality assessments using internal evaluation tools, applying form logic, dependency rules, and strict formatting requirements to support large-scale AI model improvement initiatives.
AI Data Trainer at Mxpert
October 1, 2025 - December 31, 2025
Completed data annotation and quality control on structured datasets covering 200+ business service records, improving data coverage by 45% through systematic labeling workflows. Developed and implemented annotation guidelines and taxonomy structures for automated data extraction pipelines, reducing labeling inconsistencies by 30%. Executed audit checks and quality assurance protocols on 500+ audio transcription tasks per week, achieving 95% accuracy and meeting strict SLA requirements. Collaborated with annotation leads to refine labeling workflows and identify edge cases in voice data, contributing to continuous quality improvement initiatives.
AI Training Specialist at Genesis
May 1, 2025 - July 31, 2025
Annotated and curated 5K+ text documents for RAG system training, applying structured taxonomy and labeling guidelines to generate high-fidelity training data. Led quality assurance reviews on annotation outputs, identifying ambiguous and edge cases to improve dataset quality and reduce bias. Used data manipulation tools to process and validate structured data formats (JSON, CSV), ensuring annotation accuracy and consistency across projects. Documented annotation procedures and contributed to feedback sessions, improving guideline clarity and reducing annotation time by 25%.
AI Data Annotation Specialist at Codelissian
January 1, 2023 - June 30, 2023
Labeled and annotated 50K+ training samples for classification and prediction models, consistently meeting throughput targets while maintaining 95% annotation accuracy across all deliverables. Facilitated data quality reviews on NLP training datasets covering 100+ intents, flagging ambiguous cases and proposing taxonomy refinements to improve guideline clarity. Managed annotation tasks on 3K+ documents for RAG workflow training, using structured labeling protocols and quality control metrics to ensure data consistency. Automated data validation tasks using Python scripts that processed 200+ files per day, reducing manual QC time by 40% while maintaining quality standards. Collaborated with cross-functional teams on dataset operations and active learning workflows, contributing to an 18% improvement in model performance metrics.

Education

Master’s in Artificial Intelligence at Memorial University of Newfoundland
September 1, 2024 - December 1, 2025
B.Tech in Artificial Intelligence at G.H. Raisoni College of Engineering
February 1, 2021 - June 1, 2024

Qualifications

Deep Learning Specialization
January 11, 2030 - November 26, 2025
NLP Certification
January 11, 2030 - November 26, 2025
Full Stack Web Development Certification
January 11, 2030 - December 24, 2025

Industry Experience

Computers & Electronics, Software & Internet, Education, Media & Entertainment