I am a results-driven AI/ML engineer with 5+ years of experience designing, training, and deploying production-grade machine learning, LLM, and retrieval-augmented generation (RAG) systems across healthcare, IoT, and enterprise platforms. I specialize in building scalable, end-to-end AI solutions that balance accuracy, latency, and maintainability. I thrive in cross-functional environments and enjoy turning complex data into practical business outcomes. My expertise spans LLM fine-tuning, agentic AI workflows, vector databases, and MLOps automation using PyTorch, TensorFlow, Hugging Face, LangChain, and LlamaIndex. I have a proven track record of improving model performance, reducing inference latency, and accelerating development cycles while ensuring reproducibility and responsible AI practices.

Akhil Banothai

I am a results-driven AI/ML engineer with 5+ years of experience designing, training, and deploying production-grade machine learning, LLM, and retrieval-augmented generation (RAG) systems across healthcare, IoT, and enterprise platforms. I specialize in building scalable, end-to-end AI solutions that balance accuracy, latency, and maintainability. I thrive in cross-functional environments and enjoy turning complex data into practical business outcomes. My expertise spans LLM fine-tuning, agentic AI workflows, vector databases, and MLOps automation using PyTorch, TensorFlow, Hugging Face, LangChain, and LlamaIndex. I have a proven track record of improving model performance, reducing inference latency, and accelerating development cycles while ensuring reproducibility and responsible AI practices.

Available to hire

I am a results-driven AI/ML engineer with 5+ years of experience designing, training, and deploying production-grade machine learning, LLM, and retrieval-augmented generation (RAG) systems across healthcare, IoT, and enterprise platforms. I specialize in building scalable, end-to-end AI solutions that balance accuracy, latency, and maintainability. I thrive in cross-functional environments and enjoy turning complex data into practical business outcomes.

My expertise spans LLM fine-tuning, agentic AI workflows, vector databases, and MLOps automation using PyTorch, TensorFlow, Hugging Face, LangChain, and LlamaIndex. I have a proven track record of improving model performance, reducing inference latency, and accelerating development cycles while ensuring reproducibility and responsible AI practices.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Intermediate
Intermediate
Intermediate
Intermediate
See more

Language

English
Fluent

Work Experience

AI Engineer at Centrak | RK Software Services LLC
January 1, 2023 - Present
Designed and trained ML models for real-time location services (RTLS) and location intelligence, improving indoor positioning accuracy by 25–35% through feature engineering and signal optimization on sensor and Wi-Fi data streams. Developed and deployed LLM-powered Retrieval-Augmented Generation (RAG) systems enabling natural-language search over enterprise datasets, reducing data lookup time by 40% and improving retrieval relevance by 30%. Engineered vector database pipelines (FAISS, Pinecone, Chroma) for fast semantic retrieval and grounding, supporting LLM accuracy and reducing hallucinations in production workflows. Built custom ML algorithms and fine-tuned open-source foundation models (Hugging Face Transformers, GPT-style) to meet business constraints, shortening development cycles by 30%. Implemented end-to-end MLOps with MLflow, Kubeflow, and feature stores (Feast / Tecton) for model versioning, experiment tracking, and automated retraining, reducing deployment errors by 35%
Machine Learning Engineer at Keystone Recruitment
December 1, 2020 - November 1, 2022
Framed complex business and research challenges as structured ML problems—delivering classification, regression, NLP, and generative AI models that improved task-level accuracy by 25–40% across healthcare, finance, and enterprise use cases. Fine-tuned LLMs and transformer-based models (BERT, GPT, T5) for domain-specific tasks including text classification, summarization, and information extraction, increasing task success rates by 25% over baselines. Built distributed training pipelines using PyTorch and TensorFlow with multi-GPU support, reducing training time by 2× while maintaining reproducibility and traceability via MLflow and Weights & Biases. Designed robust evaluation frameworks with cross-validation, ablation studies, and error analysis to ensure stable performance on unseen data. Conducted adversarial testing, bias analysis, and model robustness checks to reduce production failure rates by 20%. Performed large-scale data preprocessing, feature engineering, and dataset va
Machine Learning Engineer at Web IT Solutions
July 1, 2018 - November 1, 2020
Built and deployed supervised ML models (classification, regression) using Scikit-learn on structured business datasets, establishing foundational pipelines for product and operational decision-making. Cleaned, transformed, and validated raw datasets with Python (Pandas, NumPy) and SQL to improve data accuracy by 30% and create model-ready feature sets. Performed exploratory data analysis to surface trends and actionable insights used in strategic meetings. Created interactive dashboards (Power BI, Tableau, Excel) to visualize KPIs, reducing manual reporting cycles by 40%. Used SQL to extract, join, and validate data from multiple sources, reducing reconciliation errors by 20%. Documented analyses and dashboard logic to enable reproducibility and team handoffs.

Education

Master of Science in Information Technology at Clark University
January 1, 2023 - December 1, 2024
Bachelor of Business Administration at Andhra University
June 1, 2014 - May 1, 2017

Qualifications

Add your qualifications or awards here.

Industry Experience

Healthcare, Professional Services, Software & Internet, Computers & Electronics, Manufacturing