I am an AI/ML Engineer with 4+ years of experience designing and deploying production-grade machine learning and generative AI systems across cloud environments. I specialize in transformer-based NLP pipelines, Retrieval-Augmented Generation (RAG) architectures, and scalable ML workflows that power enterprise analytics and knowledge platforms. I have a strong track record of improving model performance, optimizing large-scale data pipelines, and deploying API-driven inference services using Python, PyTorch, and modern MLOps frameworks. I also work with vector search, embedding pipelines, and semantic retrieval integrated into real-world software platforms, collaborating closely with data engineers and product teams to deliver practical AI solutions.

Pavan Lokesh Tai

I am an AI/ML Engineer with 4+ years of experience designing and deploying production-grade machine learning and generative AI systems across cloud environments. I specialize in transformer-based NLP pipelines, Retrieval-Augmented Generation (RAG) architectures, and scalable ML workflows that power enterprise analytics and knowledge platforms. I have a strong track record of improving model performance, optimizing large-scale data pipelines, and deploying API-driven inference services using Python, PyTorch, and modern MLOps frameworks. I also work with vector search, embedding pipelines, and semantic retrieval integrated into real-world software platforms, collaborating closely with data engineers and product teams to deliver practical AI solutions.

Available to hire

I am an AI/ML Engineer with 4+ years of experience designing and deploying production-grade machine learning and generative AI systems across cloud environments. I specialize in transformer-based NLP pipelines, Retrieval-Augmented Generation (RAG) architectures, and scalable ML workflows that power enterprise analytics and knowledge platforms.

I have a strong track record of improving model performance, optimizing large-scale data pipelines, and deploying API-driven inference services using Python, PyTorch, and modern MLOps frameworks. I also work with vector search, embedding pipelines, and semantic retrieval integrated into real-world software platforms, collaborating closely with data engineers and product teams to deliver practical AI solutions.

See more

Experience Level

Expert
Expert
Expert
Expert
Intermediate
Intermediate

Work Experience

AI / ML Engineer (Generative AI / LLM Systems) at WhyHow.AI
January 1, 2025 - Present
Architected Retrieval-Augmented Generation pipelines with transformer embeddings and vector search to power enterprise document intelligence; built semantic search infrastructure enabling contextual information retrieval across large document repositories, improving response relevance by 25–30%; deployed FastAPI inference services exposing LLM-based APIs; implemented prompt engineering and context-ranking to reduce hallucinations; containerized and deployed AI services with Docker/Kubernetes; implemented monitoring for latency, drift, and reliability.
AI / Machine Learning Engineer at Capgemini
July 1, 2022 - July 1, 2023
Expanded ML models for customer segmentation and churn prediction, improving campaign targeting accuracy by 18–22%. Built scalable data pipelines (Python, SQL, PySpark) to process millions of records; exposed REST APIs for real-time inferences; automated model training/deployment via CI/CD; optimized feature engineering and SQL/ETL to improve data prep performance by 35%.
Machine Learning Engineer at Fractal Analytics
August 1, 2020 - June 1, 2022
Built predictive ML models for classification and regression across demand forecasting, risk prediction, and customer analytics; enforced feature engineering pipelines; planned ETL pipelines; developed Power BI/Tableau dashboards to visualize predictive insights; collaborated with data engineering to improve data quality and reliability; assisted in deploying models into analytics platforms.

Education

Master of Science in Computer Science at Texas A&M University-Kingsville
January 11, 2030 - May 1, 2025
Bachelor of Technology in Computer Science at Jawaharlal Nehru Technological University, Kakinada (JNTUK), India
January 11, 2030 - June 1, 2023

Qualifications

Add your qualifications or awards here.

Industry Experience

Software & Internet, Professional Services