I am a Generative AI Engineer with 4 years of experience building enterprise-scale AI systems using Large Language Models (LLMs), Retrieval Augmented Generation (RAG), and AI Agents. I design production-ready AI platforms, multi-modal AI applications, and autonomous AI workflows deployed on AWS, Azure, and Google Cloud. My work spans conversational AI, semantic search, AI copilots, and intelligent automation across large enterprise data platforms. I emphasize scalable MLOps, model monitoring, prompt engineering, and real-time insights from transformer architectures and vector embeddings.

Vishal Sharma

I am a Generative AI Engineer with 4 years of experience building enterprise-scale AI systems using Large Language Models (LLMs), Retrieval Augmented Generation (RAG), and AI Agents. I design production-ready AI platforms, multi-modal AI applications, and autonomous AI workflows deployed on AWS, Azure, and Google Cloud. My work spans conversational AI, semantic search, AI copilots, and intelligent automation across large enterprise data platforms. I emphasize scalable MLOps, model monitoring, prompt engineering, and real-time insights from transformer architectures and vector embeddings.

Available to hire

I am a Generative AI Engineer with 4 years of experience building enterprise-scale AI systems using Large Language Models (LLMs), Retrieval Augmented Generation (RAG), and AI Agents. I design production-ready AI platforms, multi-modal AI applications, and autonomous AI workflows deployed on AWS, Azure, and Google Cloud.

My work spans conversational AI, semantic search, AI copilots, and intelligent automation across large enterprise data platforms. I emphasize scalable MLOps, model monitoring, prompt engineering, and real-time insights from transformer architectures and vector embeddings.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
See more

Language

English
Fluent

Work Experience

Generative AI Engineer at Kimberly Clark
April 1, 2025 - Present
Designed and deployed enterprise-grade generative AI platforms enabling intelligent document search, conversational AI assistants, and predictive analytics. Built an AI Career Coach platform powered by GPT-4 and LangChain with semantic document search. Designed a Retrieval Augmented Generation (RAG) architecture integrating enterprise knowledge bases with ChromaDB, improving answer accuracy by 60%. Developed an Natural Language to SQL interface enabling business users to generate analytics queries conversationally. Implemented LLM fine-tuning pipelines using Hugging Face and Vertex AI for domain-specific NLP models. Built a real-time ESG news intelligence system analyzing millions of global news articles using transformer-based NLP. Optimized inference pipelines via distillation and quantization, reducing infrastructure costs by 35%. Implemented MLOps pipelines including automated evaluation, model monitoring, and retraining workflows. Designed multi-modal document understanding system
AI & ML Engineer at Physicians Mutual
January 1, 2024 - March 1, 2025
Developed scalable AI solutions supporting risk analytics, fraud detection, and enterprise document intelligence. Built NLP pipelines using BERT embeddings for document classification and contract analysis. Designed Doc2Vec-based plagiarism detection platform. Developed machine learning pipelines processing millions of financial transactions using Apache Spark. Implemented Auto ML systems with generative AI model tuning. Built anomaly detection models to identify fraud patterns in insurance claims. Developed predictive analytics models improving risk scoring accuracy by 28%. Deployed models across AWS SageMaker and Azure ML environments. Designed and deployed ML pipelines using Apache Spark and Python. Developed feature engineering frameworks and automated model training workflows. Built fraud detection and anomaly detection models using ensemble techniques and real-time processing. Implemented data preprocessing pipelines for NLP models, including tokenization, embeddings, and documen
Python Developer at Dun & Bradstreet, India
March 1, 2022 - July 1, 2023
Built backend systems and data pipelines supporting large-scale enterprise data services. Developed REST APIs using Django and FastAPI for enterprise data platforms. Implemented computer vision models using OpenCV for real-time image analysis. Built scalable data ingestion pipelines integrating multiple enterprise databases. Developed scalable RESTful APIs using Django and FastAPI to support enterprise data platforms and internal analytics applications. Built data ingestion and ETL pipelines using Python, Pandas, and SQL. Optimized database performance in MySQL and PostgreSQL with indexing and efficient queries. Integrated third-party APIs and external data sources to enrich datasets used in analytics and reporting. Collaborated in Agile teams, contributing to code reviews, testing, and CI/CD workflows.

Education

Master's in information technology at Arizona State University, Tempe, AZ, US
January 11, 2030 - May 11, 2026
Computer Science Engineering at GITAM University, Bangalore, India
January 11, 2030 - May 11, 2026

Qualifications

Add your qualifications or awards here.

Industry Experience

Software & Internet, Financial Services, Healthcare, Education, Professional Services