I am a Generative AI Engineer with over 5 years of experience building enterprise-grade AI solutions with models such as GPT-4, LLaMA, and Mistral. I excel at architecting end-to-end AI workflows, deploying large language models at scale, and integrating vector databases to enable intelligent document processing and semantic search. Passionate about applying innovative AI techniques across the healthcare, finance, and retail sectors, I am committed to delivering impactful and ethical AI-powered applications that drive business productivity and operational efficiency. My expertise extends to cloud-native deployments on AWS, Azure, and GCP, along with strong skills in prompt engineering, fine-tuning strategies, and multi-agent system orchestration. Beyond technical skills, I enjoy collaborating with cross-functional teams and leading knowledge-sharing sessions to foster understanding of generative AI best practices. I am excited to continue advancing AI innovation and to help organizations harness the power of large language models responsibly and effectively.

Arjun Chirumamilla

Available to hire

Experience Level

Expert

Work Experience

Artificial Intelligence Consultant at Humana
March 1, 2024 - Present
Designed and deployed large-scale Generative AI applications using GPT-4, Claude, and open-source models with the LangChain and CrewAI frameworks, serving over 10,000 daily users with high availability. Developed multi-agent workflows and semantic search solutions using Pinecone, ChromaDB, and Weaviate, and built REST APIs supporting over one million requests monthly. Led cloud deployments on AWS, Azure, and Hugging Face Spaces with comprehensive CI/CD pipelines and model evaluation frameworks. Pioneered Retrieval-Augmented Generation (RAG) pipelines, prompt engineering, fine-tuning, real-time streaming data processing, and AI governance strategies to ensure ethical, efficient, and cost-effective AI model usage. Conducted technical training sessions and built interactive demo apps for stakeholders.

Generative AI Engineer at Indeed
August 31, 2023 - September 4, 2025
Led the design and development of Generative AI tools utilizing LLaMA, Mistral, and LangChain, automating internal documentation processes and reducing manual effort by 50%. Implemented RAG-based document summarization bots processing over 10,000 documents daily, with semantic search enhancements using ChromaDB and FAISS. Built advanced prompt chaining and conversational memory systems, improving contextual accuracy. Supported core insurance platform integration with domain-specific embeddings and multi-modal AI systems. Developed AI chatbots with multi-turn dialogue capabilities, established A/B testing and robust model versioning, and ensured regulatory compliance. Optimized inference latency with quantization and prompt engineering, achieving sub-200ms responses.

Machine Learning Engineer at Eaton
December 31, 2020 - September 4, 2025
Architected end-to-end ML pipelines for insurance claims prediction and fraud detection, achieving 92% accuracy and a 45% reduction in processing time. Developed transformer-based conversational assistants handling 5,000+ customer inquiries daily. Implemented large-scale ETL workflows with PySpark and deployed models on SageMaker with monitoring and automated retraining capabilities. Established scalable MLOps practices using Docker and Kubernetes, delivered real-time inference pipelines with sub-100ms latency, and built ensemble models combining deep learning and traditional algorithms. Created model interpretability tools and feature stores, implemented cost optimizations, and led technical mentoring programs.

Education

Master's in Information Science at Trine University

Qualifications

Generative AI with Hugging Face and LangChain – Udemy
Prompt Engineering for Developers – DeepLearning.AI & OpenAI
Machine Learning with Python – SoloLearn
SQL for Data Analytics – Udemy

Industry Experience

Healthcare, Financial Services, Retail, Software & Internet