I am a Gen AI Engineer with over 4 years of experience building enterprise-scale generative AI solutions. Specializing in deep learning, transformers, and large language models, I've led projects that improved system performance and NLP accuracy significantly. I am passionate about ethical AI development, data privacy, and building scalable AI infrastructures across various cloud platforms. I thrive in collaborative environments where I can mentor others and leverage my skills in machine learning optimization, MLOps, and cloud deployment to create impactful AI applications. Constantly researching privacy-preserving techniques and emerging AI technologies, I aim to contribute to secure, efficient, and responsible AI innovations.

Navya Sree Yellina

I am a Gen AI Engineer with over 4 years of experience building enterprise-scale generative AI solutions. Specializing in deep learning, transformers, and large language models, I've led projects that improved system performance and NLP accuracy significantly. I am passionate about ethical AI development, data privacy, and building scalable AI infrastructures across various cloud platforms. I thrive in collaborative environments where I can mentor others and leverage my skills in machine learning optimization, MLOps, and cloud deployment to create impactful AI applications. Constantly researching privacy-preserving techniques and emerging AI technologies, I aim to contribute to secure, efficient, and responsible AI innovations.

Available to hire

I am a Gen AI Engineer with over 4 years of experience building enterprise-scale generative AI solutions. Specializing in deep learning, transformers, and large language models, I’ve led projects that improved system performance and NLP accuracy significantly. I am passionate about ethical AI development, data privacy, and building scalable AI infrastructures across various cloud platforms.

I thrive in collaborative environments where I can mentor others and leverage my skills in machine learning optimization, MLOps, and cloud deployment to create impactful AI applications. Constantly researching privacy-preserving techniques and emerging AI technologies, I aim to contribute to secure, efficient, and responsible AI innovations.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Intermediate
Intermediate

Language

English
Fluent

Work Experience

Generative AI Engineer Intern at Gemini Consulting & Services
January 1, 2025 - Present
Architected an enterprise generative AI platform using OpenAI GPT API, transformers, PyTorch, and TensorFlow, reducing information retrieval latency by 40% while supporting 500+ concurrent users with 90% system uptime improvement. Implemented RAG framework improving NLP accuracy by 25% across 10,000+ production queries through transformers fine-tuning and precision enhancement. Developed multi-channel AI agents with Python and Azure APIs, increasing response throughput by 30% with a focus on ethical AI and data privacy compliance. Established automated MLOps pipelines using Docker, Kubernetes, AWS SageMaker, and Git workflows, accelerating deployment cycles by 35%, and reducing manual errors. Ensured scalable, reliable AI infrastructure with comprehensive unit testing and performance monitoring across cloud platforms.
Systems Engineer at Oracle Cerner
July 31, 2023 - July 31, 2025
Built distributed machine learning monitoring system for 50+ microservices, reducing incident response time by 20% and maintaining 99.9% uptime across 2.5M+ daily transactions. Developed high-performance ETL pipelines for Oracle-to-PostgreSQL migration, improving query performance by 25% and reducing hosting costs by $50K annually. Automated cloud infrastructure provisioning using Python, Terraform, Docker, and Kubernetes across AWS and Azure for containerized services. Implemented Git-based CI/CD workflows with automated validation, reducing high-risk production incidents by 30%. Provided technical guidance and mentorship to junior developers, fostering collaboration across cross-functional teams.
Software Engineer Intern at Televerge Communications
April 30, 2021 - July 31, 2025
Optimized backend systems using Java, Python, and machine learning algorithms, scaling from 7K to 10K+ daily API requests with a 30% throughput improvement and a 15% memory reduction. Developed REST API integrations with React frontend and SQL databases, improving data delivery by 40% and implementing data privacy measures for 5,000+ users. Created reusable software libraries for network protocol implementations, increasing development efficiency by 25%. Processed over 1M network events daily using distributed computing and machine learning in an agile environment.

Education

M.Sc. at Saint Louis University
August 1, 2023 - May 31, 2025
B.Sc. at Koneru Lakshmaiah University
August 1, 2017 - May 31, 2021

Qualifications

Add your qualifications or awards here.

Industry Experience

Software & Internet, Professional Services, Healthcare, Telecommunications, Financial Services