I'm an AI/ML engineer with 4+ years of experience building and deploying production-grade machine learning and generative AI systems at scale. I have hands-on experience with large language models, retrieval-augmented generation, and end-to-end MLOps on cloud platforms, particularly AWS. I'm passionate about NLP, computer vision, and deep learning, with a focus on low-latency inference, performance optimization, and responsible AI. I enjoy delivering measurable business outcomes through scalable AI solutions and collaborating with cross-functional teams.

Tharun Challa

I'm an AI/ML engineer with 4+ years of experience building and deploying production-grade machine learning and generative AI systems at scale. I have hands-on experience with large language models, retrieval-augmented generation, and end-to-end MLOps on cloud platforms, particularly AWS. I'm passionate about NLP, computer vision, and deep learning, with a focus on low-latency inference, performance optimization, and responsible AI. I enjoy delivering measurable business outcomes through scalable AI solutions and collaborating with cross-functional teams.

Available to hire

I’m an AI/ML engineer with 4+ years of experience building and deploying production-grade machine learning and generative AI systems at scale. I have hands-on experience with large language models, retrieval-augmented generation, and end-to-end MLOps on cloud platforms, particularly AWS.

I’m passionate about NLP, computer vision, and deep learning, with a focus on low-latency inference, performance optimization, and responsible AI. I enjoy delivering measurable business outcomes through scalable AI solutions and collaborating with cross-functional teams.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert

Language

English
Fluent

Work Experience

AI/ML Engineer at Amazon
January 1, 2024 - Present
Built generative AI applications using large language and speech models to enable automated summarization, transcription, and conversational Q&A over structured and unstructured enterprise data, improving knowledge accessibility and operational efficiency by approximately 30%. Fine-tuned transformer-based models using AWS SageMaker and Hugging Face Transformers with distributed training to enhance domain-specific performance while reducing training time and infrastructure costs. Designed and implemented scalable MLOps workflows on AWS using SageMaker, Lambda, Step Functions, and S3 to automate end-to-end model training, evaluation, and deployment pipelines. Developed semantic search and retrieval-augmented generation systems using FAISS, Pinecone, and LangChain to improve document discovery and contextual response accuracy. Built and deployed production AI microservices using FastAPI and Docker on AWS ECS and EKS with monitoring and logging through CloudWatch and X-Ray. Optimized infer
Graduate Assistant at University Of Nebraska Omaha
August 1, 2023 - May 1, 2025
Developed an AI-driven enhancement for the existing ATS web application to automate resume analysis and candidate shortlisting for faculty recruitment. Built Natural Language Processing workflows to extract relevant information such as skills, experience, and education from resumes using Python-based machine learning techniques. Designed and documented REST APIs using Swagger to integrate machine learning services for candidate scoring and ranking within the application. Implemented application-wide logging, monitoring, and transaction management using Spring AOP and Log4J to support AI service reliability. Optimized data processing logic and inference workflows, reducing overall execution time by 70 percent and delivering new features within one month instead of the planned two months. Managed project dependencies and builds using Maven to integrate AI components into the existing system architecture. Conducted unit testing with JUnit and performed code quality reviews using SonarQube
AI/ML Engineer at WIPRO
April 1, 2022 - May 1, 2023
Developed and deployed supervised and unsupervised machine learning models to enhance user engagement and personalization, improving recommendation performance by approximately 14%. Analyzed large-scale user behavior data using Python, SQL, and PySpark to identify trends and generate actionable insights that guided product improvements. Built and maintained automated data pipelines with Apache Spark and Airflow to support reliable model retraining, monitoring, and data quality management. Designed and executed structured A/B testing experiments to evaluate AI-driven features and presented data-backed findings to product and leadership teams. Collaborated with engineers and product managers to define AI-related KPIs and align machine learning initiatives with business objectives. Prototyped and trained deep learning models using PyTorch and TensorFlow for text processing and image classification use cases. Applied feature engineering, hyperparameter tuning, and model validation techniqu

Education

Master of Science in Computer Science at University of Nebraska Omaha
January 11, 2030 - May 1, 2025
Bachelor of Technology in Information Technology at Geethanjali College Of Engineering and Technology
January 11, 2030 - May 1, 2023

Qualifications

Add your qualifications or awards here.

Industry Experience

Software & Internet, Computers & Electronics, Professional Services

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert