Parikshit Rohit Pravin Srivastav

Available to hire

I’m Parikshit Srivastav, an AI Engineer with a focus on building production-grade ML and GenAI platforms across NLP, LLMs, and computer vision. I specialize in fine-tuning LLMs (LoRA, quantization), creating scalable RAG pipelines, and deploying models on AWS SageMaker in cloud-native, containerized environments. I have delivered real-time AI solutions for finance, industrial automation, and enterprise use cases, and I thrive working with cross-functional teams to push the boundaries of AI capabilities.

I enjoy building robust annotation and evaluation tools, automating data ingestion and retraining pipelines, and delivering user-friendly interfaces that help teams test and iterate on new prompts and models quickly. I’m always learning and sharing knowledge to drive innovation across organizations.

Language

English: Fluent
Hindi: Advanced

Work Experience

Software Developer – Gen AI at DatamanUS
June 1, 2023 - Present
Designed and deployed generative AI tools using Python and FastAPI, supporting over 1,000 daily users with scalable, containerized microservices. Built production-grade Retrieval-Augmented Generation (RAG) systems on vector stores such as ChromaDB and FAISS, improving semantic search relevance by 45%. Fine-tuned open-source LLaMA models on AWS SageMaker and deployed them through API Gateway and Lambda as scalable, event-driven infrastructure. Developed real-time full-stack interfaces with React.js for prompt and model evaluation, reducing iteration time by 30%. Automated dataset workflows with Airflow, achieving over 99% reliability. Collaborated on interpretability tools that increased internal trust in AI outputs.
Graduate Teaching Assistant at University of Colorado Denver
May 1, 2024 - August 27, 2025
Led C++ programming labs focusing on Data Structures and Program Design, improving student success rates by 9%. Developed and reviewed complex assignments emphasizing STL, memory management, and algorithm optimization. Mentored more than 50 students in advanced C++ best practices and debugging techniques.
AI Engineer at SparkCognition
July 1, 2022 - August 27, 2025
Created and deployed YOLOv5-based computer vision models for industrial defect detection with 94% precision, significantly reducing manual inspection effort. Developed full-stack applications using React.js and Node.js that let plant engineers visualize inspection data and interact with results. Built REST APIs that serve model predictions at 500+ requests per hour, lowering inspection latency. Integrated MLflow to track model experiments and versions, improving collaboration and reproducibility. Developed automated real-time model monitoring with alerting, which reduced incident response time by over 60%. Optimized backend inference services to cut response times from 800 ms to 320 ms, enabling near-real-time feedback in manufacturing. Collaborated with domain experts to build custom annotation and configuration tools that improved training data quality and model control.
Software Developer – Gen AI at Comcast
June 1, 2023 - Present
Designed and deployed GenAI tools using Python and FastAPI to support annotation workflows and evaluation pipelines for 1,000+ daily users. Built production-grade RAG systems with ChromaDB/FAISS and data pipelines for JSONL/CSV. Fine-tuned open-source LLMs (LLaMA) on AWS SageMaker and deployed them as SageMaker Endpoints behind API Gateway and AWS Lambda. Developed full-stack interfaces with React.js and FastAPI for real-time LLM output evaluation. Automated dataset ingestion with Airflow for nightly retraining (99% reliability) and created automated LLM evaluation pipelines using BLEU, ROUGE, and METEOR to assess quality and factual accuracy across models.
Graduate Teaching Assistant at University of Colorado Denver
May 1, 2024 - September 19, 2025
Led C++ programming labs for Data Structures & Program Design, improving student success rates. Mentored 50+ students in advanced C++ concepts, debugging techniques, and best practices.
Software Developer at SparkCognition
July 1, 2022 - September 19, 2025
Developed and deployed computer vision models (YOLOv5) for industrial defect detection, achieving 94% precision in real-time manufacturing. Built full-stack dashboards with React.js and Node.js to visualize inspection results. Created REST APIs in Express.js handling 500+ requests/hour. Implemented MLflow to track model versions and experiments, built automated monitoring with real-time alerts, and reduced backend inference latency from 800 ms to 320 ms. Collaborated with domain experts to build data annotation and configuration tools.

Education

Master of Science in Computer Science at University of Colorado Denver
August 1, 2022 - May 1, 2024

Qualifications

AWS Certified Developer - Associate
February 1, 2024 - September 19, 2025

Industry Experience

Financial Services, Manufacturing, Education, Software & Internet