N/A

Harshit Gajjar

N/A

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
See more

Language

English
Fluent

Work Experience

Senior Engineer at Fractal Analytics
February 1, 2024 - November 25, 2025
Led RAG pipelines with LangChain using vector embeddings and smart document chunking (FAISS/Pinecone), improving LLM response relevance and retrieval accuracy by 30%. Developed and fine-tuned transformer-based NLP models using PyTorch and Hugging Face Transformers, applying improved prompt design and optimization techniques to increase prediction accuracy by 28% in production systems. Built high-performing ML models using XGBoost and TensorFlow, leveraging feature engineering and time-series forecasting methods to boost model precision by 15%. Implemented automated training and deployment pipelines with MLflow and Kubernetes, enabling scalable ML operations and reducing processing time by 40% while cutting cloud computing costs by 25%. Architected reliable ETL data pipelines to process structured and unstructured data using vector embeddings and scalable orchestration, achieving 99% reliability for downstream analytics workflows. Collaborated with stakeholders to deliver enterprise AI
Software Development Engineer Intern at Amazon
August 31, 2023 - August 31, 2023
Trained, deployed, and maintained 5+ ML models for demand forecasting and anomaly detection using CNN and RNN architectures, improving prediction accuracy by 22% and enabling scalable inference for 50K+ transactions daily in AWS SageMaker. Built scalable data preprocessing and feature engineering pipelines with Python, Spark, and Bash, reducing model training time by 35% and refined hyperparameter tuning with Bayesian methods, increasing F1-scores by up to 18%. Implemented automated scripts to monitor model drift and integrated ML outputs using GitHub, Docker, AWS EC2, and CI/CD pipelines, reducing manual interventions by 40% and generating $500K+ projected annual savings.
Software Developer at PwC
January 31, 2022 - January 31, 2022
Fabricated and activated 6+ machine learning solutions for fraud detection and risk scoring across client engagements, improving anomaly detection accuracy by 24%. Designed and maintained end-to-end ETL pipelines in Python and SQL for large-scale processing, handling 2M+ financial records weekly with 99.5% reliability for accurate downstream analytics. Built scalable REST APIs and Flask-based microservices to integrate ML predictions into enterprise apps, reducing manual verification efforts by 40%. Orchestrated complex data validation and preprocessing with PySpark on distributed clusters, cutting data prep time from 12 hours to 3 hours per batch. Engineered interactive dashboards in Power BI and Tableau, accelerating financial client reporting cycles by 50%. Coordinated with auditors and developers to embed AI-driven decision support tools, generating $1.2M+ measurable efficiency savings across global projects.

Education

MS in Computer Science at Northeastern University, USA
January 1, 2022 - December 31, 2023
BE in Computer Engineering at BITS Pilani, UAE
August 1, 2016 - August 31, 2020

Qualifications

Add your qualifications or awards here.

Industry Experience

Financial Services, Retail, Manufacturing, Software & Internet, Professional Services