I am an AI/ML Engineer with 4+ years of experience designing and deploying scalable machine learning systems and Generative AI solutions in production environments. I specialize in large language models, retrieval-augmented generation, and end-to-end ML pipelines from data processing to real-time inference. I have a strong background in MLOps, cloud-native architectures, and distributed systems using AWS, Kubernetes, and PySpark. I am passionate about optimizing model performance, reducing latency, and delivering measurable business impact through robust, production-ready solutions. I enjoy collaborating with cross-functional teams to architect scalable, secure, and observable AI systems.

Parimal Jaykumar Ingle

I am an AI/ML Engineer with 4+ years of experience designing and deploying scalable machine learning systems and Generative AI solutions in production environments. I specialize in large language models, retrieval-augmented generation, and end-to-end ML pipelines from data processing to real-time inference. I have a strong background in MLOps, cloud-native architectures, and distributed systems using AWS, Kubernetes, and PySpark. I am passionate about optimizing model performance, reducing latency, and delivering measurable business impact through robust, production-ready solutions. I enjoy collaborating with cross-functional teams to architect scalable, secure, and observable AI systems.

Available to hire

I am an AI/ML Engineer with 4+ years of experience designing and deploying scalable machine learning systems and Generative AI solutions in production environments. I specialize in large language models, retrieval-augmented generation, and end-to-end ML pipelines from data processing to real-time inference. I have a strong background in MLOps, cloud-native architectures, and distributed systems using AWS, Kubernetes, and PySpark.

I am passionate about optimizing model performance, reducing latency, and delivering measurable business impact through robust, production-ready solutions. I enjoy collaborating with cross-functional teams to architect scalable, secure, and observable AI systems.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
See more

Work Experience

Software Engineer at ServiceNow, GA
November 1, 2024 - Present
Architected and delivered scalable backend services using Java and Spring Boot, refactoring monolithic components into microservices and improving end-to-end request latency by 20% under production load. Designed secure authentication and authorization flows using Spring Security and JPA, enforcing RBAC and ensuring data integrity across enterprise workflows serving thousands of users. Built and optimized RESTful APIs with JDBC and servlet-based integrations, enabling reliable high-volume database interactions. Led cloud deployment and CI/CD automation on AWS (EC2, ECS, Lambda, S3) with Jenkins, reducing deployment time by 15% and improving release reliability. Developed automated test suites with JUnit/TestNG achieving 90% code coverage and monitored performance with Splunk to sustain 99.9% uptime.
Software Engineer II at Orion Technolab, India
May 1, 2022 - July 1, 2023
Designed and deployed Spring Boot microservices exposing REST APIs to monitor router telemetry; implemented a real-time, event-driven processing pipeline using Apache Kafka handling 800 req/sec with fault tolerance. Practiced TDD with JUnit/Mockito achieving 90% test coverage, improving release stability. Collaborated with clients and engineers during UAT and production rollouts; resolved issues via Jira/CloudWatch/JBoss with 98% SLA adherence. Mentored 7 junior engineers, improving sprint delivery and onboarding efficiency.
Software Engineer I at Orion Technolab, India
February 1, 2021 - May 1, 2022
Engineered high-throughput financial analytics service in Java 8/Spring Boot, processing real-time and batch data with strict latency and consistency requirements; designed REST APIs across Spring Boot and Node.js (Express); built a React-based analytics dashboard with Redux/D3 for live visualizations, boosting engagement. Implemented reusable frontend components and optimized data fetching to sustain performance under peak loads. Automated CI/CD with Jenkins and Git pipelines, reducing release cycle time by 40% and decreasing manual deployment errors.
Software Engineer Intern at Orion Technolab, India
August 1, 2020 - February 1, 2021
Built a network protocol analytics service using Python, Flask, and Scrapy to automate ingestion and analysis; reduced end-to-end processing time to under 10 minutes. Implemented scheduled data pipelines with cron and MongoDB-backed storage, enabling reliable daily exports to third-party applications via REST APIs and eliminating manual overhead.
AI/ML Engineer at ServiceNow
November 1, 2024 - Present
Led the architecture and deployment of a RAG-based knowledge assistant integrated with ServiceNow, enabling semantic search and automated incident summarization across 100K+ records; reduced average resolution time by 30%. Designed end-to-end document ingestion and embedding pipelines with optimized chunking and vector indexing to improve retrieval relevance and reduce incorrect responses by 25%. Built low-latency, secure REST APIs for ServiceNow workflows and achieved sub-300ms responses under enterprise load. Deployed scalable containerized services with autoscaling and health monitoring to ensure high availability. Implemented evaluation and monitoring to track response accuracy, latency, and usage, reducing manual knowledge lookup by 40%.
ML Engineer II at Orion Technolab
May 1, 2022 - July 1, 2023
Architected and scaled an end-to-end ML pipeline using Python, PySpark, SQL, and Apache Airflow, processing 12M+ daily transactions and reducing risk scoring latency from 4 hours to 15 minutes. Developed and optimized credit risk models using XGBoost and LightGBM (Scikit-learn API), improving AUC from 0.71 to 0.86; handled class imbalance with SMOTE and class-weight tuning. Built a centralized feature store using AWS S3, Glue, and PostgreSQL, engineering 250+ features and standardizing training/inference pipelines to eliminate feature drift. Implemented model monitoring and drift detection using Evidently AI, Prometheus, and Grafana, tracking data distribution shifts and prediction stability in real time. Containerized and deployed models as RESTful microservices using FastAPI, Docker, Kubernetes (EKS), achieving sub-100ms inference latency and 99.9% uptime. Established MLOps workflows with GitHub Actions CI/CD, MLflow for experiment tracking, and model versioning, enabling automated r

Education

Master of Science in Information Systems at California State University, Long Beach
January 11, 2030 - May 1, 2025
Bachelor of Technology in Electronics & Telecommunications at Narsee Monjee Institute of Management Studies (NMIMS)
January 11, 2030 - August 1, 2021
Master of Science in Information Systems at California State University, Long Beach
January 11, 2030 - May 1, 2025

Qualifications

Meta Front-End Developer Professional Certificate
January 11, 2030 - February 23, 2026
Generative AI with Large Language Models
January 11, 2030 - February 23, 2026

Industry Experience

Software & Internet, Professional Services, Media & Entertainment, Education