Looks like you have JavaScript disabled. For the full Twine experience, you will need to re-enable it.

I am an AI/ML Engineer with 4+ years of experience designing and deploying scalable machine learning systems and Generative AI solutions in production environments. I specialize in large language models, retrieval-augmented generation, and end-to-end ML pipelines from data processing to real-time inference. I have a strong background in MLOps, cloud-native architectures, and distributed systems using AWS, Kubernetes, and PySpark. I am passionate about optimizing model performance, reducing latency, and delivering measurable business impact through robust, production-ready solutions. I enjoy collaborating with cross-functional teams to architect scalable, secure, and observable AI systems.…I am an AI/ML Engineer with 4+ years of experience designing and deploying scalable machine learning systems and Generative AI solutions in production environments. I specialize in large language models, retrieval-augmented generation, and end-to-end ML pipelines from data processing to real-time inference. I have a strong background in MLOps, cloud-native architectures, and distributed systems using AWS, Kubernetes, and PySpark. I am passionate about optimizing model performance, reducing latency, and delivering measurable business impact through robust, production-ready solutions. I enjoy collaborating with cross-functional teams to architect scalable, secure, and observable AI systems.

Parimal Jaykumar Ingle

Full Stack Developer, Back-End Developer, Web Developer, +6





I am an AI/ML Engineer with 4+ years of experience designing and deploying scalable machine learning systems and Generative AI solutions in production environments. I specialize in large language models, retrieval-augmented generation, and end-to-end ML pipelines from data processing to real-time inference. I have a strong background in MLOps, cloud-native architectures, and distributed systems using AWS, Kubernetes, and PySpark. I am passionate about optimizing model performance, reducing latency, and delivering measurable business impact through robust, production-ready solutions. I enjoy collaborating with cross-functional teams to architect scalable, secure, and observable AI systems.…I am an AI/ML Engineer with 4+ years of experience designing and deploying scalable machine learning systems and Generative AI solutions in production environments. I specialize in large language models, retrieval-augmented generation, and end-to-end ML pipelines from data processing to real-time inference. I have a strong background in MLOps, cloud-native architectures, and distributed systems using AWS, Kubernetes, and PySpark. I am passionate about optimizing model performance, reducing latency, and delivering measurable business impact through robust, production-ready solutions. I enjoy collaborating with cross-functional teams to architect scalable, secure, and observable AI systems.

Available to hire

I am an AI/ML Engineer with 4+ years of experience designing and deploying scalable machine learning systems and Generative AI solutions in production environments. I specialize in large language models, retrieval-augmented generation, and end-to-end ML pipelines from data processing to real-time inference. I have a strong background in MLOps, cloud-native architectures, and distributed systems using AWS, Kubernetes, and PySpark.

I am passionate about optimizing model performance, reducing latency, and delivering measurable business impact through robust, production-ready solutions. I enjoy collaborating with cross-functional teams to architect scalable, secure, and observable AI systems.

Skills

Experience Level

Expert

Expert

Expert

Expert

Expert

Expert

Expert

Expert

Expert

Expert

Expert

Expert

Expert

Expert

Expert

Work Experience

AI/ML Engineer at ServiceNow

November 1, 2024 - Present

Led the architecture and deployment of a RAG-based knowledge assistant integrated with ServiceNow, enabling semantic search and automated incident summarization across 100K+ records; reduced average resolution time by 30%. Designed end-to-end document ingestion and embedding pipelines with optimized chunking and vector indexing to improve retrieval relevance and reduce incorrect responses by 25%. Built low-latency, secure REST APIs for ServiceNow workflows and achieved sub-300ms responses under enterprise load. Deployed scalable containerized services with autoscaling and health monitoring to ensure high availability. Implemented evaluation and monitoring to track response accuracy, latency, and usage, reducing manual knowledge lookup by 40%.

Software Engineer at ServiceNow, GA

November 1, 2024 - Present

Architected and delivered scalable backend services using Java and Spring Boot, refactoring monolithic components into microservices and improving end-to-end request latency by 20% under production load. Designed secure authentication and authorization flows using Spring Security and JPA, enforcing RBAC and ensuring data integrity across enterprise workflows serving thousands of users. Built and optimized RESTful APIs with JDBC and servlet-based integrations, enabling reliable high-volume database interactions. Led cloud deployment and CI/CD automation on AWS (EC2, ECS, Lambda, S3) with Jenkins, reducing deployment time by 15% and improving release reliability. Developed automated test suites with JUnit/TestNG achieving 90% code coverage and monitored performance with Splunk to sustain 99.9% uptime.

ML Engineer II at Orion Technolab

May 1, 2022 - July 1, 2023

Architected and scaled an end-to-end ML pipeline using Python, PySpark, SQL, and Apache Airflow, processing 12M+ daily transactions and reducing risk scoring latency from 4 hours to 15 minutes. Developed and optimized credit risk models using XGBoost and LightGBM (Scikit-learn API), improving AUC from 0.71 to 0.86; handled class imbalance with SMOTE and class-weight tuning. Built a centralized feature store using AWS S3, Glue, and PostgreSQL, engineering 250+ features and standardizing training/inference pipelines to eliminate feature drift. Implemented model monitoring and drift detection using Evidently AI, Prometheus, and Grafana, tracking data distribution shifts and prediction stability in real time. Containerized and deployed models as RESTful microservices using FastAPI, Docker, Kubernetes (EKS), achieving sub-100ms inference latency and 99.9% uptime. Established MLOps workflows with GitHub Actions CI/CD, MLflow for experiment tracking, and model versioning, enabling automated r

Software Engineer II at Orion Technolab, India

May 1, 2022 - July 1, 2023

Designed and deployed Spring Boot microservices exposing REST APIs to monitor router telemetry; implemented a real-time, event-driven processing pipeline using Apache Kafka handling 800 req/sec with fault tolerance. Practiced TDD with JUnit/Mockito achieving 90% test coverage, improving release stability. Collaborated with clients and engineers during UAT and production rollouts; resolved issues via Jira/CloudWatch/JBoss with 98% SLA adherence. Mentored 7 junior engineers, improving sprint delivery and onboarding efficiency.

Software Engineer I at Orion Technolab, India

February 1, 2021 - May 1, 2022

Engineered high-throughput financial analytics service in Java 8/Spring Boot, processing real-time and batch data with strict latency and consistency requirements; designed REST APIs across Spring Boot and Node.js (Express); built a React-based analytics dashboard with Redux/D3 for live visualizations, boosting engagement. Implemented reusable frontend components and optimized data fetching to sustain performance under peak loads. Automated CI/CD with Jenkins and Git pipelines, reducing release cycle time by 40% and decreasing manual deployment errors.

Software Engineer Intern at Orion Technolab, India

August 1, 2020 - February 1, 2021

Built a network protocol analytics service using Python, Flask, and Scrapy to automate ingestion and analysis; reduced end-to-end processing time to under 10 minutes. Implemented scheduled data pipelines with cron and MongoDB-backed storage, enabling reliable daily exports to third-party applications via REST APIs and eliminating manual overhead.