I'm a results-driven ML/MLOps engineer with 7+ years of experience building, deploying, and scaling ML/AI systems across multi-cloud environments (AWS, Azure, GCP). I specialize in MLOps pipelines, CI/CD, observability, infrastructure-as-code, container orchestration, and LLMOps. I excel at turning AI research into production-grade solutions that drive measurable business outcomes. I thrive on delivering secure, reliable, and scalable AI systems by collaborating with cross-functional teams, implementing robust monitoring, and automating end-to-end model lifecycles. My focus is on reducing downtime, optimizing latency, and accelerating ML lifecycle automation to enable rapid, safe adoption of AI in production.

Garcha Simranjeet

Available to hire

Language

English
Fluent

Work Experience

Senior ML/MLOps Engineer at PixelFyre Code Labs
February 1, 2023 - Present
Architected and deployed a generative AI-powered Clinical Exam Preparation and Simulation System using Azure OpenAI (GPT models) and AWS Lambda@Edge APIs, reducing editorial turnaround time by 40%. Delivered end-to-end MLOps consulting across AWS SageMaker, Azure ML, and GCP Vertex AI, standardizing model lifecycle management (build, test, deploy) for text, vision, and multimodal generative models. Implemented safety and reliability measures for AI systems in high-risk environments, audited LLM pipelines to identify and mitigate adversarial threats, and worked with engineering teams to enforce security protocols. Developed and deployed custom ML/LLM pipelines using Python and Docker, integrating model APIs into content workflows and analytics systems, and created internal automation tools to accelerate operational workflows. Refined prompt engineering and model versioning, and tuned latency. Defined and tracked SLIs/SLOs, reducing downtime and enabling automated rollback strategies. Facilitated blameless post-mortems and introduced DevOps maturity models tailored to each client's transformation journey.
Senior AI Software Engineer at Cohere
April 30, 2023 - October 3, 2025
Led development of Crane-GPT, an intelligent assistant that reduces crane operator workload and errors. Productionized BERT-based NLU services using Ray Serve and Python, reducing inference latency by 30% through optimized batching and model caching. Built Retrieval-Augmented Generation (RAG) pipelines for enterprise search with vector stores and real-time grounding. Delivered a GCP-based containerization project with secure CI/CD pipelines and container security policies. Automated retraining of product recommendation models with Vertex AI pipelines, increasing relevance by 25% in A/B tests. Improved PostgreSQL backup and restoration strategies and built Grafana dashboards monitoring uptime, latency, and reliability. Integrated ML models into backend APIs and streaming pipelines via Python and gRPC for real-time recommendations.
Senior Full Stack Engineer at InfiniteT3ch
July 31, 2020 - October 3, 2025
Designed and developed full-stack web applications with Python on the backend and React on the frontend, integrating data via GraphQL and REST APIs. Built and optimized PostgreSQL-backed services with efficient query design, indexing, and schema versioning. Migrated enterprise workloads from GCP to Azure and deployed ECS-based infrastructure with automated CI/CD pipelines. Contributed to disaster recovery planning and system validation scripts.
AI Software Engineer at Evenset Inc
September 30, 2017 - October 3, 2025
Improved system performance and uptime by automating deployment and monitoring. Refactored and optimized data pipelines to improve reliability and business continuity and to reduce failures. Contributed to operational resilience through data engineering and infrastructure upgrades.

Education

Master’s in Computer Science at Carleton University
August 1, 2014 - May 1, 2015
Bachelor’s in Computer Engineering at University of Texas at San Antonio (UTSA)
January 1, 2010 - January 1, 2014

Industry Experience

Software & Internet, Healthcare, Professional Services, Education, Media & Entertainment