I am Ayaan Shaheer, an MLOps & GenAI Engineer with 2+ years of experience building and operating production AI systems across cloud infrastructure, backend engineering, and machine learning platforms.

I'm experienced in deploying LLMs, designing RAG architectures, automating ML delivery pipelines, and running scalable inference workloads on Kubernetes. Most AI projects fail between the notebook and production, and my focus is building the systems that bridge that gap.

Ayaan Shaheer

Available to hire

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Intermediate

Work Experience

MLOps & GenAI Engineer at Royal Cloud Consultancy (RCC)
December 19, 2024 - Present
Engineered and deployed 5+ production ML and LLM services using Python, FastAPI, Docker, and Kubernetes, supporting AI workloads across development, staging, and production environments.
Built RAG pipelines leveraging FAISS, ChromaDB, and Qdrant, enabling semantic retrieval across 50K+ documents while reducing information retrieval time by ~60%.
Developed scalable inference APIs processing 1,000+ daily requests and achieved 99.5%+ service availability through containerized deployments and automated recovery workflows.
Automated CI/CD pipelines using GitHub Actions, reducing deployment time from hours to under 15 minutes and enabling faster release cycles.
Implemented observability with Prometheus and Grafana, monitoring 20+ infrastructure and application metrics and reducing issue detection time by ~50%.
AI/ML Engineer Intern at Avance Infra Tech Pvt Ltd
July 1, 2024 - December 1, 2024
Built end-to-end machine learning workflows covering data preprocessing, feature engineering, model training, and evaluation using Python-based ML frameworks. Assisted in deploying ML solutions and integrating inference pipelines with backend services, building foundational experience in MLOps and production AI systems.

Education

Bachelor of Computer Applications at Jamia Hamdard University
January 11, 2030 - June 29, 2026

Industry Experience

Software & Internet