Hi, I’m Zaafir Rizwan, a senior AI engineer focused on LLMs, voice AI, and production-grade conversational systems. I design and operate cloud-native AI infrastructure across GCP, Azure, and AWS, with strong emphasis on real-time speech processing, RAG architectures, and scalable MLOps practices. I enjoy translating complex AI capabilities into measurable business value for cross-functional teams. Over the past few years, I’ve built real-time voice agents with sub-500ms latency, integrated enterprise telephony via SIP and WebRTC, and delivered end-to-end AI solutions from model fine-tuning to automated evaluation and observability. I thrive in collaborative environments that challenge me to innovate while maintaining reliability and cost-efficiency.

Zaafir Rizwan

Hi, I’m Zaafir Rizwan, a senior AI engineer focused on LLMs, voice AI, and production-grade conversational systems. I design and operate cloud-native AI infrastructure across GCP, Azure, and AWS, with strong emphasis on real-time speech processing, RAG architectures, and scalable MLOps practices. I enjoy translating complex AI capabilities into measurable business value for cross-functional teams. Over the past few years, I’ve built real-time voice agents with sub-500ms latency, integrated enterprise telephony via SIP and WebRTC, and delivered end-to-end AI solutions from model fine-tuning to automated evaluation and observability. I thrive in collaborative environments that challenge me to innovate while maintaining reliability and cost-efficiency.

Available to hire

Hi, I’m Zaafir Rizwan, a senior AI engineer focused on LLMs, voice AI, and production-grade conversational systems. I design and operate cloud-native AI infrastructure across GCP, Azure, and AWS, with strong emphasis on real-time speech processing, RAG architectures, and scalable MLOps practices. I enjoy translating complex AI capabilities into measurable business value for cross-functional teams.

Over the past few years, I’ve built real-time voice agents with sub-500ms latency, integrated enterprise telephony via SIP and WebRTC, and delivered end-to-end AI solutions from model fine-tuning to automated evaluation and observability. I thrive in collaborative environments that challenge me to innovate while maintaining reliability and cost-efficiency.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Intermediate
Intermediate
See more

Language

English
Fluent
Urdu
Advanced

Work Experience

Senior AI Engineer at Quest Lab
November 1, 2023 - October 15, 2025
Led the development of advanced video understanding AI leveraging Gemini’s multimodal capabilities for real-time analysis and content summarization. Architected ComfyUI solutions for image generation, editing, and video synthesis enabling flexible multimedia workflows. Engineered complex Retrieval-Augmented Generation (RAG) systems with custom training on organizational datasets to improve semantic search and retrieval precision. Designed autonomous AI agents with enhanced prompt engineering to achieve high-quality conversational task completions. Integrated Speech-to-Text and Text-to-Speech solutions with Whisper, OpenAI, and ElevenLabs APIs to launch voice-enabled features and improve product accessibility. Built scalable AI infrastructure with CI/CD pipelines, automated monitoring, and cloud cost optimizations ensuring high reliability. Spearheaded data engineering pipelines using Azure Data Factory to enable accurate time series forecasting in livestock management applications. D
AI Research Intern at Folio3
July 31, 2022 - August 26, 2025
Developed advanced computer vision models using PyTorch and OpenCV for an automated product tagging system that influenced the company’s AI strategy. Gained end-to-end experience in the machine learning workflow including data preprocessing, feature engineering, model experimentation, evaluation, and cloud-based deployment strategies. Provided technical leadership through model performance analysis and data-driven presentations, informing senior engineering decisions for ongoing AI initiatives.
Senior AI Engineer at Quest Lab (AI Startup)
October 1, 2025 - October 1, 2025
Architected advanced RAG and multimodal systems; fine-tuned compact language models for targeted applications; built an end-to-end Livestock Demand Forecasting pipeline via Azure Data Factory; implemented cutting-edge video analysis with Grounding DINO and Gemini for video understanding; developed AI-powered resume scoring and automated product visualization workflows.
AI Research Intern at Folio3
July 1, 2022 - July 1, 2022
Developed deep learning computer vision models with PyTorch and transformer architectures; implemented production-grade code practices and automated tagging systems; built end-to-end ML workflows and cloud deployments using MLOps best practices and containerized AI applications.
Senior AI Engineer at Quest Lab (AI Startup)
November 1, 2023 - October 1, 2025
Architected a sophisticated Retrieval-Augmented Generation (RAG) system capable of understanding both explicit and implicit knowledge from multimodal data sources. Fine-tuned compact language models for targeted business applications. Built end-to-end data pipelines (including livestock demand forecasting) and advanced video analysis using Grounding DINO, along with AI-driven recruitment and generative product visualization workflows. Delivered containerized deployments with Docker and Kubernetes, achieving 99.9% uptime and faster release cycles.
Senior Voice AI Engineer at Trifusion Tech
December 1, 2025 - Present
Architected production voice agents with LiveKit achieving sub-500ms latency for real-time conversations using Deepgram STT, LLM processing, and Cartesia/Minimax/ElevenLabs TTS. Integrated NetSapiens enterprise telephony via SIP trunk and WebRTC, enabling seamless voice AI deployment at scale. Implemented observability infrastructure with OpenTelemetry and ClickHouse for real-time performance monitoring and distributed tracing across multi-cloud deployments (GCP, Azure). Developed MCP calendar integrations enabling voice agents to intelligently manage scheduling through natural conversation.
DevOps Engineer (Freelance) at Remote Contract
December 1, 2025 - January 1, 2026
Designed CI/CD pipelines with GitHub Actions for microservices architecture, automating build, test, and deployment workflows. Integrated Doppler for centralized secrets management with automated rotation across multiple environments. Built Docker containerization strategy with multi-stage Dockerfiles and automated image versioning.

Education

Bachelor of Science in Artificial Intelligence at National University of Computer and Emerging Sciences (NUCES)
August 19, 2019 - June 13, 2023
Bachelor of Science at National University of Computer and Emerging Sciences (NUCES) - Islamabad
January 1, 2019 - January 1, 2023
Bachelor of Science in Artificial Intelligence at National University of Computer and Emerging Sciences (NUCES)
January 1, 2019 - January 1, 2023

Qualifications

Introduction to Data Engineering Specialization – Coursera
January 1, 2025 - August 26, 2025
Source Systems, Data Ingestion, and Pipelines – Coursera
January 1, 2025 - August 26, 2025
Deep Learning Specialization – Coursera
January 1, 2023 - August 26, 2025
AWS Machine Learning Engineer Nanodegree – Udacity
January 1, 2023 - August 26, 2025
AI Programming with Python Nanodegree – Udacity
January 1, 2023 - August 26, 2025
Introduction to Data Engineering Specialization – Coursera
January 1, 2025 - January 1, 2025
Source Systems, Data Ingestion, and Pipelines – Coursera
January 1, 2025 - January 1, 2025
Deep Learning Specialization – Coursera
January 1, 2023 - January 1, 2023
AWS Machine Learning Engineer Nanodegree – Udacity
January 1, 2023 - January 1, 2023
AI Programming with Python Nanodegree – Udacity
January 1, 2023 - January 1, 2023
Introduction to Data Engineering Specialization
January 1, 2025 - December 29, 2025
Source Systems, Data Ingestion, and Pipelines – Coursera
January 1, 2025 - December 29, 2025
Deep Learning Specialization
January 1, 2023 - December 29, 2025
AWS Machine Learning Engineer Nanodegree
January 1, 2023 - December 29, 2025
AI Programming with Python Nanodegree
January 1, 2023 - December 29, 2025
Deep Learning Specialization
January 1, 2023 - February 23, 2026
AWS ML Engineer Nanodegree
January 1, 2023 - February 23, 2026
Data Engineering Specialization
January 1, 2025 - February 23, 2026
AI Programming with Python
January 1, 2023 - February 23, 2026

Industry Experience

Software & Internet, Agriculture & Mining, Professional Services, Media & Entertainment, Telecommunications, Education