Hi, I’m Sheheryar Ramzan, an AI/LLM Engineer specializing in fine-tuning, evaluating, and deploying large language models in production. I love turning cutting-edge AI research into reliable, scalable applications, building agentic systems, RAG pipelines, and prompt engineering frameworks for real-world impact. Currently I work across two remote AI roles, delivering LLM-powered products from fine-tuned Mistral 7B endpoints to autonomous multi-agent research systems. I enjoy the challenge of turning complex data into practical, user-friendly solutions that help teams build smarter products.

Sheheryar Ramzan

Hi, I’m Sheheryar Ramzan, an AI/LLM Engineer specializing in fine-tuning, evaluating, and deploying large language models in production. I love turning cutting-edge AI research into reliable, scalable applications, building agentic systems, RAG pipelines, and prompt engineering frameworks for real-world impact. Currently I work across two remote AI roles, delivering LLM-powered products from fine-tuned Mistral 7B endpoints to autonomous multi-agent research systems. I enjoy the challenge of turning complex data into practical, user-friendly solutions that help teams build smarter products.

Available to hire

Hi, I’m Sheheryar Ramzan, an AI/LLM Engineer specializing in fine-tuning, evaluating, and deploying large language models in production. I love turning cutting-edge AI research into reliable, scalable applications, building agentic systems, RAG pipelines, and prompt engineering frameworks for real-world impact.

Currently I work across two remote AI roles, delivering LLM-powered products from fine-tuned Mistral 7B endpoints to autonomous multi-agent research systems. I enjoy the challenge of turning complex data into practical, user-friendly solutions that help teams build smarter products.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Intermediate
Intermediate
Intermediate
Intermediate
See more

Language

English
Fluent

Work Experience

AI Engineer at AlphatechLogics
March 1, 2025 - Present
Developed a real-time pipeline with DensePose and YOLO for logo transfer onto torsos, reducing jitter by 50%. Worked on 3D avatar reconstruction from monocular video using MultiPLY, trained on 40+ multi-person videos.
Algorithm Engineer at Software Motion (Suzhou) Engineering Services Co. Ltd
July 1, 2024 - Present
Successfully converted a two-stage ONNX model to a single-stage ONNX model, reducing inference time by 24 ms. Supported MMDetection3D and BEVFusion model training, evaluation, and export, achieving a 3% mAP gain. Introduced new object detection classes to broaden model functionality for diverse use cases.
Machine Learning Intern at Center of Excellence - Artificial Intelligence
August 31, 2023 - July 18, 2025
Utilized TensorFlow.js and JavaScript for real-time pose estimation with PoseNet. Researched pose-related problems and applied OpenPose, DensePose, and YOLO to pose estimation tasks.
Teaching Assistant at FAST National University of Computer and Emerging Sciences
June 30, 2024 - July 18, 2025
Assisted in grading assignments for 100+ students, providing constructive feedback to enhance learning outcomes. Supported students in Python programming, focusing on interpolation techniques.
AI Engineer + Automation Engineer at AlphatechLogics
March 1, 2025 - Present
Engineered a multilingual voice-to-SQL pipeline (Mandarin & English) using Gemini 2.5 Flash and Whisper, achieving 95% SQL query accuracy and reducing manual data entry by 75%. Built AI-powered automation workflows using n8n and Google Gemini Vision API, including an invoice processing pipeline that extracts 19+ fields into MongoDB with relational null-safe schema. Developed an e-commerce product image automation system with LangChain, Firecrawl, and Supabase (dual-trigger workflow reducing API calls by 40%). Built a real-time perception pipeline (YOLOv11 + DensePose + SMPL) for logo transfer onto torsos, reducing temporal jitter by 50%.
Perception AI Engineer [Concurrent Remote Contract] at Software Motion (Suzhou) Engineering Services Co. Ltd
July 1, 2024 - Present
Leading ONNX deployment workstream — model export, PyTorch–ONNX alignment, and output validation; converted two-stage to single-stage pipeline, reducing inference time by 24ms. Supported MMDetection3D and BEVFusion model training and evaluation; configuration improvements achieved a 3% mAP gain.
Machine Learning Intern at Center of Excellence – Artificial Intelligence
July 1, 2023 - August 31, 2023
Implemented real-time pose estimation using TensorFlow.js and PoseNet; researched and applied OpenPose, DensePose, and YOLO across pose estimation tasks.
Teaching Assistant – Numerical Computing at FAST National University of Computer and Emerging Sciences
January 1, 2024 - June 30, 2024
Supported 100+ students in Python programming and interpolation techniques; provided structured feedback on assignments.

Education

Bachelor of Science at FAST National University of Computer and Emerging Sciences
January 1, 2020 - June 30, 2024
Bachelor of Science in Computer Science at FAST National University of Computer and Emerging Sciences
January 11, 2030 - June 1, 2024

Qualifications

Generative AI with Large Language Models
January 1, 2025 - December 31, 2025
LangChain Chat with Your Data
January 1, 2025 - December 31, 2025
Neural Networks and Deep Learning
January 1, 2023 - December 31, 2023
AWS Academy Graduate - AWS Academy Microservices and CI/CD Pipeline Builder
January 1, 2024 - December 31, 2024
Generative AI with Large Language Models
January 1, 2025 - March 26, 2026
LangChain: Chat with Your Data
January 1, 2025 - March 26, 2026
Neural Networks and Deep Learning
January 1, 2023 - March 26, 2026
AWS Academy Graduate — Microservices & CI/CD Pipeline Builder
January 1, 2024 - March 26, 2026

Industry Experience

Software & Internet, Computers & Electronics, Education, Healthcare, Gaming, Media & Entertainment, Professional Services