Looks like you have JavaScript disabled. For the full Twine experience, you will need to re-enable it.

Hi, I’m Sheheryar Ramzan, an AI/LLM Engineer specializing in fine-tuning, evaluating, and deploying large language models in production. I love turning cutting-edge AI research into reliable, scalable applications, building agentic systems, RAG pipelines, and prompt engineering frameworks for real-world impact. Currently I work across two remote AI roles, delivering LLM-powered products from fine-tuned Mistral 7B endpoints to autonomous multi-agent research systems. I enjoy the challenge of turning complex data into practical, user-friendly solutions that help teams build smarter products.…Hi, I’m Sheheryar Ramzan, an AI/LLM Engineer specializing in fine-tuning, evaluating, and deploying large language models in production. I love turning cutting-edge AI research into reliable, scalable applications, building agentic systems, RAG pipelines, and prompt engineering frameworks for real-world impact. Currently I work across two remote AI roles, delivering LLM-powered products from fine-tuned Mistral 7B endpoints to autonomous multi-agent research systems. I enjoy the challenge of turning complex data into practical, user-friendly solutions that help teams build smarter products.

Sheheryar Ramzan

AI Engineer, Web Developer, Programmer, +3





Hi, I’m Sheheryar Ramzan, an AI/LLM Engineer specializing in fine-tuning, evaluating, and deploying large language models in production. I love turning cutting-edge AI research into reliable, scalable applications, building agentic systems, RAG pipelines, and prompt engineering frameworks for real-world impact. Currently I work across two remote AI roles, delivering LLM-powered products from fine-tuned Mistral 7B endpoints to autonomous multi-agent research systems. I enjoy the challenge of turning complex data into practical, user-friendly solutions that help teams build smarter products.…Hi, I’m Sheheryar Ramzan, an AI/LLM Engineer specializing in fine-tuning, evaluating, and deploying large language models in production. I love turning cutting-edge AI research into reliable, scalable applications, building agentic systems, RAG pipelines, and prompt engineering frameworks for real-world impact. Currently I work across two remote AI roles, delivering LLM-powered products from fine-tuned Mistral 7B endpoints to autonomous multi-agent research systems. I enjoy the challenge of turning complex data into practical, user-friendly solutions that help teams build smarter products.

Available to hire

Hi, I’m Sheheryar Ramzan, an AI/LLM Engineer specializing in fine-tuning, evaluating, and deploying large language models in production. I love turning cutting-edge AI research into reliable, scalable applications, building agentic systems, RAG pipelines, and prompt engineering frameworks for real-world impact.

Currently I work across two remote AI roles, delivering LLM-powered products from fine-tuned Mistral 7B endpoints to autonomous multi-agent research systems. I enjoy the challenge of turning complex data into practical, user-friendly solutions that help teams build smarter products.

Skills

Experience Level

Expert

Expert

Expert

Expert

Expert

Expert

Expert

Expert

Expert

Expert

Intermediate

Intermediate

Intermediate

Language

English

Fluent

Work Experience

AI Engineer at AlphatechLogics

March 1, 2025 - Present

Developed a real-time pipeline with DensePose and YOLO for logo transfer onto torsos, reducing jitter by 50%. Worked on 3D avatar reconstruction from monocular video using MultiPLY, trained on 40+ multi-person videos.

Algorithm Engineer at Software Motion (Suzhou) Engineering Services Co. Ltd

July 1, 2024 - Present

Successfully converted a two-stage ONNX model to a single-stage ONNX model, reducing inference time by 24 ms. Supported MMDetection3D and BEVFusion model training, evaluation, and export, achieving a 3% mAP gain. Introduced new object detection classes to broaden model functionality for diverse use cases.

Machine Learning Intern at Center of Excellence - Artificial Intelligence

August 31, 2023 - July 18, 2025

Utilized TensorFlow.js and JavaScript for real-time pose estimation with PoseNet. Researched pose-related problems and applied OpenPose, DensePose, and YOLO to pose estimation tasks.

Teaching Assistant at FAST National University of Computer and Emerging Sciences

June 30, 2024 - July 18, 2025

Assisted in grading assignments for 100+ students, providing constructive feedback to enhance learning outcomes. Supported students in Python programming, focusing on interpolation techniques.

AI Engineer + Automation Engineer at AlphatechLogics

March 1, 2025 - Present

Engineered a multilingual voice-to-SQL pipeline (Mandarin & English) using Gemini 2.5 Flash and Whisper, achieving 95% SQL query accuracy and reducing manual data entry by 75%. Built AI-powered automation workflows using n8n and Google Gemini Vision API, including an invoice processing pipeline that extracts 19+ fields into MongoDB with relational null-safe schema. Developed an e-commerce product image automation system with LangChain, Firecrawl, and Supabase (dual-trigger workflow reducing API calls by 40%). Built a real-time perception pipeline (YOLOv11 + DensePose + SMPL) for logo transfer onto torsos, reducing temporal jitter by 50%.

Perception AI Engineer [Concurrent Remote Contract] at Software Motion (Suzhou) Engineering Services Co. Ltd

July 1, 2024 - Present

Leading ONNX deployment workstream — model export, PyTorch–ONNX alignment, and output validation; converted two-stage to single-stage pipeline, reducing inference time by 24ms. Supported MMDetection3D and BEVFusion model training and evaluation; configuration improvements achieved a 3% mAP gain.

Machine Learning Intern at Center of Excellence – Artificial Intelligence

July 1, 2023 - August 31, 2023

Implemented real-time pose estimation using TensorFlow.js and PoseNet; researched and applied OpenPose, DensePose, and YOLO across pose estimation tasks.

Teaching Assistant – Numerical Computing at FAST National University of Computer and Emerging Sciences

January 1, 2024 - June 30, 2024

Supported 100+ students in Python programming and interpolation techniques; provided structured feedback on assignments.