I am Sheharyar Ali Kazmi, a Senior AI/ML Engineer with 8+ years of experience designing and deploying scalable machine learning models for NLP, computer vision, and AI-powered applications. I am proficient in Python, TensorFlow, PyTorch, and Hugging Face, and I enjoy turning complex data problems into practical solutions that drive business value. I thrive on building robust ML pipelines, deploying models on AWS and Google Cloud, and implementing MLOps practices to deliver reliable AI at scale. Throughout my career, I have led teams to deliver fraud detection, personalized recommendations, real-time object detection, and AI-enabled automation. I value collaboration, continuous learning, and crafting AI solutions that improve operations and enable innovation. I am passionate about translating domain challenges into elegant, production-ready systems and mentoring engineers to adopt best practices.

Sheharyar Ali Kazmi

Available to hire


Experience Level

Expert

Language

English
Fluent

Work Experience

Senior AI/ML Software Engineer at SumatoSoft
August 1, 2025 - October 10, 2025
Led fraud-detection initiatives using XGBoost and AutoML, achieving a 30% reduction in fraudulent transactions. Built personalized recommendation engines with TensorFlow and PyTorch, increasing user engagement by 20%. Designed real-time video surveillance models with PyTorch and YOLOv5 to enable security automation. Deployed ML models via AWS SageMaker and Google Cloud ML Engine, cutting inference latency by 35%. Implemented NLP pipelines with Hugging Face and spaCy, created multi-modal AI systems, and integrated LLM-powered assistants into enterprise workflows. Implemented data drift detection with Evidently AI and established robust MLOps using MLflow, Docker, and Kubernetes.
AI/ML Software Engineer at BitBean
November 1, 2022 - October 10, 2025
Developed fraud-detection models with XGBoost and AutoML, reducing fraudulent transactions by 30%. Built recommendation systems with TensorFlow and adaptive ML pipelines, improving user engagement. Implemented real-time object detection with PyTorch and OpenCV, enhancing video analytics. Designed NLP chatbots with Hugging Face and Rasa to improve customer service. Streamlined ML pipelines with Apache Airflow; deployed models with AWS SageMaker and Google Cloud ML Engine to optimize inference times. Enabled on-device AI with TensorFlow Lite. Implemented real-time AI inference with TensorFlow Serving and FastAPI, reducing latency. Integrated FAISS for vector search, boosting recommendation performance. Built computer-vision models for medical imaging and introduced anomaly detection for cybersecurity. Created A/B testing frameworks and automated feature engineering with FeatureTools, cutting development time by ~50%.
Software Engineer at Freelancing
August 1, 2020 - October 10, 2025
Built data preprocessing pipelines using Pandas and NumPy; created REST APIs with Flask and FastAPI; implemented supervised learning models (logistic regression, SVM) for classification tasks. Developed SQL/NoSQL databases (PostgreSQL, MongoDB) for ML data management. Created data visualization dashboards with Matplotlib and Seaborn. Automated deployment scripts using Bash and Git, and streamlined ML pipelines with Apache Airflow, MLflow, and Docker. Delivered edge AI modules with TensorFlow Lite and deployed models with Flask/FastAPI and Docker.
Senior AI/ML Software Engineer at SumatoSoft
August 1, 2025 - October 10, 2025
Designed and deployed fraud detection systems using XGBoost and AutoML; built personalized recommendation engines with TensorFlow and improved user engagement by 20%. Implemented real-time video surveillance models with PyTorch and YOLOv5, deployed on AWS SageMaker and Google Cloud ML Engine, improving response times by 35%. Created adversarial training pipelines, automated MLOps workflows using MLflow, Docker, and Kubernetes, and integrated LLM-based AI assistants into enterprise apps. Implemented data drift detection with Evidently AI and established scalable, secure production systems.
AI/ML Software Engineer at BitBean
November 1, 2022 - October 10, 2025
Developed fraud detection models with XGBoost and AutoML, reducing fraudulent transactions by 30%. Built recommendation systems with TensorFlow; implemented real-time object detection with PyTorch and OpenCV; deployed models on AWS SageMaker and Google Cloud ML Engine, optimizing inference times. Led development of edge AI models with TensorFlow Lite for real-time inference on mobile devices; integrated FAISS for vector search, boosting similarity matching in recommendations. Implemented data privacy and security measures and built computer vision models for medical imaging.
Software Engineer (Freelance) at Freelancing
August 1, 2020 - October 10, 2025
Developed data preprocessing pipelines with Pandas and NumPy; built REST APIs with Flask and FastAPI; implemented supervised learning models (logistic regression, SVM) for classification tasks. Designed SQL/NoSQL databases for ML data management; built evaluation dashboards with Matplotlib and Seaborn; performed dimensionality reduction with PCA and t-SNE; automated deployment scripts with Bash and Git; delivered end-to-end ML solutions across projects.
Senior AI/ML Engineer at SumatoSoft
December 1, 2022 - February 28, 2026
Led a team of engineers to develop fraud detection systems using XGBoost, achieving ~30% improvement in fraud prevention. Built personalized recommendation systems with TensorFlow, boosting user engagement by ~20%. Designed and deployed real-time object detection with YOLOv5/OpenCV for enhanced security automation. Enhanced NLP pipelines with Hugging Face and spaCy to improve chatbot performance and sentiment analysis. Deployed models on AWS SageMaker and Google Cloud ML Engine, reducing inference times by ~40%. Implemented Retrieval-Augmented Generation (RAG) for efficient document retrieval. Constructed real-time data processing pipelines with Kafka and Airflow. Orchestrated multi-agent AI workflows with CrewAI for operational automation. Tuned hyperparameters with Optuna/Ray Tune, improving accuracy by ~30%. Built scalable React dashboards and RESTful APIs (FastAPI/Node.js) for client-facing AI services. Implemented MLOps workflows via CI/CD (Jenkins, GitLab, AWS CodePipeline).
Software Engineer AI/ML at BitBean
August 1, 2019 - November 1, 2022
Developed fraud detection models using XGBoost, reducing fraudulent transactions by 30%. Built NLP chatbots with Hugging Face/Rasa and integrated them into customer service workflows. Implemented video surveillance object detection with PyTorch/OpenCV, enhancing security. Deployed AI models on AWS SageMaker and Google Cloud ML Engine, cutting inference time by ~25%. Applied TensorFlow Lite for edge AI to enable real-time inference on mobile devices. Integrated FAISS for real-time vector search, improving recommender accuracy. Built NLP pipelines and APIs (FastAPI) for real-time data processing. Streamlined ML pipelines with Apache Airflow, improving deployment speed. Implemented real-time speech-to-text via Google Cloud Speech APIs and created CV models for medical imaging.
Freelance AI/ML Developer at Freelancing
September 1, 2017 - August 1, 2019
Developed data preprocessing pipelines (Python, Pandas, NumPy) for feature extraction in ML models. Built REST APIs with Flask/FastAPI enabling real-time inference. Implemented supervised learning models (logistic regression, SVM) for classification tasks. Deployed models on AWS SageMaker and Google Cloud ML Engine, reducing inference times. Used FAISS for vector search and explored edge inference with TensorFlow Lite. Built NLP chatbots using Hugging Face and Rasa; created production-ready APIs and dashboards to visualize model performance.

Education

Bachelor of Science in Computer Science at NUST (National University of Sciences & Technology), Islamabad, Pakistan
January 11, 2030 - January 1, 2020
Bachelor's Degree in Computer Science at NUST
January 11, 2030 - January 1, 2019

Industry Experience

Software & Internet, Professional Services, Media & Entertainment, Computers & Electronics, Healthcare