Jaspinder Singh

Available to hire

I’m Jaspinder Singh, currently an AI Engineer Intern at BMW Group in Munich. I design and deploy advanced AI solutions for automotive systems, focusing on large language models (LLMs), vision-language models (VLMs), and cloud-based architectures. My work includes building in-car conversational agents, integrating AI with embedded systems, and delivering scalable REST APIs using FastAPI, Docker, and AWS. I actively collaborate with cross-functional teams to translate research into production-ready solutions.

Previously at VisLab (an Ambarella Inc. company), I contributed to deep learning and multimodal perception for autonomous driving, including pedestrian intention forecasting and distributed model training. I enjoy applying state-of-the-art AI to real-world challenges and collaborating across disciplines to drive impactful results.

Experience Level

Expert

Work Experience

AI Engineer Intern at BMW Group
April 1, 2025 - Present
Designed and fine-tuned Large Language Models (LLMs) for in-car user experiences, applying RLHF post-training techniques and prompt engineering for conversational agents with emphasis on safety and personalization. Built agentic architectures enabling context-aware reasoning in embedded automotive environments, and developed scalable REST APIs using FastAPI and Docker, integrating solutions with AWS and Azure cloud services.
Computer Vision Engineer Intern at VisLab (an Ambarella Inc. company)
April 1, 2025 - October 19, 2025
Designed and implemented deep learning models (Conv-DNN, LSTM, and Vision-Language Models) to forecast pedestrian behavior in real time for autonomous driving. Integrated spatial visual data with semantic context via VLMs to improve prediction and decision-making. Supported distributed training and performance profiling with DeepSpeed, DDP, and NVIDIA Nsight. Optimized models and accelerated inference with quantization (INT8/INT4), KV caching, activation checkpointing, and pruning for resource-constrained automotive deployment.

Education

Master of Science in Artificial Intelligence at University of Bologna
September 1, 2023 - October 1, 2025
Bachelor of Engineering in Automation at University of Bologna
September 1, 2020 - October 1, 2023
Technical Diploma at Technical Institute Lonato
September 1, 2015 - June 1, 2020

Industry Experience

Software & Internet, Transportation & Logistics, Computers & Electronics, Consumer Goods, Telecommunications, Manufacturing

Projects

    AI Agent Chatbot for Company Briefings

    ◦ Developed a conversational agent for company briefings and corporate communications, built with the LangChain framework.
    ◦ Implemented security measures against prompt injection and data leakage to support safe deployment in enterprise environments.
    ◦ Tested each stage of the AI pipeline, using LangGraph for workflow orchestration and LangSmith for monitoring and evaluating model performance.

    Optimizing Retrieval-Augmented Generation

    Explored optimization strategies for Retrieval-Augmented Generation (RAG) by comparing Late Chunking and Early Chunking approaches. Evaluated the impact of static and dynamic segmentation on retrieval and generation performance, highlighting trade-offs in efficiency, context preservation, and scalability. Derived guidelines for designing adaptive chunking strategies for RAG workflows.
    Skills: Retrieval-Augmented Generation (RAG) · LLMs · Pattern Recognition · PyTorch
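
To illustrate the static versus dynamic segmentation trade-off mentioned in this project, here is a minimal sketch. It is not the project's actual code: the function names `static_chunks` and `dynamic_chunks`, the character-based sizes, and the regex sentence splitter are all assumptions chosen for brevity. Static segmentation cuts fixed-size overlapping windows regardless of content, while dynamic segmentation packs whole sentences up to a size budget.

```python
import re


def static_chunks(text: str, size: int = 80, overlap: int = 20) -> list[str]:
    """Static segmentation: fixed-size character windows with overlap."""
    if size <= overlap:
        raise ValueError("size must be larger than overlap")
    step = size - overlap
    return [text[i:i + size] for i in range(0, len(text), step)]


def dynamic_chunks(text: str, max_chars: int = 80) -> list[str]:
    """Dynamic segmentation: greedily pack whole sentences up to max_chars."""
    # Naive sentence split on ., ! or ? followed by whitespace (an assumption;
    # a real pipeline would likely use a proper sentence tokenizer).
    sentences = [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]
    chunks: list[str] = []
    current = ""
    for sentence in sentences:
        candidate = f"{current} {sentence}".strip()
        if current and len(candidate) > max_chars:
            # Adding this sentence would exceed the budget: close the chunk.
            chunks.append(current)
            current = sentence
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks


if __name__ == "__main__":
    sample = "Alpha beta. Gamma delta epsilon. Zeta."
    print(static_chunks(sample, size=16, overlap=4))
    print(dynamic_chunks(sample, max_chars=20))
```

Static windows are cheap and predictable but can split a sentence mid-thought; sentence-aware packing keeps units intact at the cost of uneven chunk sizes.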

    Pedestrian Intention Forecasting using DNN and Vision-Language Models

    This project uses Convolutional Deep Neural Networks (Conv-DNN) and Vision-Language Models (VLMs) to predict pedestrian behavior in real time. The model integrates spatial visual data and semantic context to anticipate pedestrians' future movements, improving the safety and responsiveness of autonomous driving systems. By analyzing environmental cues and pedestrian dynamics, it contributes to safer navigation in complex urban environments.
    Skills: Autonomous Driving Systems · Deep Neural Networks · Computer Vision · Vision-Language Models · Pattern Recognition · PyTorch

    Vision Anomaly Detection

    This project focuses on developing computer vision systems capable of detecting anomalies in images or video data. It involves training models to identify patterns that deviate from normal visual behavior, which is crucial in applications such as industrial inspection, medical imaging, and surveillance. The project explores unsupervised, semi-supervised, and deep learning-based approaches to accurately detect rare and subtle anomalies, even with limited labeled data.
    Skills: Convolutional Neural Networks (CNN) · Deep Learning · Computer Vision · VLMs · Pattern Recognition
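
As a rough illustration of the unsupervised idea described in this project (learn what "normal" looks like, then flag deviations), the sketch below scores a feature vector by its largest per-feature z-score against statistics fitted on normal samples only. This is a simple statistical stand-in, not the deep learning approach used in the project. The names `fit_normal_profile` and `anomaly_score` and the toy feature vectors are hypothetical; in practice the features might come from a CNN embedding of each image.

```python
from statistics import mean, pstdev


def fit_normal_profile(samples: list[list[float]]) -> tuple[list[float], list[float]]:
    """Estimate per-feature mean and standard deviation from normal samples only."""
    columns = list(zip(*samples))
    means = [mean(col) for col in columns]
    # Fall back to 1.0 for constant features to avoid division by zero.
    stds = [pstdev(col) or 1.0 for col in columns]
    return means, stds


def anomaly_score(x: list[float], profile: tuple[list[float], list[float]]) -> float:
    """Largest absolute z-score across features; higher means more anomalous."""
    means, stds = profile
    return max(abs(v - m) / s for v, m, s in zip(x, means, stds))


if __name__ == "__main__":
    # Toy "normal" feature vectors (hypothetical values).
    normal = [[1.0, 10.0], [2.0, 10.0], [3.0, 10.0]]
    profile = fit_normal_profile(normal)
    print(anomaly_score([2.0, 10.0], profile))  # typical sample
    print(anomaly_score([2.0, 15.0], profile))  # deviates on the second feature
```

A threshold on the score then separates normal from anomalous inputs; choosing it typically requires a small validation set, which matches the limited-label setting the project describes.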

    LLMs Reasoning and Planning with HPC Leonardo @ Cineca

    This project explores how Large Language Models (LLMs) can be enhanced to perform complex reasoning and planning tasks. It focuses on evaluating and improving the model’s ability to break down multi-step problems, make logical inferences, and generate coherent action sequences toward specific goals. The project integrates techniques such as chain-of-thought prompting, tool use, memory management, and modular architectures to simulate cognitive-like decision-making in LLMs.
    Project done at Cineca using the supercomputer Leonardo,