I’m an AI Engineer passionate about building intelligent, scalable systems that bridge machine learning and real-world impact. I specialize in NLP, computer vision, and LLM-driven automation using Python, Hugging Face, LangChain, and cloud platforms like AWS and GCP. I’ve engineered projects like GestureCap (gesture-based screen control using OpenCV), VocalVision AI (multilingual image captioning app), and LLM-powered financial analysis tools that automate insights from complex reports. My focus is on delivering end-to-end AI solutions — from data pipelines and model deployment to intelligent automation — with measurable business outcomes. I bring analytical thinking, fast learning, and hands-on problem-solving to every project. Let’s transform your ideas into intelligent systems.

K.Surya Srinija

I’m an AI Engineer passionate about building intelligent, scalable systems that bridge machine learning and real-world impact. I specialize in NLP, computer vision, and LLM-driven automation using Python, Hugging Face, LangChain, and cloud platforms like AWS and GCP. I’ve engineered projects like GestureCap (gesture-based screen control using OpenCV), VocalVision AI (multilingual image captioning app), and LLM-powered financial analysis tools that automate insights from complex reports. My focus is on delivering end-to-end AI solutions — from data pipelines and model deployment to intelligent automation — with measurable business outcomes. I bring analytical thinking, fast learning, and hands-on problem-solving to every project. Let’s transform your ideas into intelligent systems.

Available to hire

I’m an AI Engineer passionate about building intelligent, scalable systems that bridge machine learning and real-world impact. I specialize in NLP, computer vision, and LLM-driven automation using Python, Hugging Face, LangChain, and cloud platforms like AWS and GCP.

I’ve engineered projects like GestureCap (gesture-based screen control using OpenCV), VocalVision AI (multilingual image captioning app), and LLM-powered financial analysis tools that automate insights from complex reports.

My focus is on delivering end-to-end AI solutions — from data pipelines and model deployment to intelligent automation — with measurable business outcomes.

I bring analytical thinking, fast learning, and hands-on problem-solving to every project. Let’s transform your ideas into intelligent systems.

See more

Language

English
Fluent

Work Experience

Artificial Intelligence Intern at Infosys-Springboard
January 1, 2025 - October 6, 2025
Applied LLMs (Azure OpenAI, LLaMA) for automated analysis of balance sheets and income statements, enabling extraction of key financial insights. Processed 1,000+ documents using LangChain and RegEx, boosting policy verification efficiency by 30%. Streamlined financial document processing with NLP, reducing review time and accelerating decision-making.
Gesture Capturing at Independent Project
May 1, 2025 - Present
Developed a gesture-capturing pipeline using OpenCV and MediaPipe; tracked hand gestures on a webcam at ~30 FPS, built a frame-level gesture tracker, and integrated OCR via Tesseract. Achieved ~90% OCR accuracy on captured regions and reduced manual interaction by 100% in a demonstration use case.
AI/ML Web App Developer at VocalVision AI
May 1, 2025 - October 6, 2025
Built a Flask-based web app that generates image captions and translates them into 10 languages with TTS support. Leveraged VT-GPT-2 for captioning, Marian MT for multilingual translation, and gTTS for speech output; delivered caption generation with ~1.2s latency. Included analysis of income statements and balance sheets with LLMs for automated insight generation.
WhatsApp Chat Analyzer at Independent Project
November 1, 2024 - October 6, 2025
Processed 10,000+ WhatsApp messages to extract daily/weekly activity patterns, sentiment shifts, and user statistics. Used Python, Pandas, NLTK, Scikit-Learn, RegEx, Matplotlib, Seaborn, Plotly, and Streamlit for preprocessing, modeling, and visualization; visualized results via a Streamlit dashboard with 10+ interactive plots.

Education

BE-CSE (AIML) at Chandigarh University
August 1, 2022 - October 6, 2025
PCM (GPA: 8.1) at Rajiv Gandhi University of Knowledge and Technologies
January 1, 2020 - January 1, 2022
GPA 10.0 at Sri Chaitanya Techno Schools
July 1, 2019 - July 1, 2020

Qualifications

Data Science Virtual Internship
January 11, 2030 - October 6, 2025
Introduction to Artificial Intelligence
January 11, 2030 - October 6, 2025
Python for Data Science, AI Development
January 11, 2030 - October 6, 2025
Data Visualization with Cognos Dashboard
January 11, 2030 - October 6, 2025
AI-ML engineer -Infosys
October 25, 2024 - January 1, 2025
AI- ML engineer - genesys international coopoeration ltd
July 1, 2025 - December 30, 2025

Industry Experience

Software & Internet, Healthcare, Life Sciences, Professional Services, Education, Gaming
    paper Bing-GPT Voice Assistant (Research Paper)

    As the Research Author & AI Engineer, I published a paper on a GPT-based voice assistant integrating speech recognition, dialogue management, and NLP to enable real-time, context-aware conversational interaction. The assistant combines speech-to-text and LLMs to simulate natural human communication.

    paper WhatsApp Chat Analyzer using ML & NLP

    As the Data Analyst & NLP Engineer, I built an analytics dashboard that processes 10,000+ WhatsApp messages to uncover user activity, sentiment, and communication trends. Implemented with Python, Pandas, NLTK, Scikit-learn, Matplotlib, and Streamlit, it visualizes message density, emoji usage, and chat patterns through 10+ interactive plots.

    paper Analysing Income Statement & Balance Sheet Table with LLM

    As the Data Analyst & NLP Engineer, I built an analytics dashboard that processes 10,000+ WhatsApp messages to uncover user activity, sentiment, and communication trends. Implemented with Python, Pandas, NLTK, Scikit-learn, Matplotlib, and Streamlit, it visualizes message density, emoji usage, and chat patterns through 10+ interactive plots.

    paper VocalVision AI — Multilingual Image Captioning System

    As the AI Engineer, I developed a Flask-based web application that generates image captions and translates them into 10+ international languages with text-to-speech (TTS) support. Using ViT-GPT2, MarianMT, and gTTS, the system delivers captions and audio responses with an average 1.2s latency, enhancing accessibility and multilingual communication.

    paper GestureCap — AI-Powered Gesture Control using OpenCV

    As the AI Developer, I built a real-time gesture-controlled screen capture system using Python, OpenCV, MediaPipe, PyAutoGUI, and Tesseract OCR. The model tracks hand gestures at ~30 FPS and extracts on-screen text with ~90% OCR accuracy, eliminating manual inputs and enabling full automation of on-screen interactions.