I'm Surya Prakash Reddy Mallangi, a Generative AI Engineer and Data Science professional with 4+ years of experience delivering NLP, ML, and LLM solutions across the UK and India. I have led generative AI platforms and retrieval-augmented systems that cut manual review effort by 50% and reduced candidate feedback turnaround from 1 hour to 5-10 minutes. I'm NVIDIA Certified in Generative AI and LLMs, with hands-on experience in fine-tuning, evaluation, and secure cloud deployments on AWS and Azure. I focus on turning AI research into production-grade, business-aligned products for recruitment, marketing, and analytics. I enjoy collaborating with cross-functional teams and translating complex AI capabilities into tangible outcomes that drive efficiency and impact.

Surya Prakash Reddy Mallangi

I'm Surya Prakash Reddy Mallangi, a Generative AI Engineer and Data Science professional with 4+ years of experience delivering NLP, ML, and LLM solutions across the UK and India. I have led generative AI platforms and retrieval-augmented systems that cut manual review effort by 50% and reduced candidate feedback turnaround from 1 hour to 5-10 minutes. I'm NVIDIA Certified in Generative AI and LLMs, with hands-on experience in fine-tuning, evaluation, and secure cloud deployments on AWS and Azure. I focus on turning AI research into production-grade, business-aligned products for recruitment, marketing, and analytics. I enjoy collaborating with cross-functional teams and translating complex AI capabilities into tangible outcomes that drive efficiency and impact.

Available to hire

I’m Surya Prakash Reddy Mallangi, a Generative AI Engineer and Data Science professional with 4+ years of experience delivering NLP, ML, and LLM solutions across the UK and India. I have led generative AI platforms and retrieval-augmented systems that cut manual review effort by 50% and reduced candidate feedback turnaround from 1 hour to 5-10 minutes. I’m NVIDIA Certified in Generative AI and LLMs, with hands-on experience in fine-tuning, evaluation, and secure cloud deployments on AWS and Azure.

I focus on turning AI research into production-grade, business-aligned products for recruitment, marketing, and analytics. I enjoy collaborating with cross-functional teams and translating complex AI capabilities into tangible outcomes that drive efficiency and impact.

See more

Language

English
Fluent

Work Experience

Generative AI Engineer at Webosaurus
October 1, 2025 - October 1, 2025
Designed and implemented an end-to-end Generative AI proof of concept that integrates real-time campaign data with LLM-driven marketing analytics using LangChain, OpenRouter APIs, and FAISS to enable data-driven decision making. Engineered Retrieval-Augmented Generation (RAG) pipelines with open-source embedding models (all-minilm) and semantic search to produce actionable campaign insights and recommendations. Built an asynchronous data ingestion pipeline using httpx/asyncio and a Flask API layer to serve dynamic insights to front-end apps, optimizing data processing and user experience. Implemented a clean, modular 3-Tier architecture (data, service, presentation) with OOP principles to improve maintainability and scalability. Collaborated with CEO and other stakeholders to translate AI capabilities into business outcomes and measurable performance improvements.
Data Science Engineer (Intern) at Spring Software Solutions
August 1, 2024 - June 1, 2025
Led a team of 4 to build a Generative AI-powered recruitment automation platform using Python, FastAPI, GitHub, and Linode, aligning engineering deliverables with business goals and reducing candidate screening time. Designed a scalable speech-to-text pipeline using Google Speech-to-Text and FFMPEG to convert raw audio from APIs into standardized files, storing transcribed speech in analytics-ready MySQL tables, improving throughput and reducing storage footprint by ~10%. Built a BERT (MiniLM) based candidate interview scoring and ranking model that ingests transcriptions and uses cosine similarity on embeddings to cut manual review time from ~20 minutes to ~5 minutes, boosting recruitment efficiency by ~15%. Integrated encoder-decoder and decoder-only LLMs to summarize and generate feedback for interviews and analyze scores, achieving an F1 score of 0.85 and triggering SMTP-based email feedback within 45-60 seconds after submission to deliver near real-time candidate updates.
AI Engineer (Industrial project) at Fargro
May 1, 2024 - September 1, 2024
Designed a Generative AI-driven data engineering framework using Aranet T/RH IP67 sensors for real-time greenhouse climate monitoring, optimising sensor placement and improving efficiency by 35%. Built 4D environmental data pipelines using FFT, Shannon-Nyquist sampling, POD, QR Sparse Sensing, with neural network-based anomaly detection in PyTorch; validated with k-fold cross-validation and SVM benchmarks to improve data quality. Deployed a web tool with interactive 3D/4D Power BI heatmaps for automated sensor placement, cutting manual effort by 70%, and delivered scalable AI pipelines (PyTorch, diffusion models) achieving 40% cost reduction with 87%+ predictive accuracy for precision agriculture.
Junior Software Engineer at Softura Private Ltd
April 1, 2021 - May 1, 2023
Built NLP applications in Python using FastAPI, Snips-NLU, and spaCy for automated language understanding and more accurate text analysis, integrating with Microsoft Bot Framework in Visual Studio to reduce average user interaction time to ~45 seconds per query. Engineered end-to-end data pipelines with Django, Boto3, PostgreSQL, fuzzy matching, and Azure Blob Storage, improving data quality, retrieval accuracy, and system reliability for analytics and modeling. Developed and deployed full-stack services using Node.js, React.js, AWS ECS, DynamoDB, and MDM API integrations, with optimised API endpoints that cut UX response time to ~30 seconds and increased user interactivity by ~75%.

Education

Master of Science, Data Science at University of Roehampton
September 1, 2023 - September 1, 2024
Bachelor of Technology, Computer Science at Vel-Tech Rangarajan and Dr Sagunthala R&D Institute
January 1, 2017 - January 1, 2021

Qualifications

NVIDIA Certified Associate – Generative AI and Large Language Models (LLMs)
January 1, 2025 - December 31, 2027
AWS Certified Cloud Practitioner
January 1, 2022 - December 31, 2025
LangChain & Vector Databases in Production – Activity Loop
January 1, 2025 - December 31, 2025
MATLAB Onramp – MathWorks
January 1, 2024 - January 31, 2024

Industry Experience

Software & Internet, Professional Services, Education, Media & Entertainment