I'm a data scientist and Generative AI engineer with 4+ years of experience in data modeling, statistical analysis, machine learning, and computer vision. I design predictive models and optimize deep learning architectures using PyTorch and TensorFlow, translating complex data into actionable product and clinical insights. I routinely work with SQL, Python, and R to conduct end-to-end statistical studies that inform research and product decisions. I enjoy collaborating with product, engineering, and clinical research teams to evaluate performance in retrospective studies and validate product improvements. I am proficient in cloud platforms (AWS), edge inference optimization, and building scalable analytics pipelines and dashboards that drive business impact. I thrive on hypothesis testing, experimental design, and delivering actionable, data-driven results.

Shyamsundhar Yathirajam

I'm a data scientist and Generative AI engineer with 4+ years of experience in data modeling, statistical analysis, machine learning, and computer vision. I design predictive models and optimize deep learning architectures using PyTorch and TensorFlow, translating complex data into actionable product and clinical insights. I routinely work with SQL, Python, and R to conduct end-to-end statistical studies that inform research and product decisions. I enjoy collaborating with product, engineering, and clinical research teams to evaluate performance in retrospective studies and validate product improvements. I am proficient in cloud platforms (AWS), edge inference optimization, and building scalable analytics pipelines and dashboards that drive business impact. I thrive on hypothesis testing, experimental design, and delivering actionable, data-driven results.

Available to hire

I’m a data scientist and Generative AI engineer with 4+ years of experience in data modeling, statistical analysis, machine learning, and computer vision. I design predictive models and optimize deep learning architectures using PyTorch and TensorFlow, translating complex data into actionable product and clinical insights. I routinely work with SQL, Python, and R to conduct end-to-end statistical studies that inform research and product decisions.

I enjoy collaborating with product, engineering, and clinical research teams to evaluate performance in retrospective studies and validate product improvements. I am proficient in cloud platforms (AWS), edge inference optimization, and building scalable analytics pipelines and dashboards that drive business impact. I thrive on hypothesis testing, experimental design, and delivering actionable, data-driven results.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
See more

Language

English
Fluent

Work Experience

Data Scientist at Intuit, USA
January 1, 2015 - Present
Designed and deployed intelligent recommendation systems leveraging Large Language Models to enhance product suggestions and personalized financial insights across Intuit's platforms, improving user engagement and discoverability. Built a hybrid recommendation engine combining collaborative filtering and content-based techniques (ALS, SVD), achieving a 15% increase in user conversion and 20% improvement in content discoverability. Implemented on-device performance optimizations and containerized ML apps with Docker and Kubernetes; developed real-time analytics dashboards in Tableau; led ETL optimization with Spark/Hadoop/Hive; built NLP sentiment analysis pipelines with spaCy/NLTK; deployed scalable ML pipelines on AWS (S3, DynamoDB, Lambda, EC2, SageMaker).
Research Data Scientist at California State University, USA
June 1, 2023 - December 1, 2024
Improved data processing speeds by 55x with hyperdimensional computing on CPU/GPU; achieved 98% CNN performance gains by benchmarking and optimizing Python implementations. Analyzed wind farm oscillation data using Fourier transforms for feature extraction. Developed a healthcare-focused conversational AI chatbot using Rasa; built end-to-end ML pipelines for reproducible experimentation; published 6 peer-reviewed papers; conducted A/B testing to evaluate model performance; leveraged GCP BigQuery and Cloud Storage for large-scale data management; engineered hardware-in-the-loop workflows for deploying ML models to edge devices (NVIDIA Jetson) for real-time inference.
Data Scientist at Jade Global Software Pvt. Ltd, India
January 1, 2020 - July 1, 2022
Built time-series forecasting models using Prophet and XGBoost with feature engineering that increased forecast accuracy by 32% and reduced stockouts by 18%; designed and deployed end-to-end ETL pipelines using Apache Airflow and Apache Spark, boosting data processing efficiency by 35%; developed ML-based customer segmentation improving marketing targeting by 43%; led A/B testing with robust statistical methods, resulting in a 25% increase in user engagement and a 15% reduction in bounce rates; performed in-depth EDA to uncover user behavior patterns and business drivers.

Education

Master's in Computer Science at California State University San Marcos
January 11, 2030 - December 15, 2025
Bachelor's in Computer Science at Jawaharlal Nehru Technological University Hyderabad
January 11, 2030 - December 15, 2025
Diploma in Computer Science at Osmania University
January 11, 2030 - December 15, 2025

Qualifications

Add your qualifications or awards here.

Industry Experience

Healthcare, Software & Internet, Financial Services, Life Sciences, Professional Services