Languages: Python, C++, R, SQL, Bash | Data Tools: Spark, Hadoop, Pyspark, Cassandra, Databricks Python ML Ecosystem: pandas, numpy, keras, opencv, scikit-learn, pytorch, tensorflow Platform/Cloud: Databricks, GCP: Google Certified Associate Cloud Engineer, AWS, Kubernetes, Linux

Ajay Madhusudhan Thumala

Languages: Python, C++, R, SQL, Bash | Data Tools: Spark, Hadoop, Pyspark, Cassandra, Databricks Python ML Ecosystem: pandas, numpy, keras, opencv, scikit-learn, pytorch, tensorflow Platform/Cloud: Databricks, GCP: Google Certified Associate Cloud Engineer, AWS, Kubernetes, Linux

Available to hire

Languages: Python, C++, R, SQL, Bash | Data Tools: Spark, Hadoop, Pyspark, Cassandra, Databricks
Python ML Ecosystem: pandas, numpy, keras, opencv, scikit-learn, pytorch, tensorflow
Platform/Cloud: Databricks, GCP: Google Certified Associate Cloud Engineer, AWS, Kubernetes, Linux

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Intermediate

Language

Afar
Advanced
Bashkir
Advanced

Work Experience

Software Engineer (Capstone Partner) at SAP, Irvine, CA
January 1, 2025 - Present
Partnering with SAP to improve its open-source Cloud Application Programming (CAP) model used for enterprise-level application development, aiding over 10k developers.
Data Engineer at Quantiphi Inc
July 1, 2023 - June 1, 2024
Deployed a business insights application with 70% performance increase and 50%+ reduction in compute/storage costs using scalable pipelines. Engineered scalable ETL and ML pipelines (Python, PySpark, AWS) to power data-driven insights. Onboarded data sets (30M+ records) from new data partners, defining and implementing data ingestion pipelines (Python, Pandas, NumPy) and created an API for real-time translation of user queries to SQL. Optimized data processing to achieve sub-2-second response times and report generation within 10 minutes (10M users, 3 data partners, 500+ attributes). Optimized EMR batch jobs for high resource utilization, ensuring max report generation time of 17 minutes for 50 concurrent users.
Intern, Data Engineer at Quantiphi Inc
January 1, 2023 - July 1, 2023
Developed a GenAI tool prototype to generate effective git commit messages trained on historical codebase commits. Won 3rd place in internal hackathon. Implemented database schema, integration and deployment of GCP cloud resources for a serverless microservice architecture, resulting in a 35% cost reduction and 10x increase in traffic capacity.
Product Development Intern at Ford Motor Company, India
June 1, 2022 - August 1, 2022
Designed and developed a modern cloud-based architecture prototype for Ford’s legacy technical assistance software (TF-OAMS) used in 10,000 dealerships worldwide. Developed a prioritization plan, analyzing user behavioral patterns across dealership sites including spatial-temporal traffic volumes and usage frequency by assistance request type.
Project: Deceptive Review Detection
January 1, 2023 - March 1, 2023
Developed a deceptive review detection system using an LSTM model on Amazon reviews dataset, achieving 73% accuracy.

Education

Master of Science in Computer Science at University of California, Irvine
January 11, 2030 - December 1, 2025
Bachelor of Technology in Computer Science & Engineering at SRM Institute of Science and Technology (SRMIST), Chennai, India
January 11, 2030 - May 1, 2023

Qualifications

Google Certified Associate Cloud Engineer
January 11, 2030 - December 19, 2025
AWS Certified Practitioner
January 11, 2030 - December 19, 2025
Kubernetes Certification
January 11, 2030 - December 19, 2025

Industry Experience

Software & Internet, Education, Manufacturing, Professional Services, Other