I’m an AI & data architecture leader with 12+ years of experience building large-scale distributed systems, predictive modeling frameworks, and enterprise-grade lakehouse architectures. I’ve led end-to-end data science lifecycles, designed multi-agent orchestration and autonomous negotiation engines, and shipped infrastructure that scales for high-concurrency workloads. I thrive on turning complex problems into well-structured, auditable plans and delivering high-impact, data-driven outcomes in fast-paced environments. Across roles at startups and enterprise teams, I’ve designed robust data lakes, migrated legacy analytics to modern cloud ecosystems, and built NLP/LLM-driven solutions for real-world applications. I enjoy mentoring engineers, driving architectural consistency, and collaborating with cross-functional stakeholders to translate business goals into actionable data and AI strategies.

Abhishek Jha

I’m an AI & data architecture leader with 12+ years of experience building large-scale distributed systems, predictive modeling frameworks, and enterprise-grade lakehouse architectures. I’ve led end-to-end data science lifecycles, designed multi-agent orchestration and autonomous negotiation engines, and shipped infrastructure that scales for high-concurrency workloads. I thrive on turning complex problems into well-structured, auditable plans and delivering high-impact, data-driven outcomes in fast-paced environments. Across roles at startups and enterprise teams, I’ve designed robust data lakes, migrated legacy analytics to modern cloud ecosystems, and built NLP/LLM-driven solutions for real-world applications. I enjoy mentoring engineers, driving architectural consistency, and collaborating with cross-functional stakeholders to translate business goals into actionable data and AI strategies.

Available to hire

I’m an AI & data architecture leader with 12+ years of experience building large-scale distributed systems, predictive modeling frameworks, and enterprise-grade lakehouse architectures. I’ve led end-to-end data science lifecycles, designed multi-agent orchestration and autonomous negotiation engines, and shipped infrastructure that scales for high-concurrency workloads. I thrive on turning complex problems into well-structured, auditable plans and delivering high-impact, data-driven outcomes in fast-paced environments.

Across roles at startups and enterprise teams, I’ve designed robust data lakes, migrated legacy analytics to modern cloud ecosystems, and built NLP/LLM-driven solutions for real-world applications. I enjoy mentoring engineers, driving architectural consistency, and collaborating with cross-functional stakeholders to translate business goals into actionable data and AI strategies.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
See more

Language

English
Fluent

Work Experience

Senior Software Engineer II - Data & AI Architect at Spark Plug
April 1, 2024 - April 1, 2026
Engineered an autonomous Multi-Agent Negotiation Engine leveraging LangGraph, enabling a sophisticated, stateful multi-agent system with cyclical reasoning. Implemented structured tool-calling via PyDantic-driven schemas and a contracting-data protocol to enforce strict data governance for autonomous negotiation. The architecture supported dynamic planning and re-planning to decompose complex goals into subtasks, aided by episodic and semantic memory for recalling historical negotiation outcomes. Incorporated self-reflection and correction loops to audit proposed strategies against contract constraints, synthesizing multi-modal signals (POS, contracts, market trends) into machine-readable payoffs with high precision. Designed an Enterprise Lakehouse on Databricks/S3, migrating retail analytics from legacy Postgres to Snowflake/Databricks ecosystem, and led infrastructure optimization including Redis caching and Postgres provisioning to reduce AWS operational costs while increasing thro
Founding ML Engineer at Iris Agent
December 1, 2020 - September 1, 2022
Built an intelligent search architecture with semi-supervised labeling to accelerate productization. Developed semantic search and few-shot learning models to detect cross-platform similarities between Jira issues and support tickets. Implemented tagging systems using Sentence Transformers and cosine similarity to automate unlabeled ticket classification. Led early ML product development, connecting research outcomes to production-grade features and scalable experimentation pipelines.
Data Scientist II at GAIA Ericsson
March 1, 2019 - December 1, 2020
Designed unsupervised clustering systems for Root Cause Analysis using t-SNE and Deep Embedded Clustering (autoencoders) to identify anomalies in TAC-timestamp data. Architected an intelligent Question-Answering layer using BERT and Elasticsearch for enterprise-grade corpus indexing. Delivered end-to-end data science solutions with a focus on interpretability and operationalization.
Senior Technical Associate at Sears India
February 1, 2016 - March 1, 2019
Led the cloud migration strategy, moving 150 TB of data from Teradata to Google BigQuery using PySpark, Hive, and Sqoop. Built a predictive propensity modeling pipeline on GCP/Hadoop to forecast purchasing propensity across 81 business units using LightGBM and LSTM-based models, enabling targeted marketing and improved ROI.

Education

Dual Degree (B.Tech & M.Tech) in Industrial Engineering & Management at I.I.T. Kharagpur
January 11, 2030 - May 3, 2026
Class XII at CBSE
January 11, 2030 - May 3, 2026
Class X at CBSE
January 11, 2030 - May 3, 2026

Qualifications

Add your qualifications or awards here.

Industry Experience

Software & Internet, Professional Services, Manufacturing, Retail