Available to hire
With a decade of experience in Machine Learning, I specialize in Python, R, and Java, delivering advanced AI and ML solutions across industries like healthcare, finance, and telecom. I enjoy building scalable, efficient machine learning pipelines and deploying models with modern technologies like Docker, Kubernetes, and cloud platforms.
I have hands-on experience with deep learning frameworks such as TensorFlow and PyTorch, and I’m passionate about leveraging Generative AI, Large Language Models, and Retrieval-Augmented Generation to create impactful, intelligent systems that improve customer engagement and business outcomes.
Skills
Experience Level
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Intermediate
Intermediate
Language
English
Fluent
Afar
Advanced
Javanese
Advanced
Work Experience
Lead Data Scientist at Remotebase
February 1, 2022 - PresentLed a team of data scientists and engineers to develop and deploy advanced Generative AI, Large Language Models (LLM), and Retrieval-Augmented Generation (RAG) solutions for clients in the US and UK. Designed scalable AI solutions such as Executive Insights Chatbot and Multi-Agent Customer Support Bot using GPT-4, LangChain, LangGraph, and vector databases. Achieved a 50% reduction in query resolution times and saved 20+ hours per week for executives. Built memory-enabled chatbots for real estate with LLaMA 3.2, LangSmith, and Azure Kubernetes Service, improving customer engagement by 40%. Implemented end-to-end MLOps pipelines on GCP and Azure to ensure scalability and maintainability. Enhanced decision-making with predictive and transformer-based models across industries.
Senior Data Scientist at Minute7
January 31, 2022 - July 18, 2025Collaborated with clients to understand requirements and delivered production-ready ML pipelines using Python and GCP Vertex AI. Led data science teams to meet tight deadlines. Conducted research to develop graph-based recommendation and churn prediction models with neural graph architectures, boosting customer engagement. Deployed a hierarchical frequently bought together model based on 4 million transactions, increasing basket size by 8%. Created machine learning pipelines for logistics including data ingestion from BigQuery, time-series construction, and model deployment on Vertex AI.
Data Scientist at HobbyDB
April 30, 2019 - July 18, 2025Managed a team of four to implement an events and dynamic topics detection pipeline on live tweets using NLP. Integrated Twitter Live Tweets API for event extraction and clustering by named entities. Detected bursting events and audience interest with 85% accuracy on entity detection and 82% accuracy on DynamicLDA for latent topic detection.
Lead Data Scientist at 47Billion
February 1, 2022 - PresentLed the development and deployment of advanced AI/ML solutions, including Generative AI, Large Language Models (LLMs), and Retrieval-Augmented Generation (RAG) systems for enterprise clients in healthcare, finance, and telecom. Designed scalable AI architectures such as Executive Analytics Chatbots and Multi-Agent Support Bots using GPT-4, LangChain, LangGraph, and vector databases like Pinecone and FAISS. Achieved improvements in query handling efficiency by 40-60% and saved 20+ hours/week for executive stakeholders through automation. Built persistent conversational agents using LLaMA 3.2 deployed via Azure Kubernetes Service. Implemented end-to-end MLOps pipelines on Google Cloud Platform and Microsoft Azure to automate training, deployment, and monitoring of ML models with cost-effective scalability. Developed transformer-based NLP and predictive analytics workflows that generated actionable insights for clients. Worked closely with cross-functional teams in Agile environments to e
Senior Data Scientist at Minute7
January 31, 2022 - July 31, 2025Led the development and deployment of advanced AI/ML solutions including Generative AI, Large Language Models (LLMs), and Retrieval-Augmented Generation (RAG) systems for enterprise clients in healthcare, finance, and telecom. Designed scalable AI architectures delivering solutions such as Executive Analytics Chatbots and Multi-Agent Support Bots using GPT-4, LangChain, LangGraph, and vector databases like Pinecone and FAISS. Delivered measurable outcomes including 40-60% improvement in query handling efficiency and significant time savings for executive stakeholders. Built persistent conversational agents using LLaMA 3.2, LangSmith, and deployed using Azure Kubernetes Service. Implemented end-to-end MLOps pipelines on Google Cloud Platform and Microsoft Azure enabling automated training, deployment, and monitoring of ML models. Developed transformer-based NLP models and predictive analytics workflows to support faster decision-making across client operations. Collaborated with data sc
Data Scientist at HobbyDB
April 30, 2019 - July 31, 2025Managed a team of four engineers to build a real-time event and dynamic topic detection pipeline on live tweets using NLP techniques across multiple moving time windows. Integrated Twitter Live Tweets API to stream and analyze real-time data, extracting named entities to identify and cluster similar events. Applied burst detection algorithms to surface trending events and measure audience engagement. Achieved 85% accuracy in the entity detection model for identifying bursting events. Utilized Dynamic LDA to uncover latent topics within each event, achieving 82% accuracy in topic classification.
Lead Data Scientist at 47Billion
February 1, 2022 - PresentLed the development and deployment of advanced AI/ML solutions, including Generative AI, Large Language Models (LLMs), and Retrieval-Augmented Generation (RAG) systems for enterprise clients in healthcare, finance, and telecom. Designed scalable AI architectures and delivered solutions like Executive Analytics Chatbots and Multi-Agent Support Bots using GPT-4, LangChain, LangGraph, and vector databases such as Pinecone and FAISS. Achieved a 40-60% improvement in query handling efficiency and saved 20+ hours/week for executive stakeholders through automation. Built persistent, memory-enabled conversational agents using LLaMA 3.2, LangSmith, deployed via Azure Kubernetes Service (AKS). Implemented end-to-end MLOps pipelines on GCP and Microsoft Azure, enabling automated training, deployment, and monitoring of ML models with cost-effective scalability. Developed transformer-based NLP models and predictive analytics workflows to generate actionable insights. Collaborated with cross-functio
Senior Data Scientist at Minute7
January 31, 2022 - July 31, 2025Led the development and deployment of advanced AI/ML solutions, including Generative AI, Large Language Models (LLMs), and Retrieval-Augmented Generation (RAG) systems for enterprise clients in healthcare, finance, and telecom. Designed scalable AI architectures and delivered solutions like Executive Analytics Chatbots and Multi-Agent Support Bots using GPT-4, LangChain, LangGraph, and vector databases such as Pinecone and FAISS. Achieved measurable outcomes such as a 40–60% improvement in query handling efficiency and 20+ hours/week time savings for executive stakeholders. Built memory-enabled conversational agents using LLaMA 3.2, LangSmith, and deployed via Azure Kubernetes Service (AKS). Implemented MLOps pipelines on Google Cloud Platform (GCP) and Microsoft Azure with automated training, deployment, and monitoring. Developed transformer-based NLP models and predictive analytics workflows. Worked closely with cross-functional teams including data scientists, designers, and DevOp
Data Scientist at HobbyDB
April 30, 2019 - July 31, 2025Managed a team of four engineers to build a real-time event and dynamic topic detection pipeline on live tweets using NLP techniques with multiple moving time windows. Integrated Twitter Live Tweets API to stream and analyze real-time data, extracting named entities to identify and cluster similar events. Applied burst detection algorithms within defined time windows to surface trending events and measure audience engagement. Achieved 85% accuracy in the entity detection model for identifying bursting events. Utilized Dynamic LDA to uncover latent topics within each event, reaching 82% accuracy in topic classification.
Education
Qualifications
Google professional Machine Learning engineer
January 1, 2023 - December 31, 2023Azure Data Science Associate
January 1, 2023 - December 31, 2023Google Professional Data Engineer
January 1, 2023 - December 31, 2023Google Data Analytics Professional
January 1, 2023 - December 31, 2023GA Individual Qualification Certified
January 1, 2023 - December 31, 2023MS Azure AI Fundamentals
January 1, 2023 - December 31, 2023L2 Advanced Proficiency in KNIME Certified
January 1, 2023 - December 31, 2023Industry Experience
Software & Internet, Financial Services, Real Estate & Construction, Retail, Transportation & Logistics, Healthcare, Telecommunications
Skills
Experience Level
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Intermediate
Intermediate
Hire a Data Scientist
We have the best data scientist experts on Twine. Hire a data scientist in New York today.