Hi, I'm Yosuke Kuroki, an AI Engineer with a strong background in machine learning, natural language processing, and computer vision. I have experience building scalable, production-ready AI systems and enjoy turning innovative ideas into real-world products. Whether it's prototyping from scratch or deploying at scale, I thrive in fast-paced environments that value creativity and efficiency. I specialize in NLP, OCR, and cloud-based AI solutions, and love collaborating with cross-functional teams to bring impactful technologies to market quickly. Outside of work, I'm always eager to explore the latest advancements in AI and contribute to open-source projects that push the boundaries of what's possible.

Hi, I'm Yosuke Kuroki, an AI Engineer with a strong background in machine learning, natural language processing, and computer vision. I have experience building scalable, production-ready AI systems and enjoy turning innovative ideas into real-world products. Whether it's prototyping from scratch or deploying at scale, I thrive in fast-paced environments that value creativity and efficiency. I specialize in NLP, OCR, and cloud-based AI solutions, and love collaborating with cross-functional teams to bring impactful technologies to market quickly. Outside of work, I'm always eager to explore the latest advancements in AI and contribute to open-source projects that push the boundaries of what's possible.

Available to hire

Hi, I’m Yosuke Kuroki, an AI Engineer with a strong background in machine learning, natural language processing, and computer vision. I have experience building scalable, production-ready AI systems and enjoy turning innovative ideas into real-world products. Whether it’s prototyping from scratch or deploying at scale, I thrive in fast-paced environments that value creativity and efficiency.

I specialize in NLP, OCR, and cloud-based AI solutions, and love collaborating with cross-functional teams to bring impactful technologies to market quickly. Outside of work, I’m always eager to explore the latest advancements in AI and contribute to open-source projects that push the boundaries of what’s possible.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Intermediate
Intermediate
Intermediate
Intermediate
Intermediate
See more

Language

English
Advanced
Japanese
Fluent

Work Experience

SR ML/LLM, Generative AI engineer at Ultim Group
January 1, 2025 - June 14, 2024
Designed a GPT-4o-based evaluation framework for long-text generation and summarization, enabling rapid iteration. Led R&D for 'Client Intelligence' using LLMs for structured insights and text-to-SQL chatbot development. Built automated pipelines with Azure AI and vector databases for search. Scaled AI document processing platform by 100x with UI/UX optimization and prompt engineering, doubling analyst efficiency and saving 8.5k+ hours yearly. Led system refactor reducing costs by 80% and latency from 45 minutes to 5. Developed CNN-based OCR model optimized for GPU inference, improving latency by 41%.
.NET, Full Stack Developer (Contract Position) at ZOJAX GROUP
November 1, 2024 - June 14, 2024
Delivered full-stack web development using .NET, JavaScript, and modern frameworks to create scalable, high-performance applications. Collaborated with product, UX, and QA teams to design and deploy solutions enhancing user experience and business efficiency. Optimized application performance through code refactoring, database tuning, and cloud integration on Azure/AWS. Implemented clean architecture, RESTful APIs, and CI/CD pipelines for maintainability and scalability.
SR Machine Learning Engineer, MLOps at SPOTCUBE.INC
March 1, 2022 - December 31, 2024
Created hyperspectral tree species classifier with 30% accuracy improvement over RGB baseline. Developed LiDAR-RGB fusion pipeline reducing false negatives by 30%. Redesigned training infrastructure cutting model development cycles from 7 days to 20 hours and reducing AWS costs by 80%. Co-developed graph neural network architecture halving 3D point cloud processing time. Streamlined annotation workflows boosting team efficiency by 50-70%.
Machine Learning Engineer at CTI-Construction Testing & Inspection, Inc.
March 1, 2020 - February 28, 2022
Developed medicine ranking system using Python, Databricks, and SparkXGBRanker. Managed cloud infrastructure on AWS EKS and Azure Kubernetes. Designed scalable data pipelines on AWS and Google Cloud Storage. Enhanced model robustness via data augmentation and transfer learning. Containerized ML applications with Docker and ensured HIPAA-compliant deployments using Kubernetes. Migrated legacy ML pipelines for high-frequency trade execution. Established CI/CD pipelines on AWS and GCP enabling real-time risk reporting. Developed automated ETL workflows and optimized big data processing with Hadoop and Spark.
Machine Learning Developer at Google
March 1, 2019 - February 28, 2020
Developed CNN-based OCR Text Detection Model improving speed and accuracy for scanned documents. Fine-tuned OCR pipelines with Tesseract for diverse document types including archival records and handwritten notes. Integrated denoising and text detection for inference time reduction. Created computer vision techniques for character and table detection in document images. Deployed scalable pipelines on AWS for document processing and built RESTful APIs integrating ML models with legacy systems. Developed synthetic data for logo detection, enhancing robustness using OpenCV and C++. Managed ML model deployment with Docker and Kubernetes as UISE JVM Chassis applications.
AI Chatbot, Full Stack Developer at Metadata
May 1, 2018 - February 28, 2019
Developed real-time monitoring dashboards with Datadog for AI system performance. Designed ETL and preprocessing pipelines on AWS Lambda and SageMaker. Conducted load testing to optimize pipeline scalability and speed. Automated NLP pipeline for classification, review, and summarization of legal documents reducing manual effort. Collaborated with legal teams ensuring fairness and transparency in AI deployment for litigation support.
SR ML/LLM, Generative AI engineer at Ultim Group
January 1, 2025 - June 14, 2024
Designed a novel GPT-4o-based evaluation framework for long-text generation and summarization, reducing feedback cycles from days to minutes and enabling rapid iteration. Spearheaded R&D for "Client Intelligence," leveraging LLMs to extract structured insights from historical broker chatrooms and developing a text-to-SQL chatbot for natural language retrieval. Built automated ingestion and preprocessing pipelines using Azure AI and cloud functions, and embedded unstructured documents into a vector database for downstream search. Scaled an AI document processing platform by 100× (17k+ tenants/year), optimizing UI/UX, refining SME-driven prompt engineering, and doubling analyst efficiency—saving 8.5k+ hours annually. Led a major system refactor (10k+ LOC), slashing per-tenant costs by 80% and reducing latency from 45 minutes to 5. Led the development of a CNN-based OCR model optimized for GPU inference, reducing latency by 41% with TensorRT and Docker.
.NET, Full Stack Developer(Contract Position) at ZOJAX GROUP
November 1, 2024 - June 14, 2024
Full-stack web development using .NET, JavaScript, and modern frameworks to build scalable, high-performance applications for enterprise clients. Collaborated with cross-functional teams (product, UX, QA) to design, develop, and deploy end-to-end solutions that enhance user experience and business efficiency. Optimized application performance through code refactoring, database tuning, and cloud integration (Azure/AWS). Ensured maintainability and scalability by implementing clean architecture, RESTful APIs, and CI/CD pipelines.
SR Machine Learning Engineer, MLOps at SPOTCUBE.INC
March 1, 2022 - December 31, 2024
Created hyperspectral tree species classifier achieving 30% higher accuracy than RGB baseline. Implemented LiDAR-RGB fusion pipeline, reducing object detection false negatives by 30%. Redesigned training infrastructure, cutting model development cycles from 7 days to 20 hours and reducing AWS costs by 80%. Co-developed graph neural network architecture that reduced 3D point cloud processing time by 50%. Streamlined annotation workflows, increasing labeling team efficiency by 50-70%.
Machine Learning Engineer at CTI-Construction Testing & Inspection, Inc.
March 1, 2020 - February 28, 2022
Developed a medicine ranking system leveraging Python, Databricks, and SparkXGBRanker, implementing EDA, feature engineering, and hyperparameter tuning to enhance model accuracy. Managed cloud infrastructure on AWS EKS and Azure Kubernetes Service, provisioning resources via Terraform and Helm. Designed and deployed scalable data pipelines on AWS S3/EC2 & Google Cloud Storage, optimizing preprocessing and model training for medical imaging datasets. Enhanced model performance using data augmentation, transfer learning, and cross-validation techniques, improving robustness for real-world clinical applications. Containerized ML applications using Docker, orchestrating secure, HIPAA-compliant deployments with Kubernetes to ensure real-time inference in clinical settings. Deployed ranking models as UISE JVM Chassis applications, implementing realtime tracking, alerting, and periodic retraining, strengthening risk and customer engagement strategies. Refactored and migrated legacy ML pipelin
Machine Learning developer at Google
March 1, 2019 - February 28, 2020
Developed a CNN-based OCR Text Detection Model, reducing inference time for scanned documents and card images, optimizing processing speed and accuracy. Designed and fine-tuned an OCR pipeline using Tesseract, digitizing and extracting textual metadata from archival labels, handwritten notes, and printed documentation. Customized OCR models to recognize specialized fonts and handwriting styles in historical records, enhancing digital metadata enrichment. Integrated denoising, text detection, and Tesseract OCR, leading to reduction in inference time for scanned certificate documents and PDFs. Developed computer vision-based techniques for character & table detection in document scene images, leveraging OpenCV to enhance document processing efficiency. Deployed distributed document processing pipelines on AWS (EC2, S3), ensuring scalability and secure high-volume storage of legal documents. Developed cloud-based RESTful APIs that seamlessly integrated ML models with legacy e-discovery sy
AI Chatbot, Full Stack developer at Metadata
May 1, 2018 - February 28, 2019
Integrated real-time performance monitoring dashboards with Datadog, enhancing model observability and troubleshooting capabilities. Designed automated ETL & preprocessing pipelines for legal & archival datasets, ensuring efficient data ingestion & transformation on AWS Lambda & SageMaker. Conducted load testing with K6 & Locust, documenting findings & optimizing document processing pipelines for scalability & speed. Designed & implemented an end-to-end NLP pipeline to automate the classification, review, and summarization of legal documents, reducing manual e-discovery time. Collaborated with legal teams & compliance specialists to validate model fairness & transparency, ensuring ethical AI deployment in litigation support systems.

Education

Bachelor of Technology - BTech at Tokyo Institute of Technology
April 1, 2014 - May 9, 2018
Department of Information Engineering
Bachelor of Science at Tokyo Institute of Technology
February 1, 2014 - June 30, 2018
Bachelor of Science at Tokyo Institute of Technology
February 1, 2014 - June 30, 2018

Qualifications

Deep Learning Specialization
January 1, 2018 - December 31, 2018
Certified Analytics Professional (CAP)
January 1, 2019 - December 31, 2019
AWS Certified Machine Learning
January 1, 2020 - December 31, 2020
Azure Data Scientist Associate
January 1, 2021 - December 31, 2021
Certified Associate Developer for Apache Spark
January 1, 2022 - December 31, 2022
Deep Learning Specialization
January 11, 2030 - July 3, 2025
Certified Analytics Professional (CAP)
January 11, 2030 - July 3, 2025
AWS Certified Machine Learning
January 11, 2030 - July 3, 2025
Azure Data Scientist Associate
January 11, 2030 - July 3, 2025
Certified Associate Developer for Apache Spark
January 11, 2030 - July 3, 2025

Industry Experience

Software & Internet, Financial Services, Healthcare, Agriculture & Mining, Professional Services
    paper Multi Signal Wallet

    A multi-signature wallet is a cryptocurrency wallet that requires multiple signatures — instead of just one — to execute each transaction.

    paper TryHackMe
    • https://www.twine.net/signin
    • TryHackMe is a browser-based cyber security training platform, with learning content covering all skill levels from the complete beginner to the seasoned hacker. In this Role, I worked as a Full Stack Developer to migrate their application from old stack to latest stack using.
    paper Keller Williams
    • https://www.twine.net/signin
    • The world’s largest real estate franchise network, where I developed their financial and CRM systems. These systems now support transactions worth $4B across 35 countries.
      In this role, I started as a Senior React Engineer and later became the Frontend Team Lead, overseeing the frontend team across Peru, Poland, and the US and migrating their
      old application into React, Next using Bootstrap, later I introduced internationalization using i18n into their application that helped them expand into 35 countries.