I'm a skilled AI Software Engineer for 7+ years of experience building intelligent systems, from generative AI to voice interfaces and agentic automation. Recently, I've focused on full-stack AI development using React Native, React, Flask, and cloud platforms like AWS and GCP — with deep expertise in LLMs, RAG, computer vision, and real-time speech systems across industries like enterprise SaaS, content automation, and virtual media.

I'm a skilled AI Software Engineer for 7+ years of experience building intelligent systems, from generative AI to voice interfaces and agentic automation. Recently, I've focused on full-stack AI development using React Native, React, Flask, and cloud platforms like AWS and GCP — with deep expertise in LLMs, RAG, computer vision, and real-time speech systems across industries like enterprise SaaS, content automation, and virtual media.

Available to hire

I’m a skilled AI Software Engineer for 7+ years of experience building intelligent systems, from generative AI to voice interfaces and agentic automation.

Recently, I’ve focused on full-stack AI development using React Native, React, Flask, and cloud platforms like AWS and GCP — with deep expertise in LLMs, RAG, computer vision, and real-time speech systems across industries like enterprise SaaS, content automation, and virtual media.

See more

Language

English
Advanced

Work Experience

AI Engineer at Novaverse
October 22, 2019 - February 22, 2023
• Developed video generation backend integrating OpenCV + Stable Diffusion Video pipelines for real estate virtual staging — processed 10K+ videos/month. • Implemented LangChain agents to automate video script → voiceover → asset eneration → publishing workflows — cut production time from 3 days to 4 hours. • Engineered GraphQL + Flask APIs to serve multimodal AI outputs (image, text, video) to OpenReact/React Native frontends with <1s latency. • Pioneered use of Ollama + LlamaIndex for local LLM inference in client demos — enabled offline-capable AI features without cloud dependency. • Collaborated with CV researchers (3 co-authored publications) on diffusion model optimization for edge deployment — achieved 3x speedup on mobile GPUs.
Full Stack AI Engineer at Matik AI
January 23, 2024 - March 23, 2025
• Architected and deployed multi-agent LLM systems using LangGraph + Autogen to automate internal tooling and user support workflows, reducing manual ticket resolution by 60%. • Engineered GPT-4o + Claude 3.5 hybrid pipelines with structured output parsing and tool calling for highaccuracy enterprise search and summarization. • Optimized RAG pipelines using Pinecone + LangChain for contextual retrieval across 10M+ document corpus, improving answer relevance by 35%. • Built real-time voice interfaces using Whisper + ElevenLabs integrated into mobile apps (React Native), enabling hands-free user interactions for accessibility features. • Leveraged MCP (Model Context Protocol) to dynamically route prompts between specialized models (Gemini for reasoning, Mistral for speed, GPT-4 for creativity). • Instrumented LangSmith for prompt versioning, A/B testing, and latency monitoring — reduced hallucination rate by 22% through iterative refinement. • Deployed vLLM + Unsloth for cost-efficient fine-tuning and inference of domain-specific LLMs on GCP Vertex AI.
Software Engineer at Glass Health
February 23, 2021 - November 23, 2023
• Engineered high-performance RAG architectures using LangChain + Pinecone (10M+ docs) and Weaviate + PostgreSQL — improving answer relevance by 35% and enabling semantic search with <200ms latency. Built hybrid keyword + vector search systems for enterprise applications. • Implemented and optimized production RAG pipelines using Pinecone, Weaviate, and FAISS — supporting contextual retrieval for LLMs across multimodal datasets (text, image metadata, user logs) at scale. • Implemented async Python workflows (asyncio, Celery) for concurrent model inference, media processing, and webhook handling — enabling scalable, non-blocking AI pipelines serving 10K+ requests/month. • Built real-time data ingestion and preprocessing pipelines for generative AI platforms — cleaning, deduplicating, and structuring user-generated content before vectorization. Integrated streaming via Twilio/Bland.ai for live conversational data.

Education

Bachelor's Degree at Kohoku University
April 23, 2014 - September 23, 2018

Qualifications

Add your qualifications or awards here.

Industry Experience

Healthcare, Software & Internet, Telecommunications, Media & Entertainment, Education, Real Estate & Construction, Computers & Electronics
    paper LyricFlow ai, https://www.lyricflow-ai.com/

    LyricFlow AI is an AI-powered SaaS platform dedicated to redefining songwriting. Designed for musicians, lyricists, and creators of all skill levels, LyricFlow AI generates high-quality, original lyrics that match your creative intent.
    Built on cutting-edge AI trained with millions of lyrical data points, our platform captures the nuances of language, rhythm, and storytelling to provide lyrics that feel authentic and impactful.

    paper Butterflies AI, https://butterflies.ai/

    Butterflies.ai is a social media platform where users can create and interact with AI characters and other humans in a “digital playground”. The platform allows users to build their own AI friends with unique personalities, engage in chats with these AI and human users, and share content through posts and comments. Founded by former Snap employees, Butterflies.ai aims to enhance AI realism and provide a social space for both humans and AI to coexist and create content.