I am a highly skilled AI Engineer with over 8 years of experience delivering AI-driven solutions across sales intelligence, contract management, computer vision, natural language processing, speech-to-text, and image generation. I specialize in building scalable systems and deploying state-of-the-art models to solve real-world business problems.\n\nI have led cross-functional teams, designed data-driven architectures, and delivered measurable impact—reducing latency, boosting conversions, and enabling real-time analytics. I enjoy turning complex requirements into robust APIs, RAG workflows, and multimodal AI applications.

Jose Guerra

I am a highly skilled AI Engineer with over 8 years of experience delivering AI-driven solutions across sales intelligence, contract management, computer vision, natural language processing, speech-to-text, and image generation. I specialize in building scalable systems and deploying state-of-the-art models to solve real-world business problems.\n\nI have led cross-functional teams, designed data-driven architectures, and delivered measurable impact—reducing latency, boosting conversions, and enabling real-time analytics. I enjoy turning complex requirements into robust APIs, RAG workflows, and multimodal AI applications.

Available to hire

I am a highly skilled AI Engineer with over 8 years of experience delivering AI-driven solutions across sales intelligence, contract management, computer vision, natural language processing, speech-to-text, and image generation. I specialize in building scalable systems and deploying state-of-the-art models to solve real-world business problems.\n\nI have led cross-functional teams, designed data-driven architectures, and delivered measurable impact—reducing latency, boosting conversions, and enabling real-time analytics. I enjoy turning complex requirements into robust APIs, RAG workflows, and multimodal AI applications.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
See more

Language

English
Fluent

Work Experience

AI Engineer at TMC Motor
April 1, 2015 - October 5, 2025
Optimized LLaMA-3 8B models, reducing size by 75% and latency by 50% while maintaining 0.95 recall and 30 BLEU score. Implemented a RAG-based personalized sales and marketing bot using LangChain with few-shot and chain-of-thought prompting, improving lead qualification by 40% and content personalization by 25%. Built asynchronous backend with FastAPI and Redis caching. Refactored backend into Go microservices on AWS Lambda and Docker, reducing API response times from 1.2s to 0.7s and enabling 200k+ concurrent users. Implemented event-driven architecture with AWS, Apache Kafka and RabbitMQ for real-time inventory updates across WhatsApp and Twilio, reducing discrepancies by 25%. Integrated Stripe payments with analytics dashboard processing over $1.2M monthly with zero downtime in 12 months. Led CI/CD adoption with AWS CDK, Jenkins and GitHub Actions, cutting deployment times from 2 hours to 15 minutes and achieving 100% deployment success.
AI Engineer at Yondu Inc
September 1, 2023 - October 5, 2025
Built personalized product recommendations using transformer models (BERT, T5) and PyTorch for e-commerce, increasing conversions by 32% and adding $2.5M in annual revenue. Designed dynamic pricing with XGBoost using real-time data, boosting revenue optimization by 20%. Fine-tuned a Llama-based model on a dataset combining financial and store data for nuanced insights. Integrated an advanced TTS module with ElevenLabs for audio personalization. Created RESTful APIs with Node.js and integrated Facebook, Twitter, Instagram APIs, reducing cross-API latency from 800ms to 250ms. Implemented WebSocket-based real-time notifications (Socket.IO) for 1.5M weekly events, increasing engagement by ~10k events weekly. Implemented OAuth 2.0 and security hardening with zero incidents for 12+ months.
Backend Developer at Pointwest Technologies
June 1, 2021 - October 5, 2025
Engineered backend services for a healthcare platform using Google Cloud Firebase, Python (Django) and PostgreSQL, processing 100k patient records daily with 99.9% uptime. Built RESTful APIs with Node.js and Django, reducing data retrieval times from 2.5s to 1.2s. Leveraged FastAPI for internal analytics tool handling 100k+ requests per minute with improved latency. Integrated full-text search with Elasticsearch and PostgreSQL. Collaborated with front-end to deliver robust APIs used by over 1,500 doctors and nurses.
AI Engineer at TMC Motor
April 1, 2025 - October 5, 2025
AI Engineer responsible for optimizing large language models and building AI-powered systems. Achievements include optimizing LLaMA-3 8B to reduce size by 75% and latency by 50% while maintaining high recall (0.95) and BLEU (30). Implemented a RAG-based personalized sales/marketing bot with LangChain, set up asynchronous FastAPI backend with Redis caching, and migrated backend to Go microservices on AWS Lambda and Docker to improve API response times (1.2s to 0.7s) and scale to 200k+ concurrent users. Delivered event-driven inventory updates via AWS services, Kafka, and RabbitMQ across channels (WhatsApp, Twilio); integrated Stripe payments with analytics dashboard processing over $1.2M monthly with zero downtime for 12 months. Led CI/CD adoption using AWS CDK, Jenkins, and GitHub Actions, cutting deployment times from 2 hours to 15 minutes with 100% success.
AI Engineer at Yondu Inc
September 1, 2023 - October 5, 2025
Developed personalized product recommendation system using transformer-based models (BERT, T5) and PyTorch for an e-commerce client, increasing conversions by 32% and generating $2.5M in additional annual revenue. Built dynamic pricing model with XGBoost using real-time data, boosting revenue optimization by 20%. Fine-tuned a Llama-based model on a hybrid dataset for nuanced insights, and integrated ElevenLabs TTS for audio personalization. Created RESTful APIs with Node.js, integrated Facebook/Twitter/Instagram for data sync (latency 800ms → 250ms). Implemented WebSocket-based notifications for real-time engagement and OAuth 2.0 security reducing incidents to zero for 12+ months.
Backend Developer at Pointwest Technologies
June 1, 2021 - October 5, 2025
Engineered backend services for a healthcare platform using Google Cloud Firebase, Python (Django), and PostgreSQL, processing 100k+ patient records daily with 99.9% uptime. Built RESTful APIs with Node.js and Django, improving data retrieval times and order fulfilment. Used FastAPI for internal analytics (100k+ requests per minute) with 40% faster data retrieval vs Flask. Implemented Elasticsearch with PostgreSQL full-text search and collaborated with frontend teams to deliver robust APIs for a user base of 1,500 doctors and nurses.

Education

Bachelor of Science in Computer Science at Mapua University
September 1, 2012 - July 1, 2016
Bachelor of Science in Computer Science at Mapua University
September 1, 2012 - July 1, 2016

Qualifications

Add your qualifications or awards here.

Industry Experience

Software & Internet, Professional Services, Healthcare