I am a PhD in Computer Science with a solid background in research and applied projects across industry and academia. My experience spans machine learning, cloud infrastructure, and algorithm design, with a recent focus on LLM engineering, Python and Kubernetes. I translate complex technical challenges into well-architected software that prioritizes efficiency and performance. I have led multi-university R&D projects, built and optimized cloud deployments, and guided junior data scientists. In my recent roles, I reduced AWS spending by half, built infection-analysis models for a leading hospital, and contributed to LLM training and evaluation. I enjoy turning data into actionable insights and scalable systems.

Augusto Vellozo

I am a PhD in Computer Science with a solid background in research and applied projects across industry and academia. My experience spans machine learning, cloud infrastructure, and algorithm design, with a recent focus on LLM engineering, Python and Kubernetes. I translate complex technical challenges into well-architected software that prioritizes efficiency and performance. I have led multi-university R&D projects, built and optimized cloud deployments, and guided junior data scientists. In my recent roles, I reduced AWS spending by half, built infection-analysis models for a leading hospital, and contributed to LLM training and evaluation. I enjoy turning data into actionable insights and scalable systems.

Available to hire

I am a PhD in Computer Science with a solid background in research and applied projects across industry and academia. My experience spans machine learning, cloud infrastructure, and algorithm design, with a recent focus on LLM engineering, Python and Kubernetes. I translate complex technical challenges into well-architected software that prioritizes efficiency and performance.

I have led multi-university R&D projects, built and optimized cloud deployments, and guided junior data scientists. In my recent roles, I reduced AWS spending by half, built infection-analysis models for a leading hospital, and contributed to LLM training and evaluation. I enjoy turning data into actionable insights and scalable systems.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
See more

Language

Portuguese
Fluent
English
Fluent
French
Advanced
Spanish; Castilian
Beginner

Work Experience

Data Scientist at Hospital Albert Einstein
December 1, 2023 - Present
Built ML models (Random Forest, Neural Networks) for infection analysis on imbalanced datasets; clustered hospital survey data using K-means, HDBSCAN, and PCA; converted hand-filled PDFs into structured data using Gemini Vision Pro API; created interactive dashboards with Plotly and Streamlit; fine-tuned Bert and GPT-family models for text classification with advanced hyperparameter optimization to address gradient instability and class imbalance.
Freelance AI Data Trainer at Mercor
December 1, 2025 - Present
Colosseum Project: produced comparative evaluation reports of LLM responses focusing on correctness, reasoning quality and code reliability. Jupyter Notebook project: created high-quality notebook samples and prompts for training and evaluating LLMs.
Freelance AI Data Trainer at Revelo
July 1, 2025 - Present
Generated detailed thought processes and rationales for debugging Python open-source projects using TDD; modified Dockerfiles to replicate, debug and test reported issues in isolated environments; contributed to the training of large language models by providing structured data for bug-fixing actions, test cases and code reviews.
Freelance Data Science Consultant at Gravitad
December 1, 2024 - Present
Analyzed project objectives and requirements to propose optimal development approaches for AI-driven business applications.
Applied Research Leader and DevOps Manager at TecSinapse
November 1, 2015 - March 31, 2024
Led multi-university R&D projects in recommendation systems, NLP, computer vision, IoT, logistics, and optimization. Managed the cloud infrastructure team and reduced AWS spending by 50% within the first year. Developed customer lifetime value predictors, LOF-based anomaly detection, and dynamic routing systems. Guided internship and junior data scientists and engineers through best practices in modeling, analytics and research.
Researcher at FIT - Flextronics Instituto de Tecnologia
October 1, 2013 - November 30, 2015
Led research in OpenStack, LTE simulators, traffic radar systems and mainframe disaster recovery.
Postdoctoral Researcher at Université Claude Bernard Lyon 1
July 1, 2007 - December 31, 2010
Developed CycADS system for metabolic pathway annotation (Java, MySQL, PathwayTools). Created ACYPICYC database for Acyrthosiphon pisum genome.
Bioinformatics Researcher (Postdoc) at Embrapa
February 1, 2012 - February 28, 2013
Analyzed large-scale NGS datasets using suffix trees and Hadoop. Set up and customized bioinformatics platforms: InterMine, GBrowse, Chado.
Software Developer, Analyst, Professor at Various
January 1, 1997 - December 31, 2011
Developed enterprise applications using Java, Oracle, Visual Basic, Delphi; contributed to home banking systems and distributed architectures; taught algorithms, object-oriented programming, and distributed computing.

Education

PhD in Computer Science at Universidade de São Paulo (USP)
January 1, 2001 - December 31, 2007
Specialist in Marketing and Administration at Universidade de Sorocaba
January 1, 1995 - December 31, 1996
Bachelor in Computer Science at Universidade Estadual de Campinas
January 1, 1985 - December 31, 1989

Qualifications

Add your qualifications or awards here.

Industry Experience

Computers & Electronics, Healthcare, Software & Internet, Education, Life Sciences, Professional Services