I am Bryan Saldivar, a PhD‑trained data scientist and AI researcher focused on generative models for drug discovery and health tech. I have 12 years of experience in Python, ML, and IT, with a track record of applying deep learning, computer vision, and data science to real-world problems. Currently a Postdoc at IRB Barcelona, I develop Generative AI for new chemical entities and collaborate across disciplines to bring research to impact.
I have worked across academia, startups, and industry, including a Marie Curie fellowship and a BBVA Data Challenge top‑3 finish. I enjoy teaching, mentoring, and building tools that accelerate discovery and innovation. In my free time I explore the intersection of biology and AI and share knowledge through talks, courses, and open science initiatives.
Skills
Experience Level
Language
Work Experience
Education
Qualifications
Industry Experience
Anemia is a major health burden worldwide. Examining the hemoglobin level of blood is an important way to achieve the diagnosis of anemia, but it requires blood drawing and a blood test. In this work we propose a non-invasive, fast, and cost-effective screening test for iron-deficiency anemia in Peruvian young children. Our initial results show promising evidence for detecting conjunctival pallor anemia and Artificial Intelligence techniques with photos taken with a popular smartphone.
https://www.twine.net/signin
Preliminary results of using latent difussion for new molecule generation
Predicting SARS-CoV-2 mutations is difficult, but predicting recurrent mutations driven by the host, such as those caused by host deaminases, is feasible. We used machine learning to predict which positions from the SARS-CoV-2 genome will hold a recurrent mutation and which mutations will be the most recurrent. We used data from April 2021 that we separated into three sets: a training set, a validation set, and an independent test set. For the test set, we obtained a specificity value of 0.69, a sensitivity value of 0.79, and an Area Under the Curve (AUC) of 0.8, showing that the prediction of recurrent SARS-CoV-2 mutations is feasible. Subsequently, we compared our predictions with updated data from January 2022, showing that some of the false positives in our prediction model become true positives later on. The most important variables detected by the model’s Shapley Additive exPlanation (SHAP) are the nucleotide that mutates and RNA reactivity. This is consistent with the SARS-CoV-2 mutational bias pattern and the preference of some host deaminases for specific sequences and RNA secondary structures. We extend our investigation by analyzing the mutations from the variants of concern Alpha, Beta, Delta, Gamma, and Omicron. Finally, we analyzed amino acid changes by looking at the predicted recurrent mutations in the M-pro and spike proteins. https://www.twine.net/signin
Hire a Data Scientist
We have the best data scientist experts on Twine. Hire a data scientist in El Vendrell today.