I am a Data Scientist with over 3 years of experience applying data analysis, statistical modeling, and data-driven innovation to deliver business insights and support decision-making. I possess strong quantitative, modeling, and programming skills, and I excel at leading cross-functional projects with clear communication. Throughout my career, I have developed machine learning models, designed data-driven tools for environmental and medical applications, and deployed scalable solutions. I am passionate about leveraging AI technologies, including Large Language Models and Generative AI, to optimize efficiency and reduce operational costs.

Jinxin Dong, Ph.D.

I am a Data Scientist with over 3 years of experience applying data analysis, statistical modeling, and data-driven innovation to deliver business insights and support decision-making. I possess strong quantitative, modeling, and programming skills, and I excel at leading cross-functional projects with clear communication. Throughout my career, I have developed machine learning models, designed data-driven tools for environmental and medical applications, and deployed scalable solutions. I am passionate about leveraging AI technologies, including Large Language Models and Generative AI, to optimize efficiency and reduce operational costs.

Available to hire

I am a Data Scientist with over 3 years of experience applying data analysis, statistical modeling, and data-driven innovation to deliver business insights and support decision-making. I possess strong quantitative, modeling, and programming skills, and I excel at leading cross-functional projects with clear communication.

Throughout my career, I have developed machine learning models, designed data-driven tools for environmental and medical applications, and deployed scalable solutions. I am passionate about leveraging AI technologies, including Large Language Models and Generative AI, to optimize efficiency and reduce operational costs.

See more

Experience Level

Expert
Expert
Intermediate
Intermediate
Intermediate

Work Experience

Data Scientist, Research Assistant at Wuhan Documentation and Information Center
June 1, 2022 - Present
Designed and implemented a publication identifier using binary classification models to detect cutting-edge studies in materials science, achieving a recall score of 0.90 with a BERT-based model and reducing operational cost by 80% ($80K annually). Developed a technical trend analyzer leveraging Generative AI and clustering methods, increasing report generation efficiency by over 500% and reducing costs by 70% ($35K annually). Utilized data extraction, topic modeling, and LLM entity relation extraction to analyze industry trends.
Data Scientist Fellow at Techlent Inc.
August 1, 2025 - August 26, 2025
Developed a machine learning pipeline to classify heart disease risk using clinical and demographic datasets. Conducted exploratory data analysis and feature engineering to build models including Logistic Regression, Random Forest, XGBoost, and SVM. Achieved 0.92 recall with XGBoost for high-risk cases. Deployed a Flask API on Google Cloud Platform to enable scalable early detection for medical intervention.
Postdoctoral Researcher, Data Scientist at Concordia University
April 1, 2022 - August 26, 2025
Led development of a data-driven groundwater contaminant simulation software replacing costly commercial alternatives. Built the system in Python and PyQt5, integrating over 20 environmental parameters via KNN imputation. Modeled contaminant transport through PDE-solving workflows, validated with real-world benchmarks achieving 92% prediction accuracy and providing $12K annual cost savings for users. Delivered actionable contaminant risk insights.
Sales Forecaster at Concordia University
April 1, 2022 - August 26, 2025
Developed a sales forecasting model to optimize inventory turnover and reduce capital costs. Processed data including order dates, customer segments, and product categories. Trained multiple regression models with AdaBoost delivering the best performance (R2 score of 0.77). Model deployment reduced inventory costs by 40%, amounting to $20K annual savings.

Education

Ph.D. in Multimedia Contaminant Modeling at Concordia University
January 1, 2017 - February 1, 2021
M.Eng. in Environmental Engineering at Huazhong University of Science and Technology
September 1, 2013 - June 1, 2016

Qualifications

Add your qualifications or awards here.

Industry Experience

Healthcare, Energy & Utilities, Software & Internet, Professional Services, Other