I am a data scientist and analyst with over 5 years of experience delivering high-impact data solutions across healthcare and financial services. I specialize in predictive modeling, ETL pipelines, and interactive BI dashboards that drive operational efficiency and strategic decision-making. I am a people-focused communicator who translates complex technical findings into actionable insights for executives, and I thrive in cross-functional teams in fast-paced environments. I am excited to explore opportunities in New Zealand's tech ecosystem and bring expertise in Python, SQL, and cloud platforms to solve real-world problems.

Martins O. Okhuakhua

I am a data scientist and analyst with over 5 years of experience delivering high-impact data solutions across healthcare and financial services. I specialize in predictive modeling, ETL pipelines, and interactive BI dashboards that drive operational efficiency and strategic decision-making. I am a people-focused communicator who translates complex technical findings into actionable insights for executives, and I thrive in cross-functional teams in fast-paced environments. I am excited to explore opportunities in New Zealand's tech ecosystem and bring expertise in Python, SQL, and cloud platforms to solve real-world problems.

Available to hire

I am a data scientist and analyst with over 5 years of experience delivering high-impact data solutions across healthcare and financial services. I specialize in predictive modeling, ETL pipelines, and interactive BI dashboards that drive operational efficiency and strategic decision-making.

I am a people-focused communicator who translates complex technical findings into actionable insights for executives, and I thrive in cross-functional teams in fast-paced environments. I am excited to explore opportunities in New Zealand’s tech ecosystem and bring expertise in Python, SQL, and cloud platforms to solve real-world problems.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Intermediate

Work Experience

Data Scientist at Clover Health
January 1, 2024 - Present
Developed and deployed automated ETL pipelines using Python, reducing monthly data processing cycle time by 30%. Engineered 30+ interactive dashboards (Power BI/Tableau) to track real-time metrics, increasing executive stakeholder satisfaction by 20%. Conducted KPI analyses on CAC and LTV to optimize monetization. Performed advanced statistical analysis and A/B testing to validate product feature rollouts and marketing interventions.
Research Assistant / Coordinator at Maternal & Reproductive Health Research
January 1, 2023 - December 31, 2024
Managed end-to-end data collection for clinical research, implementing validation scripts to ensure data integrity for academic publication. Utilized statistical software to perform longitudinal analyses on patient outcomes, identifying key risk indicators. Streamlined laboratory workflows by designing custom project management databases to improve cross-functional coordination.
Community Health Data Volunteer at Save the Children International
January 1, 2025 - Present
Implemented quantitative monitoring frameworks for child protection cases, utilizing trend analysis to prioritize urgent interventions. Synthesized qualitative field observations with quantitative impact metrics to produce reports for international donors. Restructured disparate data sources to build a unified tracking system for community health metrics.

Education

B.Sc. Biochemistry (2:1) at Ambrose Ali University, Ekpoma, Nigeria
January 11, 2030 - January 1, 2018
Data Analytics Certification at Nebiant Analytics, Lagos, Nigeria
January 1, 2022 - January 14, 2026

Qualifications

Project Management Professional (PMP) - Distinction
January 11, 2030 - January 14, 2026
CRM Professional
January 11, 2030 - January 14, 2026
HSE 1 & 2
January 11, 2030 - January 14, 2026

Industry Experience

Healthcare, Financial Services, Education
    paper Predictive Analytics for New Zealand Dairy Yield Optimization

    Executive Summary

    This project addresses a critical challenge in New Zealand’s agricultural sector: predicting dairy production fluctuations based on regional climate variations. Using a decade of historical climate and production data, I developed a machine learning model to forecast monthly yields, enabling farmers and supply chain managers to optimize resource allocation.

    Business Problem

    New Zealand is a global leader in dairy exports. However, production is highly sensitive to climate patterns (rainfall and soil moisture). Unexpected drops in yield lead to supply chain disruptions and financial losses. The goal was to build a predictive tool that provides a 3-month lead time for production forecasts.

    Tech Stack

    Languages: Python (Pandas, NumPy, Scikit-learn)

    Visualization: Matplotlib, Seaborn, Power BI

    Environment: Jupyter Notebook / AWS Sagemaker

    Techniques: Time-Series Analysis, Random Forest Regressor, Feature Engineering

    Methodology

    1. Data Integration & Cleaning

    Aggregated climate data (temperature, precipitation, sunshine hours) from NIWA (National Institute of Water and Atmospheric Research) with regional dairy production metrics.

    Handled missing values using seasonal interpolation to maintain time-series integrity.

    1. Feature Engineering

    Created Lag Features: 1-month and 3-month lags for rainfall to account for the delayed impact of drought on pasture growth.

    Engineered a “Pasture Stress Index”—a derived metric combining soil moisture and temperature.

    1. Machine Learning Modeling

    Compared multiple models: Linear Regression, XGBoost, and Random Forest.

    Winner: Random Forest Regressor yielded the best performance due to its ability to capture non-linear relationships between climate variables.

    Hyperparameter Tuning: Used GridSearchCV to optimize tree depth and estimators.

    1. Evaluation

    Mean Absolute Error (MAE): Achieved an error margin of less than 4% on test data.

    R-Squared: 0.89, indicating a strong correlation between climate features and production output.

    Key Insights

    Rainfall Lag: Rainfall from 2 months prior is the strongest predictor of current-month milk solids production.

    Regional Variation: The Waikato region showed higher sensitivity to temperature spikes compared to the Southland region.

    Impact

    Operational Efficiency: The model provides a 90-day warning for potential yield drops, allowing for proactive feed-stocking.

    Strategic Planning: Insights from the Power BI dashboard helped identify high-risk zones for seasonal drought impact.

    Note: This project demonstrates my ability to apply Data Science to specific New Zealand industrial contexts.