Hello, I’m Andy Malik, a Principal Data Scientist focused on delivering scalable ML and GenAI solutions that drive real business impact. I’ve led cross-functional teams to build end-to-end data platforms, automated ML pipelines, and responsible AI practices across cloud environments (AWS, Azure, GCP). I enjoy turning messy data into clear strategies, improving automation, governance, and decision support through practical, production-ready models. I’m passionate about deploying intelligent systems that blend NLP, deep learning, and analytics with strong collaboration with engineering, product, and leadership. My goal is to create reliable, transparent, and impactful AI solutions that empower teams to move faster and make data-driven decisions with confidence.

Andy Malik

Hello, I’m Andy Malik, a Principal Data Scientist focused on delivering scalable ML and GenAI solutions that drive real business impact. I’ve led cross-functional teams to build end-to-end data platforms, automated ML pipelines, and responsible AI practices across cloud environments (AWS, Azure, GCP). I enjoy turning messy data into clear strategies, improving automation, governance, and decision support through practical, production-ready models. I’m passionate about deploying intelligent systems that blend NLP, deep learning, and analytics with strong collaboration with engineering, product, and leadership. My goal is to create reliable, transparent, and impactful AI solutions that empower teams to move faster and make data-driven decisions with confidence.

Available to hire

Hello, I’m Andy Malik, a Principal Data Scientist focused on delivering scalable ML and GenAI solutions that drive real business impact. I’ve led cross-functional teams to build end-to-end data platforms, automated ML pipelines, and responsible AI practices across cloud environments (AWS, Azure, GCP). I enjoy turning messy data into clear strategies, improving automation, governance, and decision support through practical, production-ready models.

I’m passionate about deploying intelligent systems that blend NLP, deep learning, and analytics with strong collaboration with engineering, product, and leadership. My goal is to create reliable, transparent, and impactful AI solutions that empower teams to move faster and make data-driven decisions with confidence.

See more

Language

English
Fluent

Work Experience

Principal Data Scientist at Anchor
December 1, 2021 - Present
Designed and deployed ML models for predictive analytics, anomaly detection, and customer segmentation across cloud and enterprise systems. Built and maintained end-to-end ML pipelines using Python, Scikit-learn, TensorFlow, and MLflow for scalable production workflows. Led MLOps initiatives with Docker, Jenkins, and Kubernetes to enable automated, reproducible, and version-controlled model deployments. Developed NLP and GenAI solutions including document classification, sentiment analysis, chatbot integration, and LLM-based enhancements using Transformers, SpaCy, Hugging Face, and OpenAI APIs. Integrated LLM-based summarization tools using LangChain and LlamaIndex into enterprise reporting dashboards to enhance decision support and content automation. Drove responsible AI practices by embedding fairness, transparency, and model governance into all stages of development.
Lead Data Scientist at PetraPower
November 30, 2021 - July 24, 2025
Led the development of predictive models for customer retention, demand forecasting, and operational efficiency across energy and utility datasets. Designed and implemented feature engineering pipelines and preprocessing workflows using Python, Pandas, and Scikit-learn. Applied statistical modeling and machine learning algorithms to uncover actionable insights and guide data-driven decision-making. Collaborated with engineering and product teams to deploy models into production systems, improving performance and automation. Conducted rigorous model validation using cross-validation, A/B testing, and performance metrics (ROC-AUC, precision/recall). Mentored junior data scientists and drove adoption of best practices in model development, documentation, and reproducibility.
Machine Learning Engineer at Mindstorm Studios
May 31, 2017 - July 24, 2025
Designed and deployed ML models for user behavior prediction, churn analysis, and recommendation systems in gaming applications. Built robust data pipelines and performed feature engineering using Python, SQL, and early-stage Spark tools. Collaborated with product teams to integrate ML solutions into live systems, enhancing personalization and gameplay. Conducted EDA and A/B testing to validate model effectiveness and drive data-informed strategies. Delivered scalable, end-to-end ML solutions from prototyping to production deployment.
Principal Data Scientist at Anchor
December 1, 2021 - Present
Designed and deployed ML models for predictive analytics, anomaly detection, and customer segmentation across cloud and enterprise systems. Built and maintained end-to-end ML pipelines using Python, Scikit-learn, TensorFlow, and MLflow for scalable production workflows. Led MLOps initiatives with Docker, Jenkins, and Kubernetes to enable automated, reproducible, and version-controlled model deployments. Developed NLP and GenAI solutions including document classification, sentiment analysis, chatbot integration, and LLM-based enhancements using Transformers, SpaCy, Hugging Face, and OpenAI APIs. Integrated LLM-based summarization tools using LangChain and LlamaIndex into enterprise reporting dashboards to enhance decision support and content automation. Drove responsible AI practices by embedding fairness, transparency, and model governance into all stages of development.
Lead Data Scientist at PetraPower
November 30, 2021 - July 24, 2025
Led the development of predictive models for customer retention, demand forecasting, and operational efficiency across energy and utility datasets. Designed and implemented feature engineering pipelines and preprocessing workflows using Python, Pandas, and Scikit-learn. Applied statistical modeling and machine learning algorithms to uncover actionable insights and guide data-driven decision-making. Collaborated with engineering and product teams to deploy models into production systems, improving performance and automation. Conducted rigorous model validation using cross-validation, A/B testing, and performance metrics (ROC-AUC, precision/recall). Mentored junior data scientists and drove adoption of best practices in model development, documentation, and reproducibility.
Machine Learning Engineer at Mindstorm Studios
May 31, 2017 - July 24, 2025
Designed and deployed ML models for user behavior prediction, churn analysis, and recommendation systems in gaming applications. Built robust data pipelines and performed feature engineering using Python, SQL, and early-stage Spark tools. Collaborated with product teams to integrate ML solutions into live systems, enhancing personalization and gameplay. Conducted EDA and A/B testing to validate model effectiveness and drive data-informed strategies. Delivered scalable, end-to-end ML solutions from prototyping to production deployment.
Principal Data Scientist at Anchor
December 1, 2021 - Present
Designed and deployed ML models for predictive analytics, anomaly detection, and customer segmentation across cloud and enterprise systems. Built and maintained end-to-end ML pipelines using Python, Scikit-learn, TensorFlow, and MLflow for scalable production workflows. Led MLOps initiatives with Docker, Jenkins, and Kubernetes to enable automated, reproducible, and version-controlled model deployments. Developed NLP and GenAI solutions including document classification, sentiment analysis, chatbot integration, and LLM-based enhancements using Transformers, SpaCy, Hugging Face, and OpenAI APIs. Integrated LLM-based summarization tools into enterprise reporting dashboards to enhance decision support and content automation. Drove responsible AI practices by embedding fairness, transparency, and model governance into all stages of development.
Lead Data Scientist at Petra Power
November 1, 2021 - September 19, 2025
Led development of predictive models for customer retention, demand forecasting, and operational efficiency across energy and utility datasets. Designed and implemented feature engineering pipelines and preprocessing workflows using Python, Pandas, and Scikit-learn. Applied statistical modeling and machine learning algorithms to uncover actionable insights and guide data-driven decision-making. Collaborated with engineering and product teams to deploy models into production systems, improving performance and automation. Conducted rigorous model validation using cross-validation, A/B testing, and performance metrics (ROC-AUC, precision/recall). Mentored junior data scientists and drove adoption of best practices in model development, documentation, and reproducibility.
Machine Learning Engineer at Mindstorm Studios
May 1, 2017 - September 19, 2025
Designed and deployed ML models for user behavior prediction, churn analysis, and recommendation systems in gaming applications. Built robust data pipelines and performed feature engineering using Python, SQL, and early-stage Spark tools. Col laborated with product teams to integrate ML solutions into live systems, enhancing personalization and gameplay. Conducted EDA and A/B testing to validate model effectiveness and drive data-informed strategies. Delivered scalable, end-to-end ML solutions from prototyping to production deployment.
Associate Machine Learning Engineer at Labelbox
November 1, 2015 - September 19, 2025
Designed and deployed ML models for user behavior prediction, churn analysis, and recommendation systems in gaming applications. Built robust data pipelines and performed feature engineering using Python, SQL, and early-stage Spark tools. Collaborated with product teams to integrate ML solutions into live systems, enhancing personalization and gameplay. Conducted EDA and A/B testing to validate model effectiveness and drive data-informed strategies. Delivered scalable, end-to-end ML solutions from prototyping to production deployment.
Principal Data Scientist at Anchor
December 1, 2021 - Present
Designed and deployed ML models for predictive analytics, anomaly detection, and customer segmentation across cloud and enterprise systems. Built and maintained end-to-end ML pipelines using Python, Scikit-learn, TensorFlow, and MLflow for scalable production workflows. Led MLOps initiatives with Docker, Jenkins, and Kubernetes to enable automated, reproducible, and version-controlled model deployments. Developed NLP and GenAI solutions including document classification, sentiment analysis, chatbot integration, and LLM-based enhancements. Integrated LLM-based summarization tools into enterprise reporting dashboards to enhance decision support and content automation. Drove responsible AI practices by embedding fairness, transparency, and model governance into all stages of development.
Lead Data Scientist at Petra Power
November 1, 2021 - September 19, 2025
Led the development of predictive models for customer retention, demand forecasting, and operational efficiency across energy and utility datasets. Designed and implemented feature engineering pipelines and preprocessing workflows using Python, Pandas, and Scikit-learn. Applied statistical modeling and machine learning algorithms to uncover actionable insights and guide data-driven decision-making. Collaborated with engineering and product teams to deploy models into production systems, improving performance and automation. Conducted rigorous model validation using cross-validation, A/B testing, and performance metrics (ROC-AUC, precision/recall). Mentored junior data scientists and promoted reproducibility and best practices.
Machine Learning Engineer at Mindstorm Studios
May 1, 2017 - September 19, 2025
Designed and deployed ML models for user behavior prediction, churn analysis, and recommendation systems in gaming applications. Built robust data pipelines and performed feature engineering using Python, SQL, and early-stage Spark tools. Collaborated with product teams to integrate ML solutions into live systems, enhancing personalization and gameplay. Conducted EDA and A/B testing to validate model effectiveness and drive data-informed strategies. Delivered scalable, end-to-end ML solutions from prototyping to production deployment.
Associate Machine Learning Engineer at Labelbox
November 1, 2015 - September 19, 2025
Designed and deployed ML models for user behavior prediction, churn analysis, and recommendation systems in gaming applications. Built robust data pipelines and performed feature engineering using Python, SQL, and early-stage Spark tools. Collaborated with product teams to integrate ML solutions into live systems, enhancing personalization and gameplay. Conducted EDA and A/B testing to validate model effectiveness and drive data-informed strategies. Delivered scalable, end-to-end ML solutions from prototyping to production deployment.

Education

Master's Degree at Preston University
January 11, 2030 - July 24, 2025
Master's Degree at Preston University
January 1, 2010 - December 31, 2012
Master's Degree in Computer Science at New Jersey Institute of Technology
January 11, 2030 - September 19, 2025
Master's Degree in Computer Science at New Jersey Institute of Technology
January 11, 2030 - September 19, 2025

Qualifications

Add your qualifications or awards here.

Industry Experience

Software & Internet, Energy & Utilities, Gaming, Professional Services, Media & Entertainment, Other