Available to hire
I am Yash Vyas, a Python Data Engineer with around seven years of experience building data platforms, ML pipelines, and enterprise applications. I translate business requirements into scalable data solutions using Python, Django, PySpark, Snowflake, AWS, and modern analytics tools.
I enjoy turning complex data into actionable insights through clean code, robust ETL/ELT pipelines, and interactive dashboards. I combine backend engineering with frontend work (React, Next.js) to deliver end-to-end solutions for data-driven decision making across cloud environments.
Experience Level
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Intermediate
Intermediate
Intermediate
Intermediate
Language
Amharic
Advanced
Afar
Intermediate
Bashkir
Intermediate
English
Fluent
Work Experience
Python Data Engineer at ATS
May 1, 2023 - PresentDeveloped a data platform from scratch; designed and implemented data quality rules and frameworks; built end-to-end ML pipelines using Python, PySpark, and Snowpark; designed and managed enterprise data warehouse (Snowflake) with stored procedures, UDFs, and streams; built RESTful APIs with FastAPI; developed front-end components with React.js; created modular and reusable data transformation pipelines using DBT; added AWS S3 and RDS for hosting static media and databases; deployed and managed solutions across Power Platform environments; implemented IAM governance and security practices.
Data/ Python Engineer at OES.INC
October 1, 2021 - April 1, 2023Opened stack Python APIs; leveraged AWS for Tableau server scaling and security (VPC, security groups, IAM); optimized data processing with Spark-based pipelines; built lightweight front-ends with Vue.js; built end-to-end pipelines with Athena + S3; integrated ML models into ETL workflows; implemented model hosting and real-time monitoring with AWS services; designed NoSQL data models (MongoDB, Cassandra, Redis, DynamoDB); automated deployments and governance using GitHub Actions and Power Platform tooling.
Data Engineer at OYO
August 1, 2016 - July 1, 2019Strong foundation in Hadoop ecosystem (Hadoop, Hive, Sqoop, Pig, HBase, Oozie); designed real-time streaming solutions with Spark Streaming, Kafka, Nifi; implemented star and Snowflake schemas for data warehousing; built dashboards (Tableau) and real-time monitoring; developed batch scheduling and orchestration (Airflow); implemented NoSQL data pipelines (MongoDB) and integrated Spark/Presto with EMR; collaborated with cross-functional teams to deliver scalable ML-ready data solutions.
Python Data Engineer at ATS (earlier role continuation)
May 1, 2023 - PresentData/Python Engineer at OES.INC
October 1, 2021 - April 1, 2023Led Python OpenStack API work; built data pipelines and scripts to update databases and handle file transformations; optimized data processing with parallel computing and Spark-based pipelines; implemented data reporting front-ends with Vue.js; built end-to-end pipelines using Athena + S3; integrated ML/LLM-driven recommendations using LangChain and RAG; migrated proprietary binary data to HDFS via ingestion service; performed data transformations in Hive with partitions and buckets for performance; engineered scalable backend components for real-time processing and microservices on AWS; automated deployments with Terraform, Docker, and Kubernetes; integrated custom connectors using OAuth2; implemented monitoring and governance via Power Platform; developed and optimized data models in Microsoft Dataverse and SQL; used noSQL (MongoDB) as needed; collaborated with ML teams on model-ready imaging datasets.
Data / Python Engineer at OES.INC
October 1, 2021 - April 30, 2023OpenStack/API development and database content updates; scaled Tableau server on AWS with VPC, security groups, and IAM. Optimized data processing via parallel computing and Spark-based pipelines. Built front-end components with Vue.js and developed end-to-end ingestion pipelines (Athena + S3). Implemented LLM- and ML-driven recommendations using LangChain and vector databases. Engineered data transformations in Hive with partitions/buckets and built real-time microservices on AWS. Integrated Django/React/Vue front-ends, containerized deployments with Docker, and automated CI/CD via GitHub Actions. Implemented robust IAM strategies, OAuth 2.0 connectors, and NoSQL solutions (MongoDB). Deployed ML inference with TensorFlow Serving and PyTorch Lightning; designed NoSQL/SQL schemas for scalable apps; authored automation scripts in PowerShell/Bash; leveraged Microsoft Dataverse/SQL for scalable data architectures.
Education
M.Sc in Physics at Hemchandracharya North Gujarat University (HNGU)
January 11, 2030 - January 1, 2016Certificate or Diploma in Research and Evaluation at Fanshawe College
January 11, 2030 - January 1, 2020M.Sc at Department of Physics, HNGU, Patan, Gujarat
January 11, 2030 - January 1, 2016Certificate in Research and Evaluation at Fanshawe College, London, Ontario, Canada
January 11, 2030 - January 1, 2020Master of Science (M.Sc.) at HNGU, Patan, Gujarat
January 1, 2016 - January 1, 2016Research and Evaluation at Fanshawe College, London, Ontario, Canada
January 1, 2020 - January 1, 2020Qualifications
Industry Experience
Software & Internet, Professional Services, Media & Entertainment, Education
Experience Level
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Intermediate
Intermediate
Intermediate
Intermediate
Hire a Data Scientist
We have the best data scientist experts on Twine. Hire a data scientist in Kitchener today.