Available to hire
Hi, I’m Gyaneshwar Prajapati, a data engineer with over 9 years of experience in building scalable data pipelines and analytics platforms. I thrive on turning complex data into actionable insights and helping organizations make data-driven decisions.
I collaborate with cross-functional teams to design solutions that align with business goals while ensuring data accuracy and security. I have a strong foundation in data modeling, ETL/ELT processes, and advanced statistical methods, and I’m comfortable working across cloud platforms, big data ecosystems, and BI tools to drive impact.
Skills
Language
English
Fluent
Work Experience
Senior Data Engineer at Bluethink Inc
November 1, 2023 - Present
Designed and implemented end-to-end data pipelines and analytics workflows, leveraging AWS Glue components (Crawlers, Jobs, Triggers) for schema generation, data transformation, and workflow automation. Integrated Pandas and AWS Lambda for additional processing. Employed Spark (Python) and Hive for scalable data processing and data distribution across data lakes and warehouses.
Senior Data Engineer at Bluethink Inc
August 1, 2021 - September 30, 2023
Health Channel project: developed data pipelines on GCP using Dataflow to extract, transform, and load data into BigQuery; designed BigQuery datasets and views; implemented serverless Cloud Functions and event-driven data triggers to automate workflows.
Senior Data Engineer at Bluethink Inc
November 1, 2018 - May 31, 2021
MCE project: built Apache Airflow-based pipelines, integrated Snowflake, PySpark, SFTP, and AWS S3 for ETL; optimized SQL and data transfer; used Spark for cleansing, transformation, and aggregation.
Senior Data Engineer at Bluethink Inc
February 1, 2016 - October 31, 2018
Merchant (Leanscale) project: scraped e-commerce sites (Amazon, AliExpress, Walmart, Asos) and stored the data in MongoDB; used Python (2.7 and 3.x) for XML/JSON processing; deployed the scripts on AWS with MongoDB; wrote order automation scripts.
Sr Data Engineer at Bluethink Inc
November 1, 2023 - Present
Led end-to-end data pipeline design and implementation, leveraging AWS Glue, Spark, Hive, and Hadoop-based ecosystems; defined data flow, schema generation, and automated ETL execution using Crawlers, Jobs, and Triggers; collaborated with stakeholders to deliver scalable data solutions.
Data Engineer at Bluethink Inc
August 1, 2021 - September 30, 2023
Health Channel project: Developed data management and automation using Google Cloud Platform services. Built ETL pipelines with Google Dataflow, loaded data into BigQuery, designed datasets, views, and stored procedures, and implemented serverless functions and event-driven processes for data automation.
Data Engineer at Bluethink Inc
November 1, 2018 - May 31, 2021
MCE project: Streamlined data operations by integrating multiple data sources with Apache Airflow, Snowflake, PySpark, SFTP, and AWS S3. Optimized SQL queries, data transfers, and data processing pipelines.
Data Engineer at Leanscale
February 1, 2016 - October 31, 2018
Merchant project: Web scraping and data automation for e-commerce. Created Python scripts to extract data from Amazon, AliExpress, Walmart, and Asos, stored the results in MongoDB, and deployed the scripts on AWS. Implemented data exchange and business logic via XML/JSON processing.
Education
Master of Computer Applications at Uttar Pradesh Technical University, India
January 11, 2030 - December 10, 2025
Qualifications
Industry Experience
Software & Internet, Media & Entertainment, Professional Services, Computers & Electronics