I am Krushit Sakaria,
a Cloud Data Engineer
with 7+ years of
experience in AWS,
GCP, and Azure.
About
Accomplished Cloud Data Engineer with 7+ years of experience in architecting and delivering scalable cloud-based data solutions leveraging AWS, GCP, and Azure platforms. Proficient in developing efficient data storage and analytics solutions using technologies like AWS S3, Redshift, BigQuery, and Snowflake.
Download ResumeCore Expertise
- Cloud Architecture (AWS, GCP, Azure)
- Data Engineering & ETL
- Big Data Processing
- Machine Learning Integration
- Data Warehousing
- Real-time Analytics
Experience
Key Bank
Data Engineer
March 2022 - Present
Architected comprehensive analytics platforms using AWS services, designed ETL processes, and built interactive dashboards. Integrated machine learning models and implemented real-time data processing solutions while maintaining robust CI/CD pipelines.
Candid Org
Cloud Data Engineer
September 2020 - February 2022
Led development of hybrid cloud solutions using Azure and AWS, implemented real-time analytics dashboards, and automated ETL processes. Developed and maintained data lake architectures while integrating machine learning capabilities.
NextEra Energy FPL
Data Engineer
June 2019 - August 2020
Architected cloud-based data analytics platforms using AWS S3 and GCP BigQuery, designed ETL pipelines, and implemented serverless data transformation solutions. Enhanced system performance through stream processing and real-time analytics.
UltraCab
Data Engineer
August 2016 - December 2017
Designed scalable data integration pipelines, managed Apache Kafka clusters, and implemented Snowflake data warehousing solutions. Created real-time dashboards and optimized data processing workflows.
Education
Texas A & M University
Masters in Electrical & Computer Engineering
January 2018 - May 2019
Gujarat Technological University
Bachelor in Computer Engineering
May 2012 - May 2016
Technical Skills
Scripting Languages
Python, SQL, Bash, PowerShell, Java
Databases
SQL, MySQL, SAP HANA, Amazon RDS, AWS DynamoDB, MongoDB
AWS Services
S3, RDS, Redshift, Glue, Lambda, EMR, Kinesis, EC2, DynamoDB, CloudFormation, IAM
Visualization Tools
Tableau, Power BI, Looker, Microsoft Excel
ETL Tools
Alteryx, Apache NiFi, Apache Airflow, AWS Glue, Azure Data Factory (ADF), Apache Spark, Talend
AI/ML & GenAI
Azure ML Studio, TensorFlow, PyTorch, Keras, OpenAI API, ChatGPT, Hugging Face Transformers
Big Data
Apache Hadoop, Apache Kafka, Apache Flink
Cloud & DevOps
AWS, Azure, GCP, Docker Containers, Kubernetes, Amazon EKS, CI/CD Pipeline, GitLab
Data Warehousing
Amazon Redshift, Google BigQuery, Snowflake
Monitoring & Logging
CloudWatch, Datadog, ELK Stack, Prometheus, Grafana
Recent Works
Here are some of my notable projects showcasing my expertise in data engineering and cloud technologies.

Real-Time Stock Market Data Processing Pipeline
Developed a real-time stock market analytics system that ingests streaming data from multiple APIs using AWS Kinesis and processes it using Apache Flink. The data is stored in AWS Redshift for analytics and displayed via an interactive Tableau dashboard.
- AWS Kinesis
- Apache Flink
- Python
- AWS Lambda
- Redshift
- Tableau

Cloud-Based AI-Powered Customer Sentiment Analysis
Built a sentiment analysis tool that processes customer reviews from multiple sources using AI/ML models from OpenAI and Hugging Face. The pipeline automates data extraction, transformation, and storage using AWS Glue and Apache Airflow, providing dynamic insights in Power BI.
- OpenAI API
- Hugging Face
- AWS Glue
- Apache Airflow
- Power BI

Hybrid Cloud Data Lake with Automated ETL Pipelines
Designed a hybrid cloud data lake architecture to store, process, and analyze structured and unstructured data across AWS and GCP. Used Apache NiFi and Talend for ETL automation, enabling seamless cross-cloud data movement and analytics using Snowflake and BigQuery.
- AWS S3
- GCP BigQuery
- Snowflake
- Apache NiFi
- Talend

Automated Data Quality Monitoring System
Built an automated data quality validation system that monitors and validates incoming data for anomalies, missing values, and inconsistencies. The system is orchestrated with Apache Airflow, leveraging Python for data validation and alerts via Datadog and AWS CloudWatch.
- Python
- Pandas
- Apache Airflow
- Datadog
- AWS CloudWatch

A/B Testing Framework for Marketing Campaigns
Developed an end-to-end A/B testing framework to evaluate the effectiveness of marketing campaigns. Integrated with Google BigQuery for data storage and Python for statistical analysis, with results visualized using Matplotlib.
- Python
- SQL
- Matplotlib
- Google BigQuery
- AWS Redshift

AI-Driven Customer Segmentation & Behavior Analysis
Developed an end-to-end customer segmentation framework using Azure cloud services to analyze customer behavior based on transaction history, demographics, and engagement patterns. The data is ingested, cleaned, and stored in Azure Synapse Analytics, processed using Python-based AI models in Azure ML Studio.
- Python
- Azure Data Factory
- Azure Synapse Analytics
- Azure ML Studio
- Matplotlib
Get In Touch
I love to hear from you. Whether you have a question or just want to chat about design, tech & art — shoot me a message.