Shivam Bhatt

Abu Dhabi.

About

Data-driven professional with 4+ years of experience in analytics, data engineering, and delivering actionable insights to solve complex business problems. Skilled in designing and maintaining scalable data pipelines, conducting in-depth data analysis, and supporting cross-functional decision-making. Proven track record in leading end-to-end data projects across finance, education, e-commerce, and pharmaceutical sectors. Proficient in SQL, Python, Spark, AWS, Databricks, and modern BI tools. Passionate about transforming data into strategic value and continuously improving systems to drive impact.

Work

G42 (Presight)
|

Senior Data Analyst

Highlights

Worked with the Financial Investigative Unit to develop an advanced analytics platform that streamlined the resolution of STRs and SARs. Engineered advanced methodologies for AML/CTF risk scoring, structuring analysis, and shell company detection, leading to a 30% improvement in case triaging efficiency and enhanced identification of high-risk entities.

Designed and implemented a scalable data engineering workflow, enabling efficient ingestion of high-volume datasets. Developed Smart Uploader tool using PySpark to automate data ingestion. Built a robust Data Quality Framework to support risk assessment methodologies - reducing manual effort by 80% and significantly enhancing data reliability across critical decision-making processes.

Led a team of 4 data analysts and modelers, delivering analytical solutions with 100% on-time completion. Oversaw client requirement, task prioritization, and quality assurance to ensure high-impact, data-driven outcomes for client.

Applied advanced statistical techniques, developed custom methodologies and implemented machine learning algorithms to solve complex business use cases—resulting in a 15% improvement in prediction accuracy, 30% reduction in operational inefficiencies, and delivering insights that directly influenced strategic decisions.

Zomato (Blinkit)
|

Data Analyst

Highlights

Supported all analytics needs including a/b testing, ad-hoc requests, root cause analysis, and custom dashboards to aid business. Worked extensively with SQL and Python to solve use cases.

Developed algorithms for weighted availability, demand forecasting and capacity planning, contributing to a 10% month-over-month increase in platform growth.

Planned growth campaigns by targeting various customer segments' behaviour and key "aha" moments to create a retention framework, resulting in a 15% increase in customer activation.

Partnered with cross-functional teams including product development, operations and marketing to design data-driven solutions, resulting in a 20% improvement in fill rates and a 4% reduction in dump.

Established and maintained data pipelines and infrastructure to ensure that data is accurate, timely and dependable for analysis and decision-making.

Led the analysis of critical e-commerce business metrics and trends, providing data-driven recommendations that improved sales by 15% and maintained over 90% availability.

ZS Associates
|

Data Analyst (Business Technology Solutions Associate)

Highlights

Analyzed data from SAP, Hyperion, and Rapid Response using SQL and PySpark to build reporting tables for Amgen Pharmaceuticals, addressing supply chain and cost of sales challenges. Contributed to an estimated $2.5M in annual savings through improved forecasting accuracy.

Designed and managed infrastructure and backend workflow for clinical trial web applications using AWS, Aurora, EEA APIs, and Databricks. Developed data pipelines with Airflow.

Led the migration of the DSC project using Databricks, Git, Postgres, and Jenkins to automate deployment workflows—reducing deployment time by 50% and improving release reliability across environments. Developed robust IQ scripts to ensure seamless transitions and system integrity.

Developed common utility functions library utilized across various modules and projects and a Python framework to generate summary dashboards of business rules compliance.

Designed data models, source-to-target mappings (STTM), table interactions, and architectural designs, including both low-level and high-level designs.

Newgen Software
|

Data Analyst Intern

Highlights

Automated document-centric, industry-specific processes using Omnidocs and Newgen's BPM tools, enabling end-to-end process automation and continuous operational improvement.

Maruti Suzuki India Ltd
|

Data Analyst Intern

Highlights

Built a MySQL-based data warehouse to manage inventory for Maruti retailers, implementing data models and triggers to optimize data structure and automate real-time information updates.

UNV India
|

Digital Marketing Team Leader

Highlights

Led a team of 10 to develop data-driven marketing strategies to drive community engagement and event participation, resulting in a 35% increase in outreach effectiveness and participant sign-ups.

Education

Maharaja Agrasen Institute of Technology

Bachelor of Technology

Information Technology

Grade: 8.8/10

KIIT World School

CBSE(Class XII)

Grade: 90.8%

KIIT World School

CBSE(Class X)

Grade: 10/10

Skills

Programming Languages and Data Tools

C, C++, Python, SQL, Advance MS Excel, Tableau.

Big Data Technology and Tech Stacks

Hadoop, Spark, Hive, PySpark.

Database

MySQL, Oracle, PostgreSQL.

Cloud Services

AWS (S3, EC2, Glue), Databricks CLI, Azure Synapse (Azure Data Factory, Datalake Gen2).

Orchestration Tools

Airflow, Jenkins, Gitlab Inbuilt CI/CD.

Agile and SDLC

Admin of DevOps Team, Worked on Jira, Kanban Workflow, Agile Scrum.

Others

Machine Learning (Regression, Clustering), Scikit-learn, Model Development, Predictive Analytics.

MOOCs

Advanced SQL (HackerRank Certified), Python Specialization (University of Michigan), Fundamentals of Analytics (AWS), Introduction to Machine Learning (Kaggle), Microsoft Azure (DP-900).

Projects

Data Quality Framework

Summary

Designed and implemented a scalable Data Quality Engine using PySpark to automate rule-based validations across enterprise datasets, covering all 7 key data quality dimensions. The framework features advanced pattern matching, reference checks, and DQ scorecard generation—reducing manual effort by 80% and enhancing auditability and data profiling efficiency.

Customer Segmentation

Summary

Designed and implemented data analysis framework to profile customer behavior and drive personalized mar- keting strategies using EDA, statistical analysis, and KMeans clustering with PCA. Segmented 2,000+ customers based on demographics and engagement, engineered behavioral metrics like loyalty scores and churn indicators, and visualized insights using Seaborn and Matplotlib to support targeted retention and campaign optimization.

COVID-19 Analysis

Summary

Conducted in-depth analysis of 60,000+ global COVID-19 records from Johns Hopkins University, integrating them with World Happiness Report 2020 indicators across 150+ countries. Applied correlation and regression analysis to reveal inverse relationships between pandemic severity and factors like healthcare access, socio- economic disparity, and social stability.

Motor Vehicle Collision Analysis in New York City (Streamlit Web App)

Summary

Developed an interactive web app using Streamlit, Plotly, and Pydeck to analyze 1.5M+ NYC motor vehicle collisions. Enabled dynamic filtering, geospatial mapping, and animated heatmaps to visualize traffic patterns and accident hotspots, enhancing insights for both technical and non-technical users. Showcased end-to-end skills in data processing, visualization, and dashboard development.