Featured Projects

A selection of my recent analytical work, from machine learning to interactive web apps.

Python RFM Analysis Time Series

Retail Customer Segmentation

Executed a comprehensive clustering analysis on a dataset of 9,800+ retail customers. By applying RFM (Recency, Frequency, Monetary) analysis and seasonal trend modeling, I successfully categorized the customer base into high-value, at-risk, and dormant tiers. This strategic segmentation directly informed and improved the timing and targeting of marketing campaigns.

Streamlit ARIMA Prophet

Interactive Sales Forecasting Web App

Engineered and deployed an end-to-end interactive web application tailored for small business owners. The platform allows users to upload raw sales data, which is then automatically processed through advanced time-series forecasting models (ARIMA and Facebook Prophet) to generate actionable future sales predictions and intuitive visual reports.

Python Logistic Regression ANOVA ETL

Large-Scale Demographic Risk Analysis

Engineered a robust data pipeline to ingest, merge, and clean raw .xpt files spanning 12 years of complex survey data. This project was conducted using NHANES data. I deployed advanced statistical methods including logistic regression, ANOVA, and Chi-square tests to successfully pinpoint key lifestyle predictors and health risk factors across demographics.

Python NLP (TextBlob) Logistic Regression

Employee Sentiment Survey Analysis

Conducted an in-depth analysis of 1,259 internal survey responses. Utilizing Natural Language Processing (TextBlob) for automated sentiment scoring alongside logistic regression, I discovered a statistically significant correlation proving that strong management support is the primary predictor of positive employee behavior and retention.

SQL Database Design Data Analytics

Superstore Database Architecture & Analytics

Architected a comprehensive relational database schema for a superstore, modeling stocks, categories, receipts, customers, and sales. Developed advanced SQL queries employing window functions, CTEs, and aggregations to track running totals, calculate category sales contributions, pinpoint inactive customers, and monitor price volatility over time. Engineered a clean view for seamless CSV data pipeline exports.

Looker Studio BI Dashboard Stakeholder Reporting

Executive Performance & Operations Dashboard

Designed a clear, highly actionable Looker Studio dashboard that simplifies complex daily operations into digestible insights (including peak customer hours, revenue vs. spendings, and due payments). Configured automated, scheduled weekly email deliveries directly to stakeholders to ensure consistent visibility into core KPIs without any manual intervention. Included an integrated FAQs/documentation section to streamline data interpretation for non-technical executives.

Python Data Merging A/B Testing

Quantium Retail Strategy Analysis

Conducted extensive retail transaction and customer behavior analysis. Processed complex datasets across 272 stores and 114 products to identify key demographics ("OLDER SINGLES/COUPLES") driving the highest total sales volume. Formulated data-driven recommendations for targeted marketing on top-performing products. Evaluated specific store trials (stores 77, 86, and 88) against control stores to assess total revenue, customer count, and average transactions per customer to guide broader rollout decisions.

Data Cleaning Feature Engineering Python

SLU Opportunity Data Enhancement

Spearheaded rigorous data cleaning and advanced feature engineering for the SLU Opportunity Wise dataset. Systematically identified massive data gaps (e.g., 99% missing values in reward-related columns) and removed duplications. Extracted immense value from existing date columns by engineering entirely new metrics, including exact Age, Engagement Duration, Time in Opportunity, and structured cohort analysis features (signup month/year), outputting a highly refined data pipeline.

EDA Demographic Profiling Statistical Insights

Adult Demographic Exploratory Analysis

Conducted a comprehensive exploratory data analysis (EDA) on adult census demographic data. Investigated the complex intersectionality of race, age, education, and geographic location with earning potential (income > 50K). Extracted key insights, such as revealing that 10.65% of individuals with advanced degrees earn over 50K, while analyzing the nuanced relationships between minimum work hours and total compensation across diverse geographic markets like the US and India.