Exploratory analysis
Segmentation, trends, and clear charts that explain why a metric moved.
Hi, I'm
Product Analyst · Analytics & Data Science
EDA ML Metrics & KPIs Experimentation
4+ years turning raw data into models, dashboards, and decisions that move product and revenue. From funnel diagnostics to churn and fraud models — I care about the so what for the business.
Real numbers from production work — not side projects.
I'm a Product Analyst based in Bengaluru, Karnataka with 4+ years of full-time experience in the analytics domain. I have a B.Tech from IIT (ISM) Dhanbad and have worked across startups and growth-stage companies including Dunzo, FloBiz, and ALLEN Digital.
I help teams make smarter, faster, and data-backed decisions. Whether it's building an XGBoost model to identify at-risk students, optimizing a credit funnel from 2% to 7%, or consolidating 134 dashboards down to 39 — I focus on the "so what?" that moves the business forward.
My work spans product analytics, ML-driven predictions, data pipeline engineering (SparkSQL, Airflow, Hevo), and strategic analysis at the founder's office level.
An end-to-end analytics loop — the same rhythm behind my portfolio repos and my product work.
Clarify the business question, success metric, and trade-offs (e.g. precision vs recall for fraud).
Source tables, validate joins, handle missing values, and sanity-check distributions before modeling.
Visualize segments, correlations, and leakage risks; turn patterns into testable hypotheses.
Baselines first, then sklearn / XGBoost-style models, with proper splits and imbalance handling when needed.
Metrics that match the use case — not just accuracy — plus confusion matrices, ROC, and feature importance.
READMEs, notebooks, or dashboards so stakeholders see the insight and the recommended next action.
Segmentation, trends, and clear charts that explain why a metric moved.
Classification & regression with scikit-learn — from churn and fraud to cost prediction.
SMOTE, recall-focused evaluation, and business-aligned thresholds for rare events.
Streamlit and BI-style views that make metrics self-serve and actionable.
From founder's office strategy roles to product analytics — always at the intersection of data and business impact.
Bengaluru, Karnataka
Bengaluru, Karnataka
Bengaluru, Karnataka · 1 year 6 months
Khopoli, Maharashtra
Dhanbad, Jharkhand
Dhanbad, Jharkhand
Quick, private utilities that run entirely in your browser — no sign-up, no AI, no server. Useful for experiment readouts, retention tables, and readable SQL.
100% client-side · $0 infraTwo-proportion z-test. Enter visitors and conversions for each variant. Large-sample normal approximation — for small samples, treat as directional only.
How many users per variant do you need before running the test? Enter your baseline conversion rate, minimum detectable effect (MDE), desired power, and significance level.
Paste minified or messy JSON on the left. Click Format to pretty-print with syntax highlighting on the right. If the JSON is an array of objects you can also download it as a CSV file.
Indents SELECT columns one-per-line, AND / OR / ON under their clause, and colour-highlights keywords, functions, strings, and numbers. Heuristic — works well for standard analytical SQL. Copy button outputs plain text.
CSV format (wide): first column = cohort name; remaining columns = period 0, 1, 2… (counts or retention %). Example header: cohort,0,1,2,3. Values between 0 and 1 are treated as fractions.
Public repos with notebooks, EDA, and models — each README summarizes methods, metrics, and business takeaways.
Interactive Streamlit dashboard on the Superstore dataset with KPI cards, trend charts, discount-vs-profit analysis, and real-time filters.
ML pipeline predicting customer churn using Logistic Regression & Random Forest. Surfaces top reasons customers leave and concrete retention actions.
Customer & Revenue Intelligence Report — deep-dive into an e-commerce dataset with segmentation, revenue driver identification, and growth recommendations.
Regression models predicting medical insurance costs — EDA on what drives healthcare spending plus business recommendations for insurers.
ML pipeline tackling extreme class imbalance (0.17% fraud) using SMOTE, Logistic Regression, Random Forest & Gradient Boosting with focus on recall optimization.
Classification models predicting student pass/fail outcomes using demographic, social, and academic features from the UCI Student Performance dataset.
Educational content on analytics — from experimentation to ML, metric design, and data storytelling.
In-depth write-ups on analytics concepts, SQL patterns, A/B testing, cohort analysis, and real-world data lessons are on their way.
Analytics games for data people — SQL quiz, daily word game & metric blitz.
Clause & Effect · Syntax Circuit · The JOIN Tribunal
10 questions · 10 minutes · no lifelines. Read real SQL, spot errors, predict output. Enter your name and go — certificate at the finish line.
You scored . You are operating at professional analyst level on this SQL set.
Based on the questions you missed, spend time on:
Copied — paste into LinkedIn if the share window did not open.
Guess the 5-letter SQL / data keyword. 6 tries. New word every day.
5 KPI & SQL questions. 30 seconds on the clock. No looking it up.