Involved in gathering data, interpreting statistics, writing reports, and designing charts and
graphs. Executed analysis by utilizing data science platforms as well as creating predictive models
for classification, regression, and clustering by implementing the following algorithms, including
logistic regression, decision tree, decision forest, decision jungle, gradient boosted trees, and
random forest. Performed ABC, XYZ, and RFM marketing analysis, and created dashboards with KPIs and
other business metrics.
Projects completed:
Python - solving Churn task - XGBoost, CatBoost, Random Forest, Light Gradient Boosting Machine,
Gradient Boosting, Extremely Randomized Trees, H2O AutoML: https://github.com/KateBaburina/Predictive-Modeling---Employee-Attrition
Google BigQuery - predictive models - solving Churn task - Linear Regression, XGB: https://github.com/KateBaburina/Predictive-Modeling---BigQuery
Data Science Platforms - predictive models - solving Churn task - Logistic Regression, Decision
Tree, Decision Forest, Decision Jungle, Gradient Boosted Trees, Random Forest: https://github.com/KateBaburina/Predictive-Modeling---Data-Science-Platforms
Final report with Predictive Modeling for Sales Predicting using Time Series (BigQuery, Python,
Tableau): https://github.com/KateBaburina/Predictive-Modeling---Time-Series-Tableau-Visualizations
~~Tableau visualizations and dashboards, clustering, and Time Series:
Customers with the Highest Profit
Customers with the Lowest Profit
Sales Forecast using Tableau TS
Dashboard with KPI and Clustering for Customers
~~QlikSense dashboards:
Retail KPIs
Sales/Sub-Category Insights