National Economics University logo Faculty of Data Science and Artificial Intelligence logo BAI Lab logo

Topic 13

Retail Store Sales Time Series

Medium โ†’ Hard +1 Bonus Point
Retail Store Sales Time Series

Topic 13 โ€“ Retail Store Sales Time Series (Kaggle Competition Data)

Level: Medium โ†’ Hard Goal: Daily sales forecasting for multiple stores/items (panel + time series).

Dataset

Download Instructions

  1. Open the competition link above.
  2. Go to "Data" tab.
  3. Accept competition rules (if needed).
  4. Download train.csv and save to data/store_sales/.

Data Loading

import pandas as pd
df = pd.read_csv("data/store_sales/train.csv")
df["date"] = pd.to_datetime(df["date"])
df = df.set_index("date").sort_index()
print(df.head())

Columns usually include: date, store_nbr, family, sales, onpromotion, oil_price, etc.

Implementation Steps

1. Data Exploration

2. Exploratory Data Analysis (EDA)

3. Data Preprocessing

4. Feature Engineering

5. Model Building

6. Model Evaluation

7. Forecasting

8. Advanced Analysis (Optional)

Expected Deliverables

  1. EDA Report:
  1. Model Results:
  1. Code:

Tips

Starter Notebook

The starter notebook contains installation instructions and data loading code to help you get started with this topic.

๐Ÿ““ View Starter Notebook on GitHub

Note: You can view the notebook directly on GitHub or download it to run locally in Jupyter.

Getting Started

This topic includes:

Navigate to the Topic/13.Store_Sales/ directory to access all resources.

โ† Back to All Topics