Favorita Benchmarking

Experiment plan and results	Replicating the results of the Temporal Fusion Transformer on the Favorita dataset https://arxiv.org/pdf/1912.09363.pdf
Experiment owner	Kriti Mahajan
Reviewers	Isaac Godfried
Approver
Optimizely link
Jira ticket(s)
Status	IN REVIEW / IN PROGRESS / COMPLETE

Stakeholder summary

📋 Experiment planning

Overview: Replicating the results of the Temporal Fusion Transformer on the Favorita dataset
https://arxiv.org/pdf/1912.09363.pdf

Current Roadblock:
- Fitting data into memory for modeling

Tasks Completed So Far

Step 1: Batchwise Data Processing for Preprocessing
The Favorita dataset is too large to fit into memory in one go, so it is processed in chunks of 300,000 rows.
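A minimal sketch of the chunked processing described in Step 1, using only the Python standard library so that it stays self-contained. The helper names (`iter_chunks`, `process_in_chunks`) and the sample column names are assumptions for illustration, not the code used in the notebook; `pandas.read_csv` with its `chunksize` argument would serve the same purpose.

```python
import csv
import io
from itertools import islice

CHUNK_SIZE = 300_000  # chunk size stated in Step 1


def iter_chunks(rows, chunk_size=CHUNK_SIZE):
    """Yield lists of at most `chunk_size` rows from any row iterator."""
    rows = iter(rows)
    while True:
        chunk = list(islice(rows, chunk_size))
        if not chunk:
            return
        yield chunk


def process_in_chunks(file_obj, process_chunk, chunk_size=CHUNK_SIZE):
    """Stream a CSV file and apply `process_chunk` to one chunk at a time."""
    reader = csv.DictReader(file_obj)
    return [process_chunk(chunk) for chunk in iter_chunks(reader, chunk_size)]


# Tiny in-memory stand-in for the real file (hypothetical columns and values).
sample = io.StringIO(
    "date,store_nbr,item_nbr,unit_sales\n"
    "2015-01-01,1,101,3\n"
    "2015-01-02,1,101,5\n"
    "2015-01-03,1,101,2\n"
)
chunk_sizes = process_in_chunks(sample, len, chunk_size=2)
```

Only one chunk is held in memory at a time; here `process_chunk` is simply `len`, but in practice it would be the per-chunk preprocessing function.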

Step 2: Data Preprocessing
In this dataset, each product number-store number pair is treated as a separate entity and is represented by embeddings of the following categorical variables:
```
['holiday_type',
'locale',
'locale_name',
'description',
'transferred',
'city',
'state',
'store_type',
'cluster',
'family',
'class',
'perishable']
```
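Before categorical variables can be fed to an embedding layer, each distinct value has to be mapped to an integer index. The following is a minimal sketch of that step under stated assumptions: the helper names (`build_vocabularies`, `encode`) and the sample rows are hypothetical, and only two of the columns listed above are used to keep the example short.

```python
def build_vocabularies(records, columns):
    """Map each distinct value of each categorical column to an integer index."""
    vocabs = {col: {} for col in columns}
    for record in records:
        for col in columns:
            vocab = vocabs[col]
            value = record[col]
            if value not in vocab:
                vocab[value] = len(vocab)  # next free index
    return vocabs


def encode(record, vocabs):
    """Return the integer indices for one record, in column order."""
    return [vocabs[col][record[col]] for col in vocabs]


# Hypothetical sample rows; the real data has all twelve columns listed above.
records = [
    {"city": "Quito", "family": "GROCERY I"},
    {"city": "Guayaquil", "family": "GROCERY I"},
]
vocabs = build_vocabularies(records, ["city", "family"])
encoded = [encode(r, vocabs) for r in records]
```

The size of each vocabulary (`len(vocabs[col])`) gives the number of rows needed in that column's embedding table.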

- We treat each product number-store number pair as a separate entity.
- We include an additional 'open' flag to denote whether data is present on a given day.
- Data is resampled at regular daily intervals, imputing any missing days using the last available observation.
- We apply a log-transform to the sales data and adopt z-score normalization across all entities.
- Records with any missing value are dropped.
- The training set is made up of samples taken between 2015-01-01 and 2015-12-01. The validation set consists of samples from the 30 days after the training set. The test set consists of all entities over the 30-day horizon following the validation set.
- We consider log sales, transactions, and oil to be real-valued and the rest to be categorical.
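The preprocessing steps listed above can be sketched for a single entity as follows. This is an illustration under stated assumptions, not the notebook's code: the function names are hypothetical, `log1p` is assumed as the log-transform (so zero-sales days stay finite, with sales assumed non-negative), and the training end date is assumed to be inclusive.

```python
import math
from datetime import date, timedelta
from statistics import mean, pstdev


def resample_daily(observations):
    """Fill a {date: sales} series to daily frequency.

    Returns (day, sales, open_flag) tuples. Missing days carry the last
    available observation forward and get open_flag = 0.
    """
    days = sorted(observations)
    filled, last_value = [], None
    day = days[0]
    while day <= days[-1]:
        if day in observations:
            last_value = observations[day]
            filled.append((day, last_value, 1))
        else:
            filled.append((day, last_value, 0))
        day += timedelta(days=1)
    return filled


def log_transform(sales):
    # Assumption: log(1 + x), with non-negative sales values.
    return [math.log1p(s) for s in sales]


def zscore(values):
    """Standardize with the mean and std of the pooled `values`."""
    mu = mean(values)
    sigma = pstdev(values) or 1.0  # guard against a constant series
    return [(v - mu) / sigma for v in values]


def split_by_date(rows, train_start, train_end, horizon_days=30):
    """Split rows (date first) into train / validation / test by date."""
    valid_end = train_end + timedelta(days=horizon_days)
    test_end = valid_end + timedelta(days=horizon_days)
    train = [r for r in rows if train_start <= r[0] <= train_end]
    valid = [r for r in rows if train_end < r[0] <= valid_end]
    test = [r for r in rows if valid_end < r[0] <= test_end]
    return train, valid, test


# Hypothetical three-observation series with one missing day (2015-01-03).
obs = {date(2015, 1, 1): 3.0, date(2015, 1, 2): 5.0, date(2015, 1, 4): 2.0}
filled = resample_daily(obs)
normalized = zscore(log_transform([sales for _, sales, _ in filled]))
rows = [(day, z, flag) for (day, _, flag), z in zip(filled, normalized)]
train, valid, test = split_by_date(
    rows, date(2015, 1, 1), date(2015, 1, 2), horizon_days=1
)
```

For the actual experiment the call would use `train_start=date(2015, 1, 1)`, `train_end=date(2015, 12, 1)`, and the default 30-day horizon, with the z-score statistics computed over the values of all entities rather than a single series.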

Step 3: Modelling
Use DA-RNN with transfer learning. The notebook can be found at:
https://colab.research.google.com/drive/15kyHxeZCQLx9PpFSZpB8TKECiCZj_Pg7

Hypothesis

We hypothesize that DA-RNN with transfer learning, or a Transformer model with transfer learning, will decrease MSE/RMSE because of the incorporation of embeddings.

Metrics

MSE
RMSE
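
For reference, both metrics can be computed as in the sketch below. The function names and sample values are illustrative only, and the page does not say whether the metrics are evaluated on the normalized log sales or on the original scale.

```python
import math


def mse(y_true, y_pred):
    """Mean squared error between two equal-length sequences."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


def rmse(y_true, y_pred):
    """Root mean squared error: the square root of the MSE."""
    return math.sqrt(mse(y_true, y_pred))


# Made-up values: squared errors are 4, 0, 0, 4, so the MSE is 2.0.
y_true = [1.0, 2.0, 3.0, 4.0]
y_pred = [3.0, 2.0, 3.0, 2.0]
example_mse = mse(y_true, y_pred)
example_rmse = rmse(y_true, y_pred)
```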

Targeting

Variations

Pre-analysis

Notes

📊 Results

Experiment start	26 Nov 2020
Experiment end	26 Nov 2020
Link to results in Optimizely
Conclusion	INCONCLUSIVE / HYPOTHESIS PROVED

	A: Control	B: Variation	Change	A: Control	C: Variation	Change
Cohort size
Primary metric			Δ=, p-value=, power=, confidence=			Δ=, p-value=, power=, confidence=
Other metrics

✨ Conclusions

Highlights

Primary goal

Takeaways

Follow-up