H&M Personalized Fashion Recommender

1. Introduction

Online shopping offers shoppers an extensive selection of products to browse through. But with too many choices, customers might not quickly find what interests them or what they are looking for, and ultimately, they might not make a purchase. To enhance the shopping experience, product recommendations are key. More importantly, helping customers make the right choices also has a positive implications for sustainability, as it reduces returns, and thereby minimizes emissions from transportation. This project is initiated by H&M as a Kaggle challenge to provide product recommendations based on previous purchases. For this project, we are given the purchase history of customers across time, along with supporting customer and product metadata. Our goal is to predict what articles each customer will purchase in the 7-day period immediately after the training data end date.

2. Preliminary Literature Review

A comprehensive review and classification of existing works on deep learning recommender models was performed by the article "Deep Learning Based Recommender System: A Survey and New Perspectives". The author classified deep learning based recommendation systems(RS) to two catagories: RS with neural building blocks and RS with deep hybrid models. The catagories are composed of different models like MLP, AE, RNN, CNN, GAN and RNN+CNN, AE+CNN respectively.

The autor also discussed challenges of current RS researches and proposed future research directions. The propsed future research direction includes joint representation learning, more explainable models for both user and researchers, RS models with more layers, machine reasoning, etc.
To the end, the author urged RS researching community to establish a standard dataset and hidden/blinded test set for better model evaluation. According to the study, the baseline and dataset of RS is not as standarized as the CV/NLP community, which will create an inconsistent reporting of scores, with each autor using his/her own dataset and evaluation method. Thus, there's seemingly no consensus on the general ranking of the models.

3. Objectives and expected contributions

We hereby posit two research questions, formalized in the following, and by leveraging a large H&M transactions and product dataset. We provide empirical evidence as well as theoretical grounding to answer these questions:

Research Question 1 (RQ1): What are the SOTA RS models that can be efficiently implemented to this problem?
Research Question 2 (RQ2): How do those models perform against each other in the Kaggle test set?

We expect this work can make the following contributions:

We provide deep learning approaches to identify customers shopping habits and give accurate prediction to the future purchases.
We quantitatively evaluate the SOTA deep learning RS models with the hidden/blinded test set provided by Kaggle. This work can provide evidence for which model works best in real world business situations.

4. Methodology

In this project, we will use the dataset from H&M Kaggle competition: the transaction history from 2018/09/20 to 2020/09/21; the product with price and simple descriptions; images that are corresponding to each article_id; metadata for each customer_id and article_id.
We will analyze existing models from previous research to find the best fit for our problem and data. Then we will use Pythorch to reproduce the models. The successfully reproduced models will be trained with the complete training dataset and then will be tested with the test dataset constructed from the original dataset. The top performers will be submitted to Kaggle to evaluate.
The models are classified as follows:

Recommendation with Neural Building Blocks:

Multilayer Perceptron Based Recommendation
1. Neural Extension of Traditional Recommendation Methods
2. Feature Representation Learning with MLP
3. Recommendation with Deep Structured Semantic Model
Autoencoder Based Recommendation
1. Autoencoder-Based Collaborative Filtering
2. Feature Representation Learning with Autoencoder
Convolutional Neural Networks Based Recommendation
1. Feature Representation Learning with CNNs
2. CNN-Based Collaborative Filtering
3. Graph CNNs for Recommendation
Recurrent Neural Networks Based Recommendation
1. Session-Based Recommendation Without User Identifier
2. Sequential Recommendation with User Identifier
3. Feature Representation Learning with RNNs
Restricted Boltzmann Machine Based Recommendation
Neural Attention Based Recommendation
1. Recommendation with Vanilla Attention
2. Recommendation with Co-Attention
Neural AutoRegressive Based Recommendation
Deep Reinforcement Learning for Recommendation
Adversarial Network Based Recommendation

Recommendation with Deep Hybrid Models：

CNNs and Autoencoder
CNNs and RNNs
RNNs and Autoencoder
RNNs with DRL

5. Brief Project Schedule

Task	Assignee
Preliminary research on existing models	All
Data preprocessing	Lyu Zhang
Constructing models	Zongshuo Wei, Hechen Wang, Yufu Liao
Model training	All
Performance evaluation	All
Research report writing	All

6. Links and references

Project: https://github.com/ZonW/HM-Personalized-Fashion-Recommender
Kaggle: https://www.kaggle.com/c/h-and-m-personalized-fashion-recommendations
Reference: https://arxiv.org/abs/1707.07435

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Proposal.md

Proposal.md

H&M Personalized Fashion Recommender

1. Introduction

2. Preliminary Literature Review

3. Objectives and expected contributions

4. Methodology

Recommendation with Neural Building Blocks:

Recommendation with Deep Hybrid Models：

5. Brief Project Schedule

6. Links and references

Files

Proposal.md

Latest commit

History

Proposal.md

File metadata and controls

H&M Personalized Fashion Recommender

1. Introduction

2. Preliminary Literature Review

3. Objectives and expected contributions

4. Methodology

Recommendation with Neural Building Blocks:

Recommendation with Deep Hybrid Models：

5. Brief Project Schedule

6. Links and references