I was given the assignment to consider self-reported recommendation data for a product. The broad topic was to understand recommendation dynamics.
The data consisted of five variables:
buyer_purchase_ts
: Timestamp of the buyer's ("recommendee's") purchasebuyer_id
: Unique ID for each "buyer"recommender_id
: Unique ID for each recommenderbuyer_trial_starts_ts
: Timestamp for a buyers trial startrecommender_trial_start_ts
: Timestamp for a recommenders trial start, if available
I was also told that the standard pathway to a purchase is through a trial.
The assignment asked me to consider two questions:
- Explore the data and present the three, in my opinion, most interesting findings
- Present the three hypotheses I would like to explore next and explain why
I could present my solution in any way I deemed appropriate.
My solution consisted of a presentation available here and a supplementary Jupyter notebook available here.
While the company was so generous to grant me permission to share this work, they asked that I remove any mentioning of the company's name. Thus, the name is either replaced with Company X or I speak of the product or the company.