From 834419ce65e39fdf6ae5f393bcd6517aeb94410e Mon Sep 17 00:00:00 2001
From: Hongbo Fang <39660755+fhbzc@users.noreply.github.com>
Date: Sun, 12 Mar 2023 17:37:09 -0400
Subject: [PATCH] Update README.md

---
 README.md | 23 +++++++++++++++++++++--
 1 file changed, 21 insertions(+), 2 deletions(-)

diff --git a/README.md b/README.md
index a4b42ef..cab5318 100644
--- a/README.md
+++ b/README.md
@@ -1,7 +1,26 @@
 # Replication package
-This is the replication pacakge for the submitted paper "Understanding information diffusion about open-source projects on Twitter, HackerNews, and Reddit"
+The repository contains the replication package for the paper "Understanding information diffusion about open-source projects on Twitter, HackerNews, and Reddit" in CHASE 2023.
 
-File:
+## File
 replication_main_socialmedia_oss.Rmd (main replication pacakge)
 
 replication_validation.Rmd (robustness check package)
+
+# Open dataset
+
+The repository contains the open dataset of social media posts that mention open source projects in the R and Python ecosystems.
+
+## File
+raw_data_repo2media.zip
+
+## How to use
+Unzip "raw_data_repo2media.zip" to obtain two files, "repo2parsed_social_media_list_com_Python_release.json" and "repo2parsed_social_media_list_com_R_release.json", which correspond to projects in the Python and R ecosystems, respectively.
+
+Each json file contains a dictionary, with the key being the project repo_slug (e.g. *mjskay/tidybayes*), and the value being a list of social media posts.
+Each post is represented by a list of length four, with the meaning of each item shown below:
+
+item[0]: link to the social media post.
+item[1]: time when the post was created.
+item[2]: the platform on which the post appeared.
+item[3]: the corresponding project (same as the key).
+
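
The data layout described in the added "How to use" section could be read along the following lines. This is a minimal sketch rather than part of the replication package: the names `Post`, `parse_posts`, `load_posts`, and `count_by_platform` are hypothetical, and because the README does not document the timestamp format or the exact platform strings, both are kept as loaded.

```python
import json
from collections import Counter
from typing import Any, Dict, List, NamedTuple


class Post(NamedTuple):
    """One social media post, following the four-item layout in the README."""
    link: Any       # item[0]: link to the social media post
    created: Any    # item[1]: creation time (format not documented; kept as-is)
    platform: Any   # item[2]: platform on which the post appeared
    repo_slug: Any  # item[3]: project slug, expected to match the dictionary key


def parse_posts(raw: Dict[str, List[list]]) -> Dict[str, List[Post]]:
    """Convert {repo_slug: [[link, time, platform, slug], ...]} into named tuples."""
    # Post(*item) raises TypeError if an item does not have exactly four fields.
    return {slug: [Post(*item) for item in items] for slug, items in raw.items()}


def load_posts(path: str) -> Dict[str, List[Post]]:
    """Load one of the unzipped release files and parse its posts."""
    with open(path, encoding="utf-8") as fh:
        return parse_posts(json.load(fh))


def count_by_platform(posts_by_repo: Dict[str, List[Post]]) -> Counter:
    """Count posts per platform across all projects."""
    return Counter(
        post.platform for posts in posts_by_repo.values() for post in posts
    )
```

After unzipping the archive, a call such as `count_by_platform(load_posts("repo2parsed_social_media_list_com_R_release.json"))` would give a per-platform tally for the R ecosystem file, assuming the file sits in the working directory.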