Subreddit restriction is not an exact name match and includes subreddits with a superset of that name #60

Open
brendon-wong opened this issue Mar 31, 2023 · 3 comments

Comments

@brendon-wong

Hi all! I was trying to restrict a search to a specific subreddit, "web10", and noticed that content from other subreddits whose names contain the text "web10" also comes up. I assume this is not the intended behavior, because it could cause a lot of unrelated results to appear, especially if the subreddit name is a common word like "science", which appears in the names of many subreddits besides r/science.

In this example (https://api.pushshift.io/reddit/search/submission?subreddit=web10), content from r/web10, r/u_Psychological-Web10, and r/u_ronaldo-web10 appears. r/u_Psychological-Web10 is a subreddit, and r/u_ronaldo-web10 is handled differently by Reddit (Reddit displays a page indicating the user has been banned, rather than a page indicating the subreddit doesn't exist) so perhaps it was previously a subreddit.

@eddvrs
Contributor

eddvrs commented Apr 18, 2023

This appears to be an issue with the API itself rather than PMAW. PMAW can only reflect what's returned by Pushshift. There's a similar issue with searches by author.

@brendon-wong
Author

My bad, I even used a link to the API itself as an example of the issue! This issue first arose when using pmaw, so I must've opened the issue here first. Just created an issue in the Pushshift repo itself.

If Pushshift doesn't fix this issue, I wonder if it'd make sense for pmaw to implement filtering to get an exact word match?
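
For illustration, here is a minimal sketch of what such client-side filtering might look like. It is not pmaw's implementation. It assumes each result is a dict with a "subreddit" key, and that the comparison should ignore case (the example above matched "Web10" against "web10"). The function name `filter_exact_subreddit` and the sample ids are hypothetical.

```python
def filter_exact_subreddit(posts, name):
    """Keep only posts whose "subreddit" field equals `name`, ignoring case.

    Assumes each post is a dict; posts without a "subreddit" key are dropped.
    """
    wanted = name.casefold()
    return [
        post for post in posts
        if str(post.get("subreddit", "")).casefold() == wanted
    ]


# Hypothetical sample mirroring the subreddits named in this issue.
sample = [
    {"id": "a1", "subreddit": "web10"},
    {"id": "b2", "subreddit": "u_Psychological-Web10"},
    {"id": "c3", "subreddit": "u_ronaldo-web10"},
]

exact = filter_exact_subreddit(sample, "web10")
print([post["id"] for post in exact])  # prints ['a1']
```

The same filter could in principle be applied to whatever list of results comes back from a search, at the cost of discarding some of the rows the API returned.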

@brendon-wong
Author

Here is the issue in the Pushshift repo: pushshift/api#144
