Skip to content
/ sotawhat Public
forked from chiphuyen/sotawhat

Returns latest research results by crawling arxiv papers and summarizing abstracts. Helps you stay afloat with so many new papers everyday.

Notifications You must be signed in to change notification settings

42B/sotawhat

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 

Repository files navigation

sotawhat

This script runs using Python 3.

First, install the required packages. This script only requires nltk and PyEnchant.

$ pip3 install -r requirements.txt

If you run the error that the package punkt doesn't exist, download it by going into your Python environment and running:

$ python3

>>> import nltk
>>> nltk.download('punkt')

In MacOS, you can get the SSL error

[nltk_data] Error loading punkt: <urlopen error [SSL:
[nltk_data]     CERTIFICATE_VERIFY_FAILED] certificate verify failed:
[nltk_data]     unable to get local issuer certificate (_ssl.c:1045)>

this will be fixed by reinstalling certificates

$ /Applications/Python\ 3.x/Install\ Certificates.command

To query for a certain keyword, run:

$ python3 sotawhat.py "[keyword]" [number of results]

For example:

$ python3 sotawhat.py "perplexity" 10

If you don't specify the number of results, by default, the script returns 5 results. Each result contains the title of the paper with author and published date, a summary of the abstract, and link to the paper.

We've found that this script works well with keywords that are:

  • a model (e.g. transformer, wavenet, ...)
  • a dataset (e.g. wikitext, imagenet, ...)
  • a task (e.g. 'language model', 'machine translation', 'fuzzing', ...)
  • a metric (e.g. BLEU, perplexity, ...)
  • random stuff

About

Returns latest research results by crawling arxiv papers and summarizing abstracts. Helps you stay afloat with so many new papers everyday.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%