Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Information on global article collection used #9

Open
kermitt2 opened this issue Nov 16, 2021 · 6 comments
Open

Information on global article collection used #9

kermitt2 opened this issue Nov 16, 2021 · 6 comments
Assignees
Labels
enhancement New feature or request

Comments

@kermitt2
Copy link
Collaborator

kermitt2 commented Nov 16, 2021

We need to give more information about the collection of publications used for text mining extraction:

  • global information on the total of processed publications, its distribution per year, etc.
  • when providing for a software the number of documents mentioning the software, we should add background information on the number of documents (per year, etc.)

For this, the harvester should generate a description of the collection to be ingested by the KB together with the papers.

@jameshowison
Copy link

Commenting to follow here :)

@kermitt2
Copy link
Collaborator Author

For a quick banner solution, what about this:

Screenshot from 2023-02-16 20-32-20

Maybe the message could be better expressed?

https://cloud.science-miner.com/software_kb/frontend/index.html

@jameshowison
Copy link

jameshowison commented Feb 16, 2023 via email

@kermitt2
Copy link
Collaborator Author

Random 2.5M open access papers from Unpaywall, I add that.

All literature yes, considering at least 125M publications, a conservative guesstimate!

@kermitt2
Copy link
Collaborator Author

What about this:

Screenshot from 2023-02-16 22-27-38

@jameshowison
Copy link

jameshowison commented Feb 18, 2023 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants