Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fetcher tests failing #6369

Closed
tobiasdiez opened this issue Apr 28, 2020 · 24 comments
Closed

Fetcher tests failing #6369

tobiasdiez opened this issue Apr 28, 2020 · 24 comments
Assignees
Labels
bug Confirmed bugs or reports that are very likely to be bugs fetcher type: code-quality Issues related to code or architecture decisions
Milestone

Comments

@tobiasdiez
Copy link
Member

Some of the fetcher tests are failing (ISBN, Inspire, ...)
https://github.com/JabRef/jabref/actions?query=workflow%3A%22Fetcher+Tests%22

@tobiasdiez tobiasdiez added this to the v5.1 milestone Apr 28, 2020
@tobiasdiez tobiasdiez added bug Confirmed bugs or reports that are very likely to be bugs type: code-quality Issues related to code or architecture decisions labels Apr 28, 2020
@calixtus calixtus mentioned this issue Jun 5, 2020
5 tasks
@Siedlerchr
Copy link
Member

@Siedlerchr Siedlerchr added the good first issue An issue intended for project-newcomers. Varies in difficulty. label Jun 6, 2020
@koppor
Copy link
Member

koppor commented Jun 9, 2020

API keys: Howto for getting a sandbox key for developers

ADs-Fetcher: This is the PR switchting to the new API, isn't it? --> #5501

@Siedlerchr
Copy link
Member

Just got mail that the INSPIRE has now an official API:
https://github.com/inspirehep/rest-api-doc

@koppor
Copy link
Member

koppor commented Jun 23, 2020

@Siedlerchr
Copy link
Member

Springerlink is giving m,e 401 locally when I debug

@koppor
Copy link
Member

koppor commented Jul 27, 2020

@DominikVoigt Can you confirm for SpringerLink? 😅

@DominikVoigt
Copy link
Contributor

DominikVoigt commented Jul 27, 2020

If I substitute the API key that is hardcoded into the SpringerLink fetcher with my own API key all tests pass.

@koppor koppor removed the good first issue An issue intended for project-newcomers. Varies in difficulty. label Jul 27, 2020
@koppor
Copy link
Member

koppor commented Aug 2, 2020

Springer mailed us on 2020-04-29:

Dear User,

Thank you for using Springer Nature APIs! While we are most pleased at your interest, we see traffic that is very high and well over the limit of 300 calls/min and 5000/day. The limit is 300 hits/min and 5000/day. Can you please adjust your code to stay within these limits? Please advise/confirm so we may restore service.

Sincerely,
SpringerNature API Team

Internal stats show "On November 28, your application called the API more than 13,000 times."

@koppor
Copy link
Member

koppor commented Aug 2, 2020

State as of 2020-08-02

Source: https://github.com/JabRef/jabref/runs/937189942

  • Grobid: java.net.ConnectException: Connection timed out
  • IEEE: Results differ
  • CollectionOfComputerScienceBibliographiesFetcher: Assertion error
  • LibraryOfCongressTest: Results differ
  • CiteSeerTest: no answer
  • DoiFetcherTest: Results differ
  • AstrophysicsDataSystemTest: Results differ
  • Springer: @DisabledOnCIServer("Disable on CI Server to not hit the API call limit")

@Siedlerchr
Copy link
Member

We should add the Springer key to the build secrets as well

@koppor
Copy link
Member

koppor commented Aug 2, 2020

I will aquire a sepearate key for Springer and enable Springer tests in a separate PR.

@koppor
Copy link
Member

koppor commented Aug 2, 2020

Rate limit of IEEE

  • 10 calls / second
  • 200 calls / day

Source: Key agreement

@Siedlerchr
Copy link
Member

We should execute the fetcher tests maybe once a month on CI and not on every PR

@koppor
Copy link
Member

koppor commented Aug 2, 2020

Since a few months, we DO NOT execute the fetcher tests on each PR.

We currently the fetcher tests, when

  • something in the fetcher code changed
  • each wednesday

See https://github.com/JabRef/jabref/blob/master/.github/workflows/tests-fetchers.yml

@koppor
Copy link
Member

koppor commented Aug 2, 2020

The discussion on this move can be found at #5807.

@calixtus
Copy link
Member

calixtus commented Aug 7, 2020

  • IEEE returns a bad URL: https:/ieeeexplore… (Note: only one slash). Problem seems to be serverside
  • LibraryOfCongress returns bad year ( year = {c201} instead of year = {2011} )
  • The JabRef BibEntry builder seems to automatically add an url-field, which is not included in the returned result from the server. Will do some research on this.
  • ADS seems to be identical on first sight. Not sure why failing, maybe something I overlooked... Will also do some research on this.

@Siedlerchr
Copy link
Member

citesser => journaltitle instead of journal

@koppor
Copy link
Member

koppor commented Aug 8, 2020

@DominikVoigt added a conversion layer at #6687. Thus, it should not matter whether journaltitle (BibLaTeX) or journal (BibTeX) is returned. -- the layer converts to the format required by the target database

This was referenced Aug 24, 2020
@koppor
Copy link
Member

koppor commented Aug 25, 2020

@Siedlerchr Thank you for the hint --> @DominikVoigt's layer doesn't help here as the fetcher tests directly check the fetcher result - and not the reformatting by the intermediate layer.

@koppor
Copy link
Member

koppor commented Aug 26, 2020

Refs #3854 (as background information)

@koppor koppor modified the milestones: v5.1, v5.2 Aug 26, 2020
@koppor
Copy link
Member

koppor commented Aug 26, 2020

  • Grobid runs in a time out. @LinusDietz can you check please?
  • org.jabref.logic.importer.fetcher.ArXivTest has some differences, need to check later
  • IEEE and Google (currently) do not run on the CI

@Siedlerchr
Copy link
Member

Currently there seems to be a latex/unicode conversion issue in the INPIRE test

 org.opentest4j.AssertionFailedError: expected: <[@mastersthesis{Diez:2014ppa,
    archiveprefix = {arXiv},
    author = {Diez, Tobias},
    eprint = {1405.2249},
    primaryclass = {math-ph},
    school = {Leipzig U.},
    title = {Slice theorem for Fr'echet group actions and covariant symplectic field theory},
    year = {2013},
    _jabref_shared = {sharedId: -1, version: 1}
  }]> but was: <[@mastersthesis{Diez:2014ppa,
    archiveprefix = {arXiv},
    author = {Diez, Tobias},
    eprint = {1405.2249},
    primaryclass = {math-ph},
    school = {Leipzig U.},
    title = {Slice theorem for Frtextbackslash{}'echet group actions and covariant symplectic field theory},
    year = {2013},
    _jabref_shared = {sharedId: -1, version: 1}
  }]>
      at org.jabref.logic.importer.fetcher.INSPIREFetcherTest.searchByQueryFindsEntry(INSPIREFetcherTest.java:45)

@koppor
Copy link
Member

koppor commented Dec 21, 2020

  • Jstor: Result differs
  • CiteSeer: Result differs
  • TitleFetcher: Change format
  • Disable CompositeFetcherTest on CI
  • Grobid: Server down


@koppor
Copy link
Member

koppor commented Dec 24, 2020

We disabled ACM, Google Scholar and JSTOR at #7229. We have different follow-up tickets and PRs. Thus, I can close this issue.

@koppor koppor closed this as completed Dec 24, 2020
@koppor koppor moved this to Done in Prioritization Nov 10, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Confirmed bugs or reports that are very likely to be bugs fetcher type: code-quality Issues related to code or architecture decisions
Projects
Archived in project
Development

No branches or pull requests

5 participants