Search for homo naledi fails (v. 1.0.3) #100

Closed
pvanheus opened this issue Jun 28, 2017 · 4 comments

@pvanheus

With the eLife datasource enabled, search for "Homo naledi" fails.

It should return papers such as DOI 10.7554/eLife.10627.

As discussed on IRC, this might be related to the title containing <italics>Homo naledi</italics> markup.

@blahah
Member

blahah commented Jun 28, 2017

Thanks for reporting this. As discussed, I think the tokeniser we run before indexing the documents is failing to strip the XML tags. Should be a simple fix (famous last words!).
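
To illustrate the suspected failure mode, here is a minimal Python sketch. It is not the actual tokeniser (the search engine itself is not written in Python); the `naive_tokenise` function and the marked-up title string are assumptions made up for illustration. A tokeniser that only splits on whitespace leaves each tag glued to its neighbouring word, so a query term never matches the indexed token.

```python
def naive_tokenise(text):
    """Lowercase and split on whitespace only; markup is left untouched."""
    return text.lower().split()


# Hypothetical title with inline markup, similar to the one discussed above.
title = "<italics>Homo naledi</italics>, a new species of the genus Homo"

indexed = set(naive_tokenise(title))
query = naive_tokenise("Homo naledi")

# "homo" still matches the later, untagged "Homo", but "naledi" is only
# indexed as "naledi</italics>,", so a query needing both terms finds nothing.
matches = [term in indexed for term in query]
print(sorted(indexed))
print(matches)
```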

blahah added the bug label Jun 28, 2017
blahah added this to the v1.1 milestone Jun 28, 2017
blahah self-assigned this Jun 28, 2017
@blahah
Member

blahah commented Jun 28, 2017

Fixed in the search engine by adding a pre-processing step that strips XML tags: blahah/yunodb@15a706d
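
A pre-processing step of this kind could look roughly like the Python sketch below. It is not the code from the linked commit; `strip_tags`, `tokenise`, and the regular expression are assumptions chosen for illustration, and a regex like this is only suitable for simple inline markup, not arbitrary XML.

```python
import re

# Matches anything that looks like an opening, closing, or self-closing tag.
TAG_RE = re.compile(r"</?[A-Za-z][^>]*>")


def strip_tags(text):
    """Replace simple XML/HTML tags with a space, then collapse whitespace."""
    without_tags = TAG_RE.sub(" ", text)
    return " ".join(without_tags.split())


def tokenise(text):
    """Strip markup, lowercase, and extract alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", strip_tags(text).lower())


# Hypothetical marked-up title, as in the report above.
title = "<italics>Homo naledi</italics>, a new species"
print(tokenise(title))
```

Tags are replaced with a space rather than removed outright so that words separated only by a tag are not fused together; the trade-off is that markup inside a word would split it into separate tokens.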

blahah closed this as completed Jun 28, 2017
@pvanheus
Author

Seems not to fix things:

[image]

Is there an index that needs to be rebuilt, or a cache that needs to be emptied, or something?

@blahah
Member

blahah commented Jun 29, 2017

Ah yes, you'll need to delete the search index:

rm -rf ~/.sciencefair

That will nuke the whole thing.
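
Since that command removes the whole directory, a more cautious alternative is to move it aside rather than delete it. The Python sketch below is only an illustration: `move_aside` is a hypothetical helper, and it assumes the application treats a missing directory the same way whether it was deleted or renamed.

```python
import shutil
import time
from pathlib import Path


def move_aside(data_dir):
    """Rename data_dir to a timestamped backup; return the new path or None."""
    data_dir = Path(data_dir).expanduser()
    if not data_dir.exists():
        return None  # nothing to move
    backup = data_dir.with_name(f"{data_dir.name}.bak-{int(time.time())}")
    shutil.move(str(data_dir), str(backup))
    return backup
```

Calling `move_aside("~/.sciencefair")` would rename the directory from the comment above to something like `~/.sciencefair.bak-<timestamp>`, which can be deleted later once the rebuilt index is confirmed to work.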
