Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature Request] OCR search text in images #296

Closed
lethefrost opened this issue Jul 12, 2024 · 10 comments
Closed

[Feature Request] OCR search text in images #296

lethefrost opened this issue Jul 12, 2024 · 10 comments
Labels
feature request New feature or request

Comments

@lethefrost
Copy link

It would be especially helpful when you have a lot of screenshots, diagrams, photo of slides, etc., embedded in documents or as stand alone image files. Text in images may contain a large amount of information. However, it's not very easy to retrieve them in the traditional ways of file management. It would be greatly appreciated if you could consider making them searchable.

@MohamedBassem MohamedBassem added the feature request New feature or request label Jul 13, 2024
@MohamedBassem
Copy link
Collaborator

hmmm, OCR is a cool idea indeed. My only concern is finding a good OCR tool that would work with different languages.

@lethefrost
Copy link
Author

hmmm, OCR is a cool idea indeed. My only concern is finding a good OCR tool that would work with different languages.

This might be helpful - I am thinking probably we can let each user configure a list of possible languages that would occur in their hoard - which usually are the languages they know, so the list wouldn't be too long (for most people it might be 1-3?). It seems that Tesseract.js supports recognizing multiple languages at the same time when you concatenate the lang codes with +?

@MohamedBassem
Copy link
Collaborator

tesseract.js looks cool indeed. We can probably add it to the roadmap at some point

@akshara-tg
Copy link

Without OCR (which allows for searching text within images), the hoarding images become somewhat pointless.

@Arcturuss
Copy link

+1 for OCR in images.
Personally I wanted to make a "meme catalog" in Hoarder. few thoughts about that:

  • in addition to OCR, semantic search is needed. similar to suggested in [Feature request] Selfhosted semantic search #441 but for images, like Immich does
  • maybe option to enable OCR separately for hoarded single images only and not for the images from webpages

@MohamedBassem
Copy link
Collaborator

@Arcturuss OCR for uploaded images is something on our roadmap and I'm definitely planning to do it pretty soon.

@github-project-automation github-project-automation bot moved this from Backlog to Done in Hoarder's Roadmap Oct 20, 2024
@MohamedBassem
Copy link
Collaborator

OCR is now implemented and will be available in the next release.

@lethefrost
Copy link
Author

OCR is now implemented and will be available in the next release.

Thank you! It's very great to hear that! Appreciate it a lot.

kamtschatka pushed a commit to kamtschatka/hoarder-app that referenced this issue Nov 2, 2024
@drycounty
Copy link

Can you tell me how this is implemented? Do I need to specify any of the ENV variables for it to work? Can't seem to get it to work from photos of pages of text.

@MohamedBassem
Copy link
Collaborator

@drycounty it's enabled by default. Currently, we don't expose the extracted text, but we only index it for search. Try searching for the content of the page and see if it'll showup.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feature request New feature or request
Projects
Status: Done
Development

No branches or pull requests

6 participants
@Arcturuss @MohamedBassem @drycounty @lethefrost @akshara-tg and others