This repository has been archived by the owner on Feb 28, 2024. It is now read-only.

Improve and document ingestion workflow #41

Open
anupdhml opened this issue Sep 3, 2018 · 1 comment

anupdhml commented Sep 3, 2018

Once we have the raw docs, anyone on the team should be able to add new documents to Elasticsearch easily. To get there, we need to settle and document:

  • the schema/format to follow (for all our sources: crawled docs, OCR)
  • where to store the raw docs
  • how to start indexing the docs into Elasticsearch
  • how to verify the results of indexing
  • how to roll back to the previous state in case of issues
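
As a starting point for discussion, here is a minimal sketch of how the schema check, indexing, and rollback steps could fit together. It only builds request bodies for the Elasticsearch `_bulk` and `_aliases` endpoints and does not talk to a cluster. The field names in `REQUIRED_FIELDS`, the index names (`docs-v1`, `docs-v2`), and the alias name (`docs`) are all assumptions made up for illustration, not anything this project has agreed on.

```python
import json

# Hypothetical schema: these field names are assumptions for illustration only.
REQUIRED_FIELDS = {"id", "source", "text"}


def validate(doc):
    """Return the sorted list of required fields missing from a raw document."""
    return sorted(REQUIRED_FIELDS - set(doc))


def bulk_body(docs, index):
    """Build a newline-delimited JSON payload for the Elasticsearch _bulk endpoint."""
    lines = []
    for doc in docs:
        missing = validate(doc)
        if missing:
            raise ValueError(f"document {doc.get('id')!r} is missing {missing}")
        # Each document is an action line followed by its source line.
        lines.append(json.dumps({"index": {"_index": index, "_id": doc["id"]}}))
        lines.append(json.dumps(doc))
    # The bulk payload must end with a newline.
    return "\n".join(lines) + "\n"


def alias_swap(alias, old_index, new_index):
    """Build an _aliases request body that repoints an alias in one request."""
    return {
        "actions": [
            {"remove": {"index": old_index, "alias": alias}},
            {"add": {"index": new_index, "alias": alias}},
        ]
    }


if __name__ == "__main__":
    raw_docs = [{"id": "1", "source": "ocr", "text": "hello"}]
    print(bulk_body(raw_docs, "docs-v2"), end="")
    # Rolling back is the same swap with the indices reversed.
    print(json.dumps(alias_swap("docs", "docs-v2", "docs-v1")))
```

The idea behind the alias is that each ingestion run writes to a fresh index and searches go through the alias, so rolling back means pointing the alias at the previous index rather than deleting documents. Verification could then compare the number of raw docs with the document count of the new index before the alias is switched.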

@anupdhml (Member, Author)

Partially done in #80
