You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Same refactoring is needed for the other nodes in our indexing pipeline (e.g. Preprocessor).
When working on this, we should take into account that we create the document's id at the moment when we instantiate the Document. If this now happens within different nodes, we probably need to add a param there to control the way how these ids are created (default: hash of content). For example, in DC we want to create the ID based on content + a pipeline_id. This can be done via id_hash_keys parameter in Document.init().
Currently, the file converters are returning
List[Dict]
. We should make use of ourDocument
primitive here and returnList[Document]
instead.The text was updated successfully, but these errors were encountered: