Skip to content

Commit

Permalink
docs: add Data Prep Kit integration (#316)
Browse files Browse the repository at this point in the history
Signed-off-by: Panos Vagenas <[email protected]>
  • Loading branch information
vagenas authored Nov 12, 2024
1 parent 777237e commit 93fc1be
Show file tree
Hide file tree
Showing 3 changed files with 17 additions and 3 deletions.
13 changes: 13 additions & 0 deletions docs/integrations/data_prep_kit.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,13 @@
## Get started

Docling is used by the [Data Prep Kit \[\]](https://ibm.github.io/data-prep-kit/) open-source toolkit for preparing unstructured data for LLM application development ranging from laptop scale to datacenter scale.

Below you find the Data Prep Kit modules powered by Docling.

## PDF ingestion to Parquet
- 💻 [GitHub \[\]](https://github.com/IBM/data-prep-kit/tree/dev/transforms/language/pdf2parquet)
- 📖 [API docs \[\]](https://ibm.github.io/data-prep-kit/transforms/language/pdf2parquet/python/)

## Document chunking
- 💻 [GitHub \[\]](https://github.com/IBM/data-prep-kit/tree/dev/transforms/language/doc_chunk)
- 📖 [API docs \[\]](https://ibm.github.io/data-prep-kit/transforms/language/doc_chunk/python/)
2 changes: 1 addition & 1 deletion docs/integrations/llamaindex.md
Original file line number Diff line number Diff line change
@@ -1,6 +1,6 @@
## Get started

Docling is available as an official LlamaIndex extension!
Docling is available as an official [LlamaIndex \[\]](https://docs.llamaindex.ai/) extension.

To get started, check out the [step-by-step guide in LlamaIndex \[\]](https://docs.llamaindex.ai/en/stable/examples/data_connectors/DoclingReaderDemo/)<!--{target="_blank"}-->.

Expand Down
5 changes: 3 additions & 2 deletions mkdocs.yml
Original file line number Diff line number Diff line change
Expand Up @@ -81,8 +81,9 @@ nav:
# - CLI: examples/cli.md
- Integrations:
- Integrations: integrations/index.md
- "LlamaIndex 🦙 extension": integrations/llamaindex.md
# - "LangChain 🦜🔗 extension": integrations/langchain.md
- "Data Prep Kit": integrations/data_prep_kit.md
- "LlamaIndex 🦙": integrations/llamaindex.md
# - "LangChain 🦜🔗": integrations/langchain.md
# - API reference:
# - API reference: api_reference/index.md

Expand Down

0 comments on commit 93fc1be

Please sign in to comment.