https://learn.deeplearning.ai/courses/preprocessing-unstructured-data-for-llm-applications
Improve your RAG system to retrieve diverse data types
-
Learn to extract and normalize content from a wide variety of document types, such as PDFs, PowerPoints, Word, and HTML files, tables, and images to expand the information accessible to your LLM.
-
Enrich your content with metadata, enhancing retrieval augmented generation (RAG) results and supporting more nuanced search capabilities.
-
Explore document image analysis techniques like layout detection and vision and table transformers, and learn how to apply these methods to preprocess PDFs, images, and tables.
-
Beginner
-
Matt Robinson
-
Prerequisite recommendation: This is a beginner-friendly course.
- https://dyckms5inbsqq.cloudfront.net/Unstructured/unstructured-c1/unstructured_c1_01/video/unstructured_c1_01_720p/unstructured_c1_01_720p.m3u8
- https://dyckms5inbsqq.cloudfront.net/Unstructured/unstructured-c1/unstructured_c1_02/video/unstructured_c1_02_720p/unstructured_c1_02_720p.m3u8
- https://dyckms5inbsqq.cloudfront.net/Unstructured/unstructured-c1/unstructured_c1_03/video/unstructured_c1_03_720p/unstructured_c1_03_720p.m3u8
- https://dyckms5inbsqq.cloudfront.net/Unstructured/unstructured-c1/unstructured_c1_04/video/unstructured_c1_04_720p/unstructured_c1_04_720p.m3u8
- https://dyckms5inbsqq.cloudfront.net/Unstructured/unstructured-c1/unstructured_c1_05/video/unstructured_c1_05_720p/unstructured_c1_05_720p.m3u8
- https://dyckms5inbsqq.cloudfront.net/Unstructured/unstructured-c1/unstructured_c1_06/video/unstructured_c1_06_720p/unstructured_c1_06_720p.m3u8
- https://dyckms5inbsqq.cloudfront.net/Unstructured/unstructured-c1/unstructured_c1_07/video/unstructured_c1_07_720p/unstructured_c1_07_720p.m3u8
- https://dyckms5inbsqq.cloudfront.net/Unstructured/unstructured-c1/unstructured_c1_08/video/unstructured_c1_08_720p/unstructured_c1_08_720p.m3u8