feat: add layout detection (#228)
* feat: create first format modules

* add: example file

* add: structured output formatter

* fix: all parsers output a list of elements & compatibility formatters

* feat: new basemodel for document

* add: structured output

* fix: test

* fix: add uncategorized text handling

* add: skip on flaky pdf

* add: section block

* fix: change load logic & create page element

* fix: add pages

* add: split onnxtr det and reco

* feat: Doctr in MegaParse

* fix: update README

* fix: add config as constructor parameters

* add: to_numpy to bbox

* add: onnx layout detection

* feat: add category to Document Blocks

* add: List handling and table / image ingestion

* structured output

* blk100 ignore

* fix

* feat: fix output

---------

Co-authored-by: chloedia <[email protected]>
AmineDiro and chloedia authored Feb 13, 2025
1 parent 32101e9 commit 77f7040
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion libs/megaparse_sdk/megaparse_sdk/schema/mp_outputs.py
@@ -1,6 +1,6 @@
 from enum import Enum, auto
+from typing import Dict
 
-from llama_parse.base import Dict
 from pydantic import BaseModel, Field
 
 from megaparse_sdk.schema.document import Document