Skip to content

feat/Infer the hierarchical heading/title levels such as H1, H2, H3, H4 for PDFs #4204

@codebymikey

Description

@codebymikey

Is your feature request related to a problem? Please describe.
This was supposedly raised and addressed as part of #1354

But I can't seem to get any metadata regarding the inferred heading of any content I test with.

All Title elements all have the same level at the moment which isn't an accurate representation of the parsed document.

Describe the solution you'd like
The parser should be able to infer PDF hierarchy in a couple ways:

Describe alternatives you've considered
N/A

Additional context

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions