Skip to content

Conversation

@PeterStaar-IBM
Copy link
Member

@PeterStaar-IBM PeterStaar-IBM commented Jan 12, 2026

Summary of Changes

  1. ✅ Updated pyproject.toml
  1. ✅ Backend Consolidation Complete

Files Deleted:

  • docling/backend/docling_parse_backend.py (V1)
  • docling/backend/docling_parse_v2_backend.py (V2)
  • tests/test_backend_docling_parse.py (V1 tests)
  • tests/test_backend_docling_parse_v2.py (V2 tests)

Files Renamed:

  • docling/backend/docling_parse_v4_backend.py → docling/backend/docling_parse_backend.py
  • tests/test_backend_docling_parse_v4.py → tests/test_backend_docling_parse.py

Classes Renamed:

  • DoclingParseV4DocumentBackend → DoclingParseDocumentBackend
  • DoclingParseV4PageBackend → DoclingParsePageBackend
  1. ✅ Backward Compatibility Maintained

Updated pipeline_options.py:

  • Added new enum value: PdfBackend.DOCLING_PARSE = "docling_parse"
  • Kept deprecated values: DLPARSE_V1, DLPARSE_V2, DLPARSE_V4 (marked as deprecated)
  • Added normalize_pdf_backend() function that:
    • Automatically maps deprecated values to DOCLING_PARSE
    • Emits DeprecationWarning when deprecated values are used
  1. ✅ Updated All References

Core Files:

  • docling/cli/main.py - Updated imports, backend selection logic, and default value
  • docling/document_converter.py - Updated imports and default backend
  • docling/datamodel/pipeline_options.py - Added enum and normalization function

Test Files:

  • tests/test_backend_docling_parse.py - Updated imports and class names
  • tests/test_options.py - Updated imports and removed V2/V4 from test loop
  • tests/test_e2e_ocr_conversion.py - Updated imports
  • tests/test_pdf_password.py - Updated imports and class names

Documentation/Examples:

  • docs/examples/batch_convert.py - Updated imports and class names

Migration Path for Users

Users with existing code using deprecated backends will:

  1. See a DeprecationWarning when using DLPARSE_V1, DLPARSE_V2, or DLPARSE_V4
  2. Have their code continue to work (mapped automatically to DOCLING_PARSE)
  3. Be guided to update to PdfBackend.DOCLING_PARSE
Signed-off-by: Peter Staar <taa@zurich.ibm.com>
@mergify
Copy link

mergify bot commented Jan 12, 2026

Merge Protections

Your pull request matches the following merge protections and will not be merged until they are valid.

🟢 Enforce conventional commit

Wonderful, this rule succeeded.

Make sure that we follow https://www.conventionalcommits.org/en/v1.0.0/

  • title ~= ^(fix|feat|docs|style|refactor|perf|test|build|ci|chore|revert)(?:\(.+\))?(!)?:
@github-actions
Copy link
Contributor

github-actions bot commented Jan 12, 2026

DCO Check Passed

Thanks @PeterStaar-IBM, all your commits are properly signed off. 🎉

Signed-off-by: Peter Staar <taa@zurich.ibm.com>
@codecov
Copy link

codecov bot commented Jan 13, 2026

Codecov Report

❌ Patch coverage is 94.50549% with 5 lines in your changes missing coverage. Please review.

Files with missing lines Patch % Lines
docling/backend/docling_parse_backend.py 96.00% 3 Missing ⚠️
docling/datamodel/pipeline_options.py 81.81% 2 Missing ⚠️

📢 Thoughts on this report? Let us know!

Signed-off-by: Peter Staar <taa@zurich.ibm.com>
Signed-off-by: Peter Staar <taa@zurich.ibm.com>
Signed-off-by: Peter Staar <taa@zurich.ibm.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

2 participants