[MS] Extend table support for wide tables (#1552)

* feat: enhance PDF table extraction to support complex forms and add new test cases
* feat: enhance PDF table extraction with adaptive column clustering and add comprehensive test cases
* fix: correct formatting and improve assertions in PDF table tests
This commit is contained in:
lesyk
2026-02-13 19:45:39 +01:00
committed by GitHub
parent 7fdaefb724
commit c83de14a9c
11 changed files with 784 additions and 5 deletions
+3
View File
@@ -1,2 +1,5 @@
packages/markitdown/tests/test_files/** linguist-vendored
packages/markitdown-sample-plugin/tests/test_files/** linguist-vendored
# Treat PDF files as binary to prevent line ending conversion
*.pdf binary