Tables recognition on PDF · pymupdf/PyMuPDF · Discussion #3170 (original) (raw)

This is an error in the logic that recognizes clusters of vector graphics. I have developed a fix.
[Ping me if you urgently need it].

I am going to open an issue here.

You must be logged in to vote

4 replies

@JorjMcKie

Thank you for the feedback!
Again the question: If you have access to your PyMuPDF installation (e.g. it is on your computer), simply replace its table.py ... if urgent.

@bjmvercelli

If it's not a problem for you, i would really appreciate it :)

@JorjMcKie

no problem at all, here is the file
table.zip

Just unpack and copy it over to site-packages/fitz/, replacing the file present there. Note that there also currently still is /fitz_old/, the "classic" implememtnation of PyMuPDF. We are maintaining this for a while more as a fallback. It contains the same file, so you might want to do the same replacement there.

@bjmvercelli

Looks perfect! Thank you so much.

I'll be keeping an eye on PR's, looking foward to check on this one 🏆

image