[PDFBOX-4550] Poor performance with corrupt ToUnicode stream (original) (raw)
Details
- Type:
Bug
- Status: Closed
- Priority:
Major
- Resolution: Fixed
- Affects Version/s: 2.0.15
- Fix Version/s: 2.0.16, 3.0.0 PDFBox
- Component/s: Rendering, Text extraction
- Labels:
None
Description
A confidential file with lots of corrupt streams has ToUnicode stream with corrupt contents in the beginbfrange segment where start and end have different lengths. This leads to poor performance. Such entries can be skipped.
Attachments
Attachments
- 169997-p1.pdf
15/Jun/19 09:29
130 kB
Tilman Hausherr - EF6E2XR2XAXWUHZV5STGMYPWNNDLXDDT-p2.pdf
15/Jun/19 09:29
94 kB
Tilman Hausherr - PDFBOX-3442-DirectResources_unc.pdf
20/May/19 16:56
501 kB
Tilman Hausherr - PDFBOX-3442-DirectResources.pdf
20/May/19 04:39
73 kB
Tilman Hausherr - PDFBOX-4550-LG5S35JUXSEH5XJC6QYISY3OBUXCKAKR-p1-reduced.pdf
13/Jun/19 03:56
18 kB
Tilman Hausherr - pdnekz1gvl7.pdf
20/May/19 04:43
74 kB
Tilman Hausherr
Activity
People
Assignee:
Andreas Lehmkühler
Reporter:
Tilman Hausherr
Votes:
0 Vote for this issue
Watchers:
4 Start watching this issue
Dates
Created:
15/May/19 19:42
Updated:
28/Jun/19 04:39
Resolved:
19/Jun/19 05:10