PyMuPDF 1.24.2 Documentation
... TEXT_PRESERVE_LIGATURES | fitz.TEXT_PRESERVE_WHITESPACE
>>> tp = dl.get_textpage(flags)
This will save ca. 25% overall execution time for the HTML, XHTML and JSON text extractions and hugely reduce the amount ...
The following table shows average relative speeds (“RSpeed”, baseline 1.00 is TEXT), taken across ca. 1400 text-heavy and 1300 image-heavy pages.
Method | RSpeed | Comments | no images
TEXT | 1.00 | no images ...
... wrappers for text extraction and text searching are now based on this, which should improve performance by ca. 5%.
Changes in Version 1.16.4
• Fixed issue #381 (“TextPage.extractDICT ... failed ... after upgrading ...
565 pages | 6.84 MB | 1 year ago
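The snippet above builds a text page from a display list using only the ligature and whitespace flags. A minimal sketch of that idea follows, assuming the missing flag is `TEXT_PRESERVE_IMAGES` and that leaving it out is what keeps images out of the output. The helper name `extract_html_without_images` is hypothetical, and the import is guarded only so the sketch loads where PyMuPDF is not installed.

```python
# PyMuPDF is a third-party package; guard the import so this sketch
# can still be loaded on a machine where it is not installed.
try:
    import fitz  # PyMuPDF
except ImportError:
    fitz = None


def extract_html_without_images(page):
    """Return HTML text for `page`, built without the image flag.

    Hypothetical helper: it combines the two flags shown in the snippet
    and deliberately leaves out fitz.TEXT_PRESERVE_IMAGES (assumed to be
    the flag that pulls image data into the extraction).
    """
    if fitz is None:
        raise RuntimeError("PyMuPDF (fitz) is required for this sketch")
    flags = fitz.TEXT_PRESERVE_LIGATURES | fitz.TEXT_PRESERVE_WHITESPACE
    dl = page.get_displaylist()   # display list for the page
    tp = dl.get_textpage(flags)   # text page built with the reduced flags
    return tp.extractHTML()


if __name__ == "__main__" and fitz is not None:
    doc = fitz.open()             # new, empty PDF held in memory
    page = doc.new_page()
    page.insert_text((72, 72), "Hello, PyMuPDF")
    print(extract_html_without_images(page))
```

The same text page can be reused for the XHTML and JSON extractions, so the display list only has to be interpreted once per page.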
PyMuPDF 1.12.2 documentation
... TEXT_PRESERVE_LIGATURES | fitz.TEXT_PRESERVE_WHITESPACE
>>> tp = dl.getTextPage(flags)
This will save ca. 25% overall execution time for the HTML, XHTML and JSON text extractions and hugely reduce the amount ...
387 pages | 2.70 MB | 1 year ago