Issues
- 8
Shutdown of Apache Tika Corpora
#3035 opened by stefan6419846 - 4
- 4
Support for PANTONE colors
#3033 opened by stefan6419846 - 0
Improve handling of LZW decoder table overflow
#3032 opened by stefan6419846 - 13
- 3
Intermittent `IndexError` when accessing `PdfReader.pages` with `ThreadPoolExecutor`
#3024 opened by blairfrandeen - 7
Question about documentation for xmp_metadata.dc_description and xmp_metadata.dc_subject
#3023 opened by dfkettle - 1
TypeError when extracting text from PDF: Unsupported operand type(s) for '/' (IndirectObject and float)
#3020 opened by HEKUCHAN - 1
pypdf.errors.PdfReadError: startxref not found
#3017 opened by neeraj9 - 1
Crash during page text extraction
#2975 opened by neeraj9 - 7
- 1
- 0
Generated single page PDF is huge
#3011 opened by Vafilor - 0
- 6
- 5
`PageObject.transfer_rotation_to_content()` hides some content since pypdf 4.3.0
#2927 opened by stefan6419846 - 3
- 2
PdfReadError: Image data is not rectangular
#2993 opened by Verdant31 - 0
Collapsing outlines not working: parameter is_open in add_outline_item has no effect
#2994 opened by dowo-2987 - 4
Capitalization in metadata
#2992 opened by dfkettle - 6
Update namespace links in xmp.py
#2951 opened by j-t-1 - 6
Text visitor example in docs does not work
#2881 opened by lucasgadams - 5
- 0
Exception on indirect object during text extraction
#2966 opened by nsw42 - 1
PdfWriter().append throwing 'NullObject' object is not subscriptable for a specific PDF file
#2958 opened by eth-wa - 5
Transferred Annotations not Rendering Correctly
#2960 opened by eth-wa - 2
- 1
- 5
- 1
- 1
`UnboundLocalError` error when extracting text
#2933 opened by vodkar - 1
- 1
Inverted colors when extracting CMYK image
#2931 opened by AnzhiZhang - 2
#7 Using PdfReader causes a crash
#2886 opened by Avgor46 - 5
- 6
ENH: Ensure PyPI marks URLs as "verified"
#2892 opened by MartinThoma - 0
Regression when reading partially broken PDF files
#2926 opened by stefan6419846 - 5
DEV: Switch to latest pinned dependencies
#2914 opened by stefan6419846 - 1
Add an argument ``layout_mode_height_weight`` to control inference of vertical space when extracting text in layout mode
#2915 opened by hpierre001 - 3
DEV: Mirror freely licensed arXiv documents locally
#2904 opened by stefan6419846 - 2
Cloning errors when using context manager
#2912 opened by pubpub-zz - 1
Images merged between pages
#2923 opened by pprados - 0
How to remove watermark with pypdf2
#2916 opened by Estelle-gqy - 4
- 1
How to extract internal links using PyPDF
#2910 opened by swathiJayav - 1
- 3
- 1
#6 Using PdfReader causes a crash
#2875 opened by Avgor46 - 7
`PdfReader` causes memory overflow for a particular PDF
#2876 opened by JaMe76 - 3
BUG: infinite loop on damaged pdf file
#2877 opened by pubpub-zz