Skip to content
Advertisement

How can I get the total count of total pages of a PDF file using PDFMiner in Python?

In pypdf, len(reader.pages) gives me the total number of pages of a PDF file.

How can I get this using PDFMiner?

Advertisement

Answer

I hate to just leave a code snippet. For context here is a link to the current pdfminer.six repo where you might be able to learn a little more about the resolve1 method.

As you’re working with PDFMiner, you might print and come across some PDFObjRef objects. Essentially you can use resolve1 to expand those objects (they’re usually a dictionary).

JavaScript
User contributions licensed under: CC BY-SA
10 People found this is helpful
Advertisement