Error occurred while using PyPDF2 PdfFileMerger in Python

Question

I have been writing a Python program that uses PyPDF2 to merge multiple PDF files. While running the code, I encountered an error. Note: I ensured that none of the PDF files is password-protected.

Accepted Answer

Update for July 2022

This was fixed and will be in the next release of PyPDF2!

Original Answer

It seems this is caused by bad destination syntax in the outline of one of the PDFs you're trying to combine.

If you don't care about the outline, you should be able to get around this by updating import_bookmarks kwarg to False in PdfFileMerger.append, like this:

    import os
    from PyPDF2 import PdfFileMerger

    source_dir = os.getcwd()
    merger = PdfFileMerger()

    for item in os.listdir(source_dir):
        if item.endswith('pdf'):
            merger.append(item, import_bookmarks=False)

    merger.write('completed_file.pdf')
    merger.close()

More detail

PdfFileMerger.append calls PdfFileMerger.merge and passes the import_bookmarks kwarg to it. By default this is set to True.

In PyPDF2.generic, the Destination class is raising this error during initialization. The Merger is trying to build destinations into the new outline by reading them from the original outlines.

    def __init__(self, title, page, typ, *args):
        DictionaryObject.__init__(self)
        self[NameObject("/Title")] = title
        self[NameObject("/Page")] = page
        self[NameObject("/Type")] = typ

        # from table 8.2 of the PDF 1.7 reference.
        if typ == "/XYZ":
            (self[NameObject("/Left")], self[NameObject("/Top")],
                self[NameObject("/Zoom")]) = args
        elif typ == "/FitR":
            (self[NameObject("/Left")], self[NameObject("/Bottom")],
                self[NameObject("/Right")], self[NameObject("/Top")]) = args
        elif typ in ["/FitH", "/FitBH"]:
            self[NameObject("/Top")], = args
        elif typ in ["/FitV", "/FitBV"]:
            self[NameObject("/Left")], = args
        elif typ in ["/Fit", "/FitB"]:
            pass
        else:
            raise utils.PdfReadError("Unknown Destination Type: %r" % typ)

Since the destination type "0" isn't a valid type according to PDF Reference 1.7, it raises an error.
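To see why a type of "0" is rejected, here is a minimal standalone sketch of the same kind of check. It is not PyPDF2's code and does not import PyPDF2: the names EXPECTED_FIELDS and check_destination are hypothetical, and the table only mirrors the branches of the constructor quoted above. It also raises a plain ValueError rather than PdfReadError.

```python
# Illustrative stand-in for the destination check described above.
# NOT PyPDF2 code: EXPECTED_FIELDS and check_destination are hypothetical
# names that exist only in this sketch.

# Destination type -> names of the positional values it expects,
# copied from the branches of the Destination constructor shown earlier.
EXPECTED_FIELDS = {
    "/XYZ": ("/Left", "/Top", "/Zoom"),
    "/FitR": ("/Left", "/Bottom", "/Right", "/Top"),
    "/FitH": ("/Top",),
    "/FitBH": ("/Top",),
    "/FitV": ("/Left",),
    "/FitBV": ("/Left",),
    "/Fit": (),
    "/FitB": (),
}


def check_destination(typ, *args):
    """Return the destination fields as a dict, or raise ValueError."""
    if typ not in EXPECTED_FIELDS:
        # This is the situation the answer describes: a type such as "0"
        # matches none of the known destination types.
        raise ValueError("Unknown Destination Type: %r" % (typ,))
    names = EXPECTED_FIELDS[typ]
    if len(args) != len(names):
        raise ValueError(
            "%s expects %d value(s), got %d" % (typ, len(names), len(args))
        )
    return dict(zip(names, args))


if __name__ == "__main__":
    # A well-formed destination is accepted.
    print(check_destination("/XYZ", 0, 792, None))
    # A malformed type is rejected, as with the bad outline entry.
    try:
        check_destination("0")
    except ValueError as exc:
        print(exc)
```

Because the failure happens while the outline is being read, skipping the outline with import_bookmarks=False avoids this check entirely, at the cost of losing the bookmarks.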
