Skip to content
Advertisement

Loop through folder and subfolders and merge pdf

I tried to create a script to loop through parent folder and subfolders and merge all of the pdfs into one. Below if the code I wrote so far, but I don’t know how to combine them into one script.

Reference: Merge PDF files

The first function is to loop through all of the subfolders under parent folder and get a list of path for each pdf.

JavaScript

Second, I created a list to append all of the path to pdf files in the subfolders and merge into one combined file. At this step, I was told:

TypeError: listdir: path should be string, bytes, os.PathLike or None, not list

JavaScript

Where and what should I modify the code in order to avoid the error and also combine two functions together?

Advertisement

Answer

First you have to create functions which create list with all files and return it.

JavaScript

I use also .lower() to catch extensions like .PDF.

endswith() can use tuple with all extensions.

It is good to get external values as arguments – list_dir(root) instead of list_dir()


And later you can use as

JavaScript

in

JavaScript

EDIT:

First function could be even more universal if it would get also extensions

JavaScript

EDIT:

For single extension you can also do

JavaScript

It needs ** with recursive=True to search in subfolders.

User contributions licensed under: CC BY-SA
3 People found this is helpful
Advertisement