I want to iterate through directories to find PDF files but I also want to be able to profile the file path without iterating through directories. So this is what I wrote:
JavaScript
x
10
10
1
rootdir = "C:\Users\user\Downloads\Final.pdf"
2
3
for subdir, dirs, files in os.walk(rootdir):
4
for file in files:
5
path = os.path.join(subdir, file)
6
safe_text = ""
7
print(path)
8
with pdfplumber.open(str(path)) as pdf:
9
print("this is a PDF file")
10
Advertisement
Answer
Use os.path.isdir
/os.path.isfile
to check if the path is a valid directory/file. If it’s a directory – walk it with a for
loop, otherwise process it as file. It may help to extract the logic to a function so you don’t have to repeat yourself.
JavaScript
1
25
25
1
import os.path
2
3
4
def process(path):
5
"""Process a single PDF file."""
6
safe_text = ""
7
print(path)
8
with pdfplumber.open(str(path)) as pdf:
9
print("this is a PDF file")
10
11
12
path = "C:\Users\user\Downloads\Final.pdf"
13
14
if os.path.isdir(path):
15
# it's a directory - walk it
16
for subdir, dirs, files in os.walk(path):
17
for file in files:
18
subpath = os.path.join(subdir, file)
19
process(subpath)
20
elif os.path.isfile(path):
21
# it's a single file - process it
22
process(path)
23
else:
24
raise FileNotFoundError(path)
25