Print number of occurrences of any items in a list in paths

Question

I am using os.walk to identify paths in a generic source directory (SRC) that contain any strings in my_list: And let's say that print(source_path) gives the following: My goal is to shutil.move my source_path's, but since, for example, moving /User/dir_1/bird_files/ and then trying to move /User/dir_1/bird_files/bird_a_files/ will result in a FileNotFound Error, I want to filter my source_path's to include

Accepted Answer

Use a comprehension to filter by count, then sum the result (True is cast to 1) to get the &#8220;any&#8221; behavior.paths = """/User/dir_1/cat_test//User/dir_1/cat_test/bird_results//User/dir_1/dir_2/dog_test//User/dir_1/dir_2/dog_test/cat_results//User/dir_1/mouse_test//User/dir_1/mouse_test/mouse_results//User/dir_1/unknown_test/dog_results//User/dir_1/bird_files//User/dir_1/bird_files/bird_a_files//User/dir_1/bird_files/bird_b_files/""".split()my_list = ["dog", "cat", "mouse", "bird"]out = []for path in paths:    if sum(True for term in my_list if path.count(term) == 1) == 1:        out.append(path)print(*out, sep='n')Output/User/dir_1/cat_test//User/dir_1/dir_2/dog_test//User/dir_1/mouse_test//User/dir_1/unknown_test/dog_results//User/dir_1/bird_files/EDIT: From the comment, a os.walk approach.Idea: remove terms from the dirnames parameterRemark: I used as filtering condition (see comment in the code) the method substring is contained in string which is quite poor. In this special case a more robust one could be d.startswith(c). For more flexibility use a regex-like solution.import osconstraints = 'dog', 'cat', 'mouse', 'bird'wdir = './User' # your reference directoryres = []for path, dirs, _ in os.walk(wdir, topdown=True):    # local to each directory's content    counter = dict.fromkeys(constraints, False)    dirs_to_skip = []        # filter by constraint    for c in constraints:        for d in dirs:            if c in d: # <-- filter condition!                if not counter[c]: # 1st match                    counter[c] = True                    res.append(os.path.join(path, d))                dirs_to_skip.append(d)        # remove matched paths              for d in dirs_to_skip:        dirs.remove(d)print(*res, sep='n')

Advertisement

Answer