How to load multiple text files from a folder into a python list variable

Question

I have a folder full of text documents, the text of which needs to be loaded into a single list variable. Each index of the list, should be the full text of each document. So far I have this code, but it is not working as well. Is there a better way to do this? Edit: Replaced the csv reading

Accepted Answer

You just need to read() each file in and append it to your corpus list as follows:import globimport osfile_list = glob.glob(os.path.join(os.getcwd(), "FolderName", "*.txt"))corpus = []for file_path in file_list:    with open(file_path) as f_input:        corpus.append(f_input.read())print(corpus)Each list entry would then be the entire contents of each text file. Note, using readlines() would give you a list of lines for each file rather than the raw text.With a list-comprehensionfile_list = glob.glob(os.path.join(os.getcwd(), "FolderName", "*.txt"))corpus = [open(file).read() for file in file_list]This approach though might end up with more resource usage as there is no with section to automatically close each file.

Advertisement

Answer

With a list-comprehension