Output is an empty file

Question

My code does not throw an error; it creates the output files, but all of them are empty. I tried it from the command line, and it works using the wildcard path training_set_pssm/*.pssm, but I must run it from the IDE because it is not printing the correct output anyway.
The input is a set of checkpoint files that look like this:

Accepted Answer

First and foremost, let's fix your paths. You wrote from pathlib import Path but never used it. Let's declare

    infile = Path('/Users/name/Desktop/PDB/training_set_pssm/idlist/')

We now have some helpful functions we can use for finding problems. Try some of these to make sure you are searching in the right place:

    # gives the absolute file path, useful to check that it is correct
    infile.absolute()
    # tells you whether this path exists
    infile.exists()
    # tells you whether this path is a file
    infile.is_file()

Let's start at the beginning. I'll try to explain what is happening in your code line by line.

    if __name__ == '__main__':
        # I don't really know what this infile is. Is it a file containing
        #   d1s7za_.fasta.pssm
        #   d1s98a_.fasta.pssm
        #   d1s99a_.fasta.pssm
        # or a directory containing files named
        #   d1s7za_.fasta.pssm
        #   d1s98a_.fasta.pssm
        #   d1s99a_.fasta.pssm
        #   ...
        infile = Path('/Users/name/Desktop/PDB/training_set_pssm/idlist')

        # this returns a list of strings, presumably in the form of
        #   d1ciya2.fasta\n
        #   d1ciya3.fasta\n
        #   d1cq3a_.fasta\n
        idlist = lines_to_list("/Users/name/Desktop/PDB/training_set_idlist")

        # loop over that list
        for ids in idlist:
            # strips the '\n' from the id and adds '.pssm'
            # you now have something like 'd1d0qa_.fasta.pssm'
            # you never use this
            part2 = ids.rstrip() + '.pssm'

            # was 'if os.path.isfile(infile) == True:' but should be:
            if infile.is_file():
                # strips the '\n' from the id and adds '.profile'
                # you now have something like 'd1d0qa_.fasta.profile'
                ofile = ids.rstrip() + '.profile'

                # here is where it becomes a bit weird
                # in relevant_lines you say:
                #   Takes list (extracted from a .pssm file) and extracts
                #   the Sequence Profile Portion only.
                # is infile a .pssm file?
                # is this correct?
                profile_list = relevant_lines(infile)

                # this seems fine, it writes the normalized data to ofile.
                # ofile will be something like 'd1d0qa_.fasta.profile'
                write_normalized_profile(profile_list, ofile)

Solution:

    if __name__ == '__main__':
        # the directory
        pssm_directory = Path('/Users/name/Desktop/PDB/training_set_pssm/idlist/')
        idlist = lines_to_list("/Users/name/Desktop/PDB/training_set_idlist")
        for ids in idlist:
            # generate the filename from the id
            infile = pssm_directory.joinpath(ids.rstrip() + '.pssm')
            # check that the file exists
            if infile.is_file():
                ofile = ids.rstrip() + '.profile'
                profile_list = relevant_lines(infile)
                write_normalized_profile(profile_list, ofile)
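If the loop still produces nothing, it can help to see which ids resolve to a real file and which do not, instead of skipping them silently. Below is a minimal sketch of that check, not part of the original code: the helper names resolve_pssm_inputs and glob_pssm_inputs are hypothetical, and it assumes each id in the list maps to a file named id + '.pssm' inside a single directory. The second helper mirrors the *.pssm wildcard that worked on the command line.

```python
from pathlib import Path
import tempfile


def resolve_pssm_inputs(pssm_dir, ids):
    """Split ids into (found, missing) lists of candidate .pssm paths.

    Hypothetical helper: assumes every id maps to '<id>.pssm' in pssm_dir.
    """
    pssm_dir = Path(pssm_dir)
    found, missing = [], []
    for raw_id in ids:
        # strip() removes the trailing newline left over from reading the id list
        candidate = pssm_dir / (raw_id.strip() + ".pssm")
        if candidate.is_file():
            found.append(candidate)
        else:
            missing.append(candidate)
    return found, missing


def glob_pssm_inputs(pssm_dir):
    """Return every *.pssm file directly inside pssm_dir, sorted by name."""
    return sorted(Path(pssm_dir).glob("*.pssm"))


if __name__ == "__main__":
    # Demonstration in a throwaway directory with one placeholder file
    with tempfile.TemporaryDirectory() as tmp:
        demo_dir = Path(tmp)
        (demo_dir / "d1s7za_.fasta.pssm").write_text("placeholder\n")
        found, missing = resolve_pssm_inputs(
            demo_dir, ["d1s7za_.fasta\n", "d1s98a_.fasta\n"]
        )
        print("found:  ", [p.name for p in found])
        print("missing:", [p.name for p in missing])
```

Printing the missing list first usually shows right away whether the directory or the id format is wrong. If every id ends up in missing, the directory passed in is probably not the one that holds the .pssm files.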
