python-docx returning empty cells when they should be full

Question

I am trying to iterate through all tables in a document and extract the text from them. As an intermediate step I am just trying to print the text to the console. I have looked at other code provided by scanny in similar posts but for some reason it is not giving me my expected output from the document I

Accepted Answer

Found the error. I was using a third party tool (multiDoc converter) to convert old .Doc files into Docx format. works for the most part, however there must be some meta data that doesn&#8217;t convert properly because it was causing the issue. Opening the file and manually saving it as Docx solved the issue. Only problem is that I want to convert 2000+ files into Docx, so I&#8217;ll need to find another solution for convertiing the files.

Advertisement

Answer