Skip to content
Advertisement

Python & MS Word: Convert .doc to .docx?

I found several questions that were similar to mine, but none of the answers came close to what I need.

Specifications: I’m working with Python 3 and do not have MS Word. My programming machine is running OS X and cloud machine is linux/ubuntu too.

I’m using python-docx to extract values from a .doc file that is sent to me nightly. However, python-docx only works with .docx files, so I need to convert the file to that extension first.

So, I’ve got a .doc file that I need to convert to .docx. This script might have to run in the cloud so I can’t install any kind of Office or Office-like software. Can this be done?

Advertisement

Answer

You are working with Linux/ubuntu, you can use LibreOffice’s inbuilt converter.

SYNTAX

lowriter --convert-to docx *.doc

Example

lowriter --convert-to docx testdoc.doc

This will convert all doc files to docx and save in the same folder itself.

User contributions licensed under: CC BY-SA
9 People found this is helpful
Advertisement