Skip to content
Advertisement

Using openpyxl module to write to spreadsheet creates a damaged spreadsheet, how to fix with zipfile module?

I have a program which writes to a spreadsheet using openpyxl. Upon execution of the program, the cells are filled as expected but the spreadsheet becomes damaged. Excel repairs the spreadsheet and I can then view it again.

JavaScript

I have tried using the Open XML SDK Productivity Tool from Microsoft to compare a good and bad file with each other and noticed that styles.xml is missing. I try to copy this over using the following source code I have obtained from another question, but it does not solve the issue for me.

JavaScript

I can confirm from the repair log Excel generates, that the problem is with xl/styles.xml. I need to copy this xml file from the good copy, to the bad copy.

How can I get the xl/styles.xml file copied so that the program can run without damaging output.xlsx?

I have made another attempt to fix this issue. In the off chance that styles.xml cannot be copied from a different Excel file; I have opened styles.xml from output.xlsx prior to book.save("output.xlsx"). After saving, I then get the styles.xml from before the save statement, and write it. Unfortunately, this has not changed anything and I am still getting a damaged Excel file. With this attempt, my test code looks like this:

JavaScript

I have tried saving as a completely new Excel File, but still have the same issue. I tried using zip file to open from output.xlsx and writing to the newly saved file, but still no result.

JavaScript

Although I have already fixed this issue, it is worth noting that this problem only seems to occur when loading a workbook. I have created another program with spreadsheets that creates a workbook, rather than loading it. As a result of this, the spreadsheet does not saves damaged.

Advertisement

Answer

After confirming that the issue was with styles.xml, I identified that the issue was most likely with the style formatting of the written cells. By using styles from the openpyxl module, I have fixed the issue.

I declare a variable, fontStyle in this case, and set all the style settings:

JavaScript

When writing amounts to each cell, I also set the style of these cells using fontStyle:

JavaScript

The completed code, looks like this:

JavaScript

I believe this has worked because the writing method has no default style settings. This would explain why styles.xml was missing when using the Open XML SDK Productivity Tool. Upon checking this Excel file again after the fix, I can confirm that styles.xml is no longer missing.

The file is no longer damaged upon being saved and can be opened normally again. Also, I am now able to execute this script to write to the Excel file again, without having to open and close to repair it.

Note that I have also changed my loop from the original loop – as part of one of my attempts to fix the issue. This has not had an effect on the final outcome – it is all down to the styling of the cells written.

This doesn’t exactly answer the question of solving the issue specifically with zipfile but it does solve the problem.

Advertisement