How to import all fields from xls as strings into a Pandas dataframe?

Question

I am trying to import a file from xlsx into a Python Pandas dataframe. I would like to prevent fields/columns from being interpreted as integers and thus losing leading zeros or other desired heterogeneous formatting. So for an Excel sheet with 100 columns, I would do the following using a dict comprehension with range(99). These import files do have a varying

Accepted Answer

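Try this:

    import pandas as pd

    xl = pd.ExcelFile(r'C:DemoFile.xlsx')
    # Ask the underlying workbook how many columns the first sheet has
    ncols = xl.book.sheet_by_index(0).ncols
    # Convert every column, by position, to str
    df = xl.parse(0, converters={i : str for i in range(ncols)})

UPDATE:

    In [261]: type(xl)
    Out[261]: pandas.io.excel.ExcelFile

    In [262]: type(xl.book)
    Out[262]: xlrd.book.Book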
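The answer above relies on xl.book being an xlrd workbook. As a rough sketch of the same idea that does not touch the workbook object, the column count can instead be taken from a header-only read. The names string_converters and read_sheet_as_strings are hypothetical helpers introduced here for illustration, and the sketch assumes a pandas version whose read_excel accepts sheet_name and nrows, plus an installed Excel engine for the file type.

```python
def string_converters(ncols):
    """Build {0: str, 1: str, ...} covering column positions 0..ncols-1.

    Hypothetical helper: range(ncols) yields exactly ncols keys, so a
    sheet with 100 columns needs range(100), not range(99).
    """
    return {i: str for i in range(ncols)}


def read_sheet_as_strings(path, sheet=0):
    """Read one sheet with every column converted to str (illustrative sketch)."""
    # Imported here so string_converters stays usable without pandas installed.
    import pandas as pd

    # First pass: read only the header row to learn how many columns exist.
    header = pd.read_excel(path, sheet_name=sheet, nrows=0)
    ncols = header.shape[1]

    # Second pass: apply str to every column by position.
    return pd.read_excel(path, sheet_name=sheet,
                         converters=string_converters(ncols))
```

Usage would look like df = read_sheet_as_strings(r'C:DemoFile.xlsx'). In pandas versions that support the dtype argument of read_excel, pd.read_excel(path, dtype=str) is a shorter alternative. In either case, leading zeros can only survive if the cells are stored as text in the workbook; zeros that exist only through an Excel number format are not part of the cell value.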

Answer