Serialize ‘csv’ file as binary and append to file

Question

How is it possible to achieve the following at the same time in python 3: Serialize column names and numerical data as a binary file Reopen the file and append additional numerical data For example with the following data: My approach with numpy This approach allows to save data and append additional data. However the column names are missing and

Accepted Answer

import pickle# Write first df to pickledata = {    "name": ["Joe", "Mike", "Tony", "Susan"],    "course": ["Masters", "Doctorate", "Graduate", "Bachelors"],    "age": [27, 23, 21, 19],}df = pd.DataFrame(data)df.to_pickle(path)# Create new row dfnew_row = {"name": "Phil", "course": "Associates", "age": 30}new_row_df = pd.DataFrame(new_row, index=[0])print(f"{new_row_df}n")# read original df from picklepickled_df = pd.read_pickle(path)# concat dfs df_appended = pd.concat([new_row_df, pickled_df]).reset_index(drop=True)# Dump concat df to picklewith open(path, "wb") as f:    pickle.dump(df_appended, f)# read concat df from pickledf = pd.read_pickle(path)print(df)You can append to the file without reading but the dfs wont be concatenated they are seperate entries. You can ofcourse read all the entries in a loop and concat later when it&#8217;s time to read the file.# Add new entrieswith open(path, "ab") as f:    pickle.dump(new_df, f)# When ready to read and concat.with open(path, "rb") as f:    entries = []    while True:        try:            entry = pickle.load(f)        except EOFError:            break        entries.append(entry)df = pd.concat(entries).reset_index(drop=True)print(df)

Advertisement

Answer