Unexpected tail of the floating point from using arange loop to name the columns

Question

I tried to create some numbers and rename the output columns with the np.arange loop as the following: The conditional number part worked for me. The columns&#8217; names were fine between columns reject0.6-reject0.68. After that, all columns&#8217; names turned to reject with the unexpected numbers, e.g., re…

Accepted Answer

You are seeing the tail of the floating point precision.  It is impossible to exactly represent most floats, and we end up with a tail that end at the numeric precision.I think you can solve this by formatting the strings you are using for column names.def conditional_zero_column(filename="random.csv"):    df = pd.read_csv(filename)    for i in np.arange(0.6, 1.0, 0.01):        col = f'reject{i:.2f}'        df[col] = np.where(df['expected_discount'] < i, df['expected_discount'], i)        df[col] = np.where(df[col] >= i, df[col], 0.0)        df.to_csv("random_data4.csv", index=False)

Advertisement

Answer