Pandas: efficiently inserting a large number of rows

Question

I have a large dataframe in this format, call this df:

index	val1	val2
0	0.2	0.1
1	0.5	0.7
2	0.3	0.4

I have a row I will be inserting, call this myrow:

index	val1	val2
-1	0.9	0.9

I wish to insert this row 3 times after every row in the original dataframe, i.e.:

index	val1	val2
0

Accepted Answer

Call reset_index so that df has a simple RangeIndex. Then we can do math with tiling and repeats to create an Index that, when sorted, will place 3 of the myrow rows between each row of your DataFrame. Finally, remove this Index and get back to a normal RangeIndex.

Sample Data

    import pandas as pd
    import numpy as np

    myrow = pd.DataFrame({'index': [-1], 'val1': [0.9], 'val2': [0.9]})
    df = pd.DataFrame({'index': [0, 1, 2],
                       'val1': [0.2, 0.5, 0.3],
                       'val2': [0.1, 0.7, 0.4]})

Code

    # Ensure starting from a RangeIndex
    df = df.reset_index(drop=True)

    NR = 3  # Number of repeats

    mr = pd.concat([myrow]*len(df)*NR, ignore_index=True)
    mr.index = df.index.repeat(NR) + np.tile(np.arange(0, 1, 1/NR), len(df))

    # `mr` second in the `concat` so rows go below
    df = pd.concat([df, mr]).sort_index().reset_index(drop=True)

        index  val1  val2
    0       0   0.2   0.1
    1      -1   0.9   0.9
    2      -1   0.9   0.9
    3      -1   0.9   0.9
    4       1   0.5   0.7
    5      -1   0.9   0.9
    6      -1   0.9   0.9
    7      -1   0.9   0.9
    8       2   0.3   0.4
    9      -1   0.9   0.9
    10     -1   0.9   0.9
    11     -1   0.9   0.9
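
The index math can also be packaged as a small reusable function. This is only a sketch of the same fractional-key idea, not part of the original answer: the helper name insert_after_each_row and its parameters are hypothetical, and it assumes the row to insert is a one-row DataFrame with the same columns as df. Two choices differ from the code above. The offsets are built from an integer arange divided by the repeat count, so there are exactly n_repeats of them regardless of floating-point rounding in the step. The sort asks for kind="mergesort", because the first inserted key (for example 0.0) ties with the original row's key (0), and only a stable sort guarantees that the original row stays first among ties.

```python
import numpy as np
import pandas as pd


def insert_after_each_row(df, row, n_repeats):
    """Return a new frame with `row` inserted n_repeats times after every row of df.

    Hypothetical helper for illustration; not a pandas API.
    """
    base = df.reset_index(drop=True)
    if len(base) == 0 or n_repeats == 0:
        # Nothing to insert; pd.concat would reject an empty list.
        return base.copy()

    # One copy of `row` per (original row, repeat) pair.
    filler = pd.concat([row] * (len(base) * n_repeats), ignore_index=True)

    # Fractional sort keys, e.g. n_repeats=3 gives 0, 1/3, 2/3, 1, 4/3, 5/3, ...
    offsets = np.arange(n_repeats) / n_repeats
    filler.index = (np.repeat(np.arange(len(base)), n_repeats)
                    + np.tile(offsets, len(base)))

    # `base` goes first in the concat; the stable sort keeps it first on ties.
    combined = pd.concat([base, filler]).sort_index(kind="mergesort")
    return combined.reset_index(drop=True)


myrow = pd.DataFrame({'index': [-1], 'val1': [0.9], 'val2': [0.9]})
df = pd.DataFrame({'index': [0, 1, 2],
                   'val1': [0.2, 0.5, 0.3],
                   'val2': [0.1, 0.7, 0.4]})

result = insert_after_each_row(df, myrow, 3)
print(result['index'].tolist())
# [0, -1, -1, -1, 1, -1, -1, -1, 2, -1, -1, -1]
```

The function leaves the input frame untouched and returns a new one, so it can be called with different repeat counts on the same data.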
