Split a single column (1,000 rows) into two smaller columns (500 each)

Question

How can I split a single column containing 1000 rows into two columns of 500 rows each in pandas? I have a CSV file that contains a single column, and I need to split it into multiple columns. Below is the format in the CSV. Steps I took: I had multiple CSV files containing one column with 36…

Accepted Answer

Answer updated to work for an arbitrary number of columns

You could start with either the number of columns or the row length. For a given initial column length, you can calculate one from the other. In this answer I use the desired target column length, tgt_row_len.

    import numpy as np
    import pandas as pd

    nb_groups = 4
    tgt_row_len = 5
    df = pd.DataFrame({'column1': np.arange(1, tgt_row_len * nb_groups + 1)})
    print(df)

        column1
    0         1
    1         2
    2         3
    3         4
    4         5
    5         6
    6         7
    ...
    17       18
    18       19
    19       20

Create groups in the index for the following grouping operation:

    # Integer division maps each row position to its group number (0, 1, 2, ...)
    df.index = df.reset_index(drop=True).index // tgt_row_len
    print(df)

       column1
    0        1
    0        2
    0        3
    0        4
    0        5
    1        6
    1        7
    ...
    3       17
    3       18
    3       19
    3       20

    # Each group becomes one row, so transpose to turn groups into columns
    dfn = (
        df.groupby(level=0).apply(lambda x: x['column1'].reset_index(drop=True)).T
        .rename(columns=lambda x: 'col' + str(x + 1)).rename_axis(None)
    )
    print(dfn)

       col1  col2  col3  col4
    0     1     6    11    16
    1     2     7    12    17
    2     3     8    13    18
    3     4     9    14    19
    4     5    10    15    20

Previous answer that handles creating two columns

This answer just shows 10 target rows as an example. That can easily be changed to 364 or 500.

A dataframe where column1 contains 2 sets of 10 rows:

    tgt_row_len = 10
    df = pd.DataFrame({'column1': np.tile(np.arange(1, tgt_row_len + 1), 2)})
    print(df)

        column1
    0         1
    1         2
    2         3
    3         4
    4         5
    5         6
    6         7
    7         8
    8         9
    9        10
    10        1
    11        2
    12        3
    13        4
    14        5
    15        6
    16        7
    17        8
    18        9
    19       10

Move the bottom set of rows to column2:

    # shift(-tgt_row_len) pulls the second half up alongside the first half
    df.assign(column2=df['column1'].shift(-tgt_row_len)).iloc[:tgt_row_len].astype(int)

       column1  column2
    0        1        1
    1        2        2
    2        3        3
    3        4        4
    4        5        5
    5        6        6
    6        7        7
    7        8        8
    8        9        9
    9       10       10
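For the exact case in the question (1000 rows into two columns of 500), a plain NumPy reshape may be a simpler alternative to the groupby approach in the answer. The following is only a minimal sketch and not the answerer's method: the helper name split_column and the col1, col2 column names are assumptions made for illustration, and it assumes the column length divides evenly by the target row length.

```python
import numpy as np
import pandas as pd


def split_column(series, n_cols):
    """Split a Series into n_cols equal-length columns, filled top to bottom.

    Hypothetical helper for illustration; not part of pandas.
    """
    if len(series) % n_cols != 0:
        raise ValueError("series length must be divisible by n_cols")
    rows = len(series) // n_cols
    # reshape fills row by row, so build (n_cols, rows) and transpose
    # to get consecutive chunks of the input as columns.
    values = series.to_numpy().reshape(n_cols, rows).T
    names = [f"col{i + 1}" for i in range(n_cols)]
    return pd.DataFrame(values, columns=names)


# Stand-in data: one column with 1000 rows.
df = pd.DataFrame({"column1": np.arange(1, 1001)})

tgt_row_len = 500
n_cols = len(df) // tgt_row_len  # derive the column count from the row length

wide = split_column(df["column1"], n_cols)
print(wide.shape)
print(wide.head())
```

Here col1 holds the first 500 values and col2 the next 500. If the length is not an exact multiple of the target row length, the sketch raises an error rather than padding, so uneven data would need separate handling.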

