Split single column (1,000 rows) into two smaller columns (500 each)

Question

How can I split a single column containing 1000 rows into two columns of 500 rows each in pandas? I have a CSV file that contains a single column, and I need to split it into multiple columns. Below is the format in the CSV. Steps I took: I had multiple CSV files, each containing one column with 364 rows.

Accepted Answer

Answer updated to work for an arbitrary number of columns

You could start with either the number of columns or the row length. For a given initial column length, you can calculate one from the other. In this answer I use the desired target column length, tgt_row_len.

    import numpy as np
    import pandas as pd

    nb_groups = 4
    tgt_row_len = 5
    df = pd.DataFrame({'column1': np.arange(1, tgt_row_len*nb_groups + 1)})
    print(df)

        column1
    0         1
    1         2
    2         3
    3         4
    4         5
    5         6
    6         7
    ...
    17       18
    18       19
    19       20

Create groups in the index for the following grouping operation:

    # Integer division maps rows 0-4 to group 0, rows 5-9 to group 1, and so on
    df.index = df.reset_index(drop=True).index // tgt_row_len
    print(df)

       column1
    0        1
    0        2
    0        3
    0        4
    0        5
    1        6
    1        7
    ...
    3       17
    3       18
    3       19
    3       20

    # Each group becomes one row of the result; transposing turns the groups into columns
    dfn = (
        df.groupby(level=0).apply(lambda x: x['column1'].reset_index(drop=True)).T
            .rename(columns=lambda x: 'col' + str(x+1)).rename_axis(None)
    )
    print(dfn)

       col1  col2  col3  col4
    0     1     6    11    16
    1     2     7    12    17
    2     3     8    13    18
    3     4     9    14    19
    4     5    10    15    20

Previous answer that handles creating two columns

This answer just shows 10 target rows as an example. That can easily be changed to 364 or 500.

A dataframe where column1 contains 2 sets of 10 rows:

    tgt_row_len = 10
    df = pd.DataFrame({'column1': np.tile(np.arange(1, tgt_row_len + 1), 2)})
    print(df)

        column1
    0         1
    1         2
    2         3
    3         4
    4         5
    5         6
    6         7
    7         8
    8         9
    9        10
    10        1
    11        2
    12        3
    13        4
    14        5
    15        6
    16        7
    17        8
    18        9
    19       10

Move the bottom set of rows to column2:

    # shift(-tgt_row_len) pulls the second half up next to the first; iloc keeps only the top half
    df.assign(column2=df['column1'].shift(-tgt_row_len)).iloc[:tgt_row_len].astype(int)

       column1  column2
    0        1        1
    1        2        2
    2        3        3
    3        4        4
    4        5        5
    5        6        6
    6        7        7
    7        8        8
    8        9        9
    9       10       10
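
For the exact case in the question (1000 rows into two columns of 500), a NumPy reshape is another possible route. The following is only a sketch and not part of the answer above: the helper name `split_column` and its parameters `rows_per_col` and `prefix` are hypothetical, and it assumes the column holds numeric data. If the length is not an exact multiple of `rows_per_col`, the last column is padded with NaN, which converts the values to float.

```python
import numpy as np
import pandas as pd


def split_column(series, rows_per_col, prefix="col"):
    """Split a 1-D numeric Series into side-by-side columns of rows_per_col rows.

    Hypothetical helper for illustration; assumes numeric values.
    """
    values = series.to_numpy()
    n_cols = -(-len(values) // rows_per_col)  # ceiling division
    pad = n_cols * rows_per_col - len(values)
    if pad:
        # Pad the tail with NaN so the data fills a full rectangle
        values = np.concatenate([values.astype(float), np.full(pad, np.nan)])
    # order="F" fills the result column by column, so the first
    # rows_per_col values land in the first column, and so on
    wide = values.reshape((rows_per_col, n_cols), order="F")
    return pd.DataFrame(wide, columns=[f"{prefix}{i + 1}" for i in range(n_cols)])


# Stand-in for the single column read from the CSV file
df = pd.DataFrame({"column1": np.arange(1, 1001)})
wide = split_column(df["column1"], 500)
print(wide.shape)  # (500, 2)
```

With real data, `df` would presumably come from `pd.read_csv` on the single-column file, and `rows_per_col` could be set to 364 or 500 as needed.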


