Python/Pandas: How to process a column of data like a dictionary

Question

I have a CSV like this. I would like to sum the values from the column "PDCP.RxBytesUl":

PDCP.RxBytesUl = 5QI1+5QI2+5QI3+5QI4+5QI5+5QI6+5QI7+5QI8+5QI9

Finally, the result should look like this. At first I wanted to convert this column into a dict(), but I found the format was not right. I have no idea how to proceed, please help…

Accepted Answer

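You can use a regex-based solution:

    import pandas as pd

    df = pd.read_csv('input.csv', delimiter='|')
    # Capture every number that follows a colon, then sum the matches per row
    df['sum'] = df['PDCP.RxBytesUl'].str.extractall(r':(\d+(?:\.\d+)?)').astype('float').unstack().sum(axis=1)
    df.drop('PDCP.RxBytesUl', axis=1, inplace=True)

df:

        cel_id       sum
    0   1001-1234-1  0.0045
    1   1001-1234-2  0.0163
    2   1001-1234-4  0.0095

Better code suggested by Shubham :)

    # Capture everything between ':' and the next ';', then sum per original row.
    # sum(level=0) was removed in pandas 2.0; groupby(level=0).sum() is the equivalent.
    df['sum'] = df['PDCP.RxBytesUl'].str.extractall(':([^;]+)')[0].astype('float').groupby(level=0).sum()
    df.drop('PDCP.RxBytesUl', axis=1, inplace=True)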
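Since the question mentions converting the column into a dict, here is a minimal sketch of that route as well. It assumes each cell looks like `5QI1:0.001;5QI2:0.002;...` (key and value separated by `:`, pairs separated by `;`), which is what the regexes in the answer imply. The original CSV is not shown, so the sample rows and the helper name `parse_counters` are hypothetical.

```python
import io
import pandas as pd

# Hypothetical sample in the assumed 'key:value;key:value' format;
# the real input file is not shown in the question.
raw = """cel_id|PDCP.RxBytesUl
1001-1234-1|5QI1:0.001;5QI2:0.002;5QI9:0.0015
1001-1234-2|5QI1:0.01;5QI5:0.0063
"""

def parse_counters(cell):
    """Turn 'key:value;key:value' into a dict mapping key -> float."""
    pairs = (item.split(':', 1) for item in cell.split(';') if item)
    return {key: float(value) for key, value in pairs}

df = pd.read_csv(io.StringIO(raw), delimiter='|')

# One dict per row, then sum the dict values for each row
parsed = df['PDCP.RxBytesUl'].apply(parse_counters)
df['sum'] = parsed.apply(lambda d: sum(d.values()))

df = df.drop(columns='PDCP.RxBytesUl')
print(df)
```

This is likely slower than `str.extractall` on large frames because it runs Python code per row, but it keeps the individual 5QI values available by name if you need them later.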

Answer