Row-level cumulative sum with condition

Question

I have a table that looks like this. m1 m2 m3 m4 m5 m6 m7 m8 s 0 1 0 0 5 0 4 10 4 4 1 8 0 15 0 4 10 10 I need to know at which position or column the row-level cumulative sum for the first six columns (m1 to m6) either equals or exceeds

Accepted Answer

you can use cumsum on axis=1 with get_indexer on the df.columns:df['output'] = df.columns.get_indexer(df.drop("s",1).cumsum(axis=1)                       .ge(df['s'],axis=0).idxmax(axis=1))+1print(df)   m1  m2  m3  m4  m5  m6  m7  m8   s  output0   0   1   0   0   5   0   4  10   4       51   4   1   8   0  15   0   4  10  10       3EDIT:There can be situations where none of the column in a row satisfies this condition , in that case, you may use a condition to check (expect a -1 where the condition doesnot match for any column in a row):c = df.drop("s",1).cumsum(axis=1).ge(df['s'],axis=0)df['output'] = df.columns.get_indexer(c.idxmax(1).where(c.any(1)))+1

m1	m2	m3	m4	m5	m6	m7	m8	s	output
0	1	0	0	5	0	4	10	4	5
4	1	8	0	15	0	4	10	10	3

Advertisement

Answer