How to use Excel’s SUMIF function in Pandas

Question

I'm having difficulty calculating "total_sum". If someone didn't apply to a subject, I marked it as N/A. For each row, total_sum should add up the values from the Standard row, with the N/A subjects excluded. I'm not good at Python, so I don't know how to calculate "total_sum".

Accepted Answer

Suppose this dataframe is the same as yours (with an index of strings):

    import numpy as np
    import pandas as pd

    dataframe = {'index': ['Standard', 'A', 'B', 'C'],
                 'ENG': [10, 10, np.nan, 3],
                 'MATH': [10, np.nan, 5, 3],
                 'ART': [5, np.nan, 3, 2],
                 'COM': [5, 1, 5, 2]}
    df = pd.DataFrame(dataframe).set_index('index').rename_axis(None)
    # Row-wise sum of the subject columns; NaN entries are skipped.
    df['subject_sum'] = df.sum(axis=1)
    df

                ENG     MATH    ART   COM   subject_sum
    Standard    10.0    10.0    5.0   5     30.0
    A           10.0    NaN     NaN   1     11.0
    B           NaN     5.0     3.0   5     13.0
    C           3.0     3.0     2.0   2     10.0

Then you can apply .dot() of the .notna() mask of the subject columns to the values of Standard. The mask is True wherever a subject has a value, so each row's dot product adds up the Standard values of only the subjects that row applied to:

    standard = df.loc['Standard', ['ENG', 'MATH', 'ART', 'COM']]
    df['total_sum'] = df[['ENG', 'MATH', 'ART', 'COM']].notna().dot(standard)
    df

Result:

                ENG     MATH    ART     COM     subject_sum     total_sum
    Standard    10.0    10.0    5.0     5       30.0            30.0
    A           10.0    NaN     NaN     1       11.0            15.0
    B           NaN     5.0     3.0     5       13.0            20.0
    C           3.0     3.0     2.0     2       10.0            30.0
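To see why the dot product behaves like SUMIF here, it may help to spell out the same calculation step by step. The following is only a sketch built on the example data from this answer. The names `subjects`, `applied`, `total_by_mul` and `total_by_dot` are mine, and casting the mask to float is my own assumption to keep the result numeric rather than something the answer does.

```python
import numpy as np
import pandas as pd

subjects = ["ENG", "MATH", "ART", "COM"]
df = pd.DataFrame(
    {
        "ENG": [10, 10, np.nan, 3],
        "MATH": [10, np.nan, 5, 3],
        "ART": [5, np.nan, 3, 2],
        "COM": [5, 1, 5, 2],
    },
    index=["Standard", "A", "B", "C"],
)

# Points available per subject, taken from the Standard row.
standard = df.loc["Standard", subjects]

# 1.0 where a row has a value for the subject, 0.0 where it is NaN.
applied = df[subjects].notna().astype(float)

# Long form: scale each flag by the subject's Standard value, then sum across the row.
total_by_mul = applied.mul(standard, axis=1).sum(axis=1)

# Short form: the dot product does the multiply-and-sum in one call.
total_by_dot = applied.dot(standard)

print(pd.DataFrame({"by_mul": total_by_mul, "by_dot": total_by_dot}))
```

Both columns come out as 30, 15, 20 and 30 for Standard, A, B and C. The multiply-then-sum form is longer, but it makes the "sum the Standard value only if the cell is not N/A" condition explicit.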

