Pandas DataFrame of a distribution of particles: group by ID and find the half flux and the half flux radius

Question

I am using a pandas DataFrame. I have a distribution of particles, their distances from the center of the distribution, and their associated fluxes. I want to find the total flux enclosed within the "half flux radius" (or "half light radius"), which is by definition the radius that encloses half of the total flux. I will give an example and then I

Accepted Answer

This can be done as follows:

```python
import pandas as pd

data = {'dist_ID':  [702641.0,702641.0,702641.0,702641.0,702641.0,702641.0,802663.0,802663.0,802663.0,802663.0],
        'R':        [5.791781,2.806051,3.254907,8.291544,4.901959,8.630691,95.685763,116.070396,112.806001,229.388117],
        'flux':     [0.097505,0.015750,0.086941,0.081764,0.053561,0.144661,0.025735,0.026012,0.022163,0.026154]}
df = pd.DataFrame(data)

# Sort DF by group and by increasing radius
df = df.sort_values(['dist_ID', 'R'])

# Calculate cumsum: flux enclosed within each particle's radius
df['flux_cumsum'] = df.groupby('dist_ID')['flux'].transform(pd.Series.cumsum)

# Calculate half_flux: half of the total flux of each group
df_halfflux = df.groupby('dist_ID').apply(lambda x: x.flux.sum() / 2).to_frame().rename(columns={0:'half_flux'})
df = pd.merge(df, df_halfflux, how="left", on=['dist_ID'])

# Calculate discrepancy between the enclosed flux and half_flux
df['flux_diff'] = abs(df.half_flux - df.flux_cumsum)
print(df)

# Find R_hf-row: minimum discrepancy per group (stored under the name 'R_hf')
df = df.groupby(['dist_ID', 'half_flux']).agg({'flux_diff': 'min'}).rename(columns={'flux_diff': 'R_hf'})
print(df)
```

The code above outputs:

```
    dist_ID           R      flux  flux_cumsum  half_flux  flux_diff
0  702641.0    2.806051  0.015750     0.015750   0.240091   0.224341
1  702641.0    3.254907  0.086941     0.102691   0.240091   0.137400
2  702641.0    4.901959  0.053561     0.156252   0.240091   0.083839
3  702641.0    5.791781  0.097505     0.253757   0.240091   0.013666
4  702641.0    8.291544  0.081764     0.335521   0.240091   0.095430
5  702641.0    8.630691  0.144661     0.480182   0.240091   0.240091
6  802663.0   95.685763  0.025735     0.025735   0.050032   0.024297
7  802663.0  112.806001  0.022163     0.047898   0.050032   0.002134
8  802663.0  116.070396  0.026012     0.073910   0.050032   0.023878
9  802663.0  229.388117  0.026154     0.100064   0.050032   0.050032
                        R_hf
dist_ID  half_flux
702641.0 0.240091   0.013666
802663.0 0.050032   0.002134
```
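Note that the last step of the answer stores the smallest `flux_diff` under the name `R_hf`, so that column is a flux discrepancy rather than a radius. If the radius itself and the flux it encloses are wanted, one possible variation is to look up the row with the smallest discrepancy via `idxmin`. This is only a sketch: it assumes the half flux radius is approximated by the particle whose cumulative flux is closest to half the total, and the names `result` and `flux_enclosed` are introduced here for illustration.

```python
import pandas as pd

data = {'dist_ID': [702641.0, 702641.0, 702641.0, 702641.0, 702641.0, 702641.0,
                    802663.0, 802663.0, 802663.0, 802663.0],
        'R':       [5.791781, 2.806051, 3.254907, 8.291544, 4.901959, 8.630691,
                    95.685763, 116.070396, 112.806001, 229.388117],
        'flux':    [0.097505, 0.015750, 0.086941, 0.081764, 0.053561, 0.144661,
                    0.025735, 0.026012, 0.022163, 0.026154]}
df = pd.DataFrame(data)

# Sort by group and radius so the cumulative sum runs outward from the center
df = df.sort_values(['dist_ID', 'R']).reset_index(drop=True)

# Flux enclosed within each particle's radius, and half of each group's total
df['flux_cumsum'] = df.groupby('dist_ID')['flux'].cumsum()
df['half_flux'] = df.groupby('dist_ID')['flux'].transform('sum') / 2

# Distance of the enclosed flux from the half-flux target
df['flux_diff'] = (df['half_flux'] - df['flux_cumsum']).abs()

# Assumption: take the particle closest to the target as the half flux radius
idx = df.groupby('dist_ID')['flux_diff'].idxmin()
result = (df.loc[idx, ['dist_ID', 'R', 'flux_cumsum', 'half_flux']]
            .rename(columns={'R': 'R_hf', 'flux_cumsum': 'flux_enclosed'})
            .set_index('dist_ID'))
print(result)
```

For the sample data this selects R = 5.791781 for `dist_ID` 702641.0 and R = 112.806001 for 802663.0, matching the rows with the smallest `flux_diff` in the table above. If the radius must enclose at least half of the flux, the first row where `flux_cumsum >= half_flux` could be selected instead.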

Answer