Filter pandas column based on ranges in a huge list

Question

I am trying to filter the 'time' data into 'time_filtered' based on the ranges in lut_lst: if a 'time' value falls in any of the ranges, replace it with NaN; otherwise copy the value into the new column. The output for df is not filtered. I tried using any(lut_lst) or all(lut_lst) but that just th…

Accepted Answer

Use tuples instead of ranges in lut_lst, and change your filter slightly:

    import numpy as np
    import pandas as pd

    # creates look up list for ranges that need to be excluded
    lut_lst = []
    for i in range(0, 2235, 15):
        a = i, 2 + i
        b = 14 + i, 15 + i
        lut_lst.append(a)
        lut_lst.append(b)

    # if 'time' value falls in any of the ranges of lut_lst, replace values with NaN (drop row)
    data_cols = ['filename', 'time']
    data_vals = [['cell1', 0.0186],
                 ['cell1', 0.0774],
                 ['cell1', 2.2852],
                 ['cell1', 2.3788],
                 ['cell1', 14.62],
                 ['cell1', 15.04],
                 ['cell2', 20.3416],
                 ['cell2', 20.9128],
                 ['cell2', 29.6784],
                 ['cell2', 30.1194],
                 ['cell2', 32.3304]]

    df = pd.DataFrame(data_vals, columns=data_cols)
    # strict comparison: a value exactly on a range boundary is kept
    df['time_filtered'] = df['time'].apply(lambda x: x if not any(a < x < b for a, b in lut_lst) else np.nan)
    df

Output:

        filename     time  time_filtered
    0      cell1   0.0186            NaN
    1      cell1   0.0774            NaN
    2      cell1   2.2852         2.2852
    3      cell1   2.3788         2.3788
    4      cell1  14.6200            NaN
    5      cell1  15.0400            NaN
    6      cell2  20.3416        20.3416
    7      cell2  20.9128        20.9128
    8      cell2  29.6784            NaN
    9      cell2  30.1194            NaN
    10     cell2  32.3304        32.3304
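If the frame is large, calling apply with a Python-level loop over every range for every row may become slow. The following is a minimal sketch of a vectorized variant, not part of the accepted answer: it assumes the same exclusion windows, (i, i+2) and (i+14, i+15) for each 15-unit block, and the same strict comparison. The names lower, upper and in_any are introduced here only for illustration.

```python
import numpy as np
import pandas as pd

# Same exclusion windows as lut_lst, stored as two parallel arrays
# (assumption: windows are (i, i+2) and (i+14, i+15) for i in range(0, 2235, 15)).
starts = np.arange(0, 2235, 15)
lower = np.concatenate([starts, starts + 14])
upper = np.concatenate([starts + 2, starts + 15])

df = pd.DataFrame({
    "filename": ["cell1"] * 6 + ["cell2"] * 5,
    "time": [0.0186, 0.0774, 2.2852, 2.3788, 14.62, 15.04,
             20.3416, 20.9128, 29.6784, 30.1194, 32.3304],
})

# Broadcast each time value (as a column) against all windows (as a row),
# giving a boolean matrix of shape (n_rows, n_windows).
t = df["time"].to_numpy()[:, None]
in_any = ((lower < t) & (t < upper)).any(axis=1)

# Series.mask replaces values with NaN wherever the condition is True.
df["time_filtered"] = df["time"].mask(in_any)
```

The intermediate boolean matrix has one entry per (row, window) pair, so for very long columns it may be worth processing the data in chunks.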

Answer