How to flag an outlier(s) /anomaly in selected columns in python?

Question

In the dataset df below. I want to flag the anomalies in all columns except A, B,C and L. Any value less than 1500 or greater than 400000 is regarded as an anomaly. Attempt: Result of the code: Desired output should look like this: Thanks for the effort! Answer If you set the subset as the argument of the app…

Accepted Answer

If you set the subset as the argument of the apply function, you will get what you want.exclude_cols = ['A','B','C','L']def flag_outliers(s, exclude_cols):    if s.name in exclude_cols:        print(s.name)        return '' # or None, or whatever df.style() needs    else:        s = pd.to_numeric(s, errors='coerce')        indexes = (s<1500)|(s>400000)        return ['background-color: yellow' if v else '' for v in indexes]df.style.apply(lambda s: flag_outliers(s, exclude_cols), axis=1, subset=['D','E','F','G','H','J','K'])

Advertisement

Answer