Using np.select to change mixed data types (int and str) in a Pandas column

Question

I've been trying to map a column from my df into 4 categories (binning), but the column contains mixed values, int and str. It looks something like this: The categories I've been trying to change them to: This is how I've been trying to solve it: But I get this error: ValueError: shape mismatch: objects cannot

Accepted Answer

All values in your condlist for np.select must be the same length. Yours are not.

You can use pd.to_numeric with errors='coerce' to force values to convert to numeric. Then, use pd.cut to create your bins. Convert back to strings from categorical, and replace 'nan' entries with 'text'.

Given:

      data_column
    0          22
    1           8
    2          11
    3        Text
    4          17
    5        Text
    6           6

Doing:

    df.data_column = pd.to_numeric(df.data_column, errors='coerce')
    df.data_column = (pd.cut(df.data_column, [1, 10, 20, 30], labels=['superb', 'awesome', 'great'])
                        .astype(str)
                        .replace('nan', 'text'))

Output:

      data_column
    0       great
    1      superb
    2     awesome
    3        text
    4     awesome
    5        text
    6      superb
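If you would rather keep np.select, the same-length requirement can be met by building every condition from the coerced numeric column. The following is only a sketch, not the code from the question: the `nums` and `category` names are made up for illustration, and the intervals are assumed to be right-inclusive so that they match pd.cut's default behaviour.

```python
import numpy as np
import pandas as pd

# Sample data mirroring the example column in the answer.
df = pd.DataFrame({"data_column": [22, 8, 11, "Text", 17, "Text", 6]})

# Non-numeric strings become NaN; comparisons against NaN are False.
nums = pd.to_numeric(df["data_column"], errors="coerce")

# Each condition is a boolean Series with one entry per row,
# so every item in condlist has the same length.
condlist = [
    (nums > 1) & (nums <= 10),
    (nums > 10) & (nums <= 20),
    (nums > 20) & (nums <= 30),
]
choicelist = ["superb", "awesome", "great"]

# Rows matching no condition (the NaN rows, and any number
# outside the 1 to 30 range) fall back to the default label.
df["category"] = np.select(condlist, choicelist, default="text")
print(df)
```

Writing to a new column here keeps the original mixed values available for inspection; assign back to `data_column` instead if you want to overwrite it as the answer does.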

Answer