gcp dlp python / how to reduce likelyhood when a column does not contain a string

Question

I have a numeric client id to find. I created a custom info types : As expected, a lot of findings came out from the job and all with a very_likely likelyhood. To reduce the findings, I'd like to use hotwords in "reverse" mode : if there's not the string "cli" in the column name, then reduce likelyhood. In the

Accepted Answer

In order to accomplish this you want to set the default likelihood for your custom_info_type to be VERY_UNLIKELY and then keep your hotword rule as-is.   This way if something matches it will flag as VERY_UNLIKELY unless the header/context contains your match for &#8220;cli&#8221; in which case it will boost to VERY_LIKELY.Something like:custom_info_types = [    {        "info_type": {"name": "CLIENTID"},        "regex": {"pattern": r'd{7,8}'},        "likelihood": "VERY_UNLIKELY"    }]When you leave the likelihood blank in the custom_info_type definition, then it defaults to VERY_LIKELY.Let me know if this works.

Advertisement

Answer