Skip to content
Advertisement

Python – Find distinct domains inside a list of dictionaries

I have a list (with dictionaries inside) and I want to know how many different domains are inside it.

I have something like this:

JavaScript

The desired result would look like this:

JavaScript

Or maybe just:

JavaScript

Both would be OK, so whatever is easier or faster I guess.

I think I could use Regex for this, but maybe there are more pythonic and/or efficient ways to do this?

Thanks!

Advertisement

Answer

You can use urllib.parse.urlparse (from standard library) together with set comprehension (to avoid duplicates):

JavaScript

If you need, you can convert set to list via list(unique_domains). This is more reliable than regex solution.

(please don’t call variable list, it shadows useful builtin).

User contributions licensed under: CC BY-SA
9 People found this is helpful
Advertisement