Python: What is an efficient way to loop over a list of strings and group substrings in the list?

Question

Background I would like to find and group the substrings in the list into a list of tuples where the first element of the tuple would be the substring and the second element would be the larger string that contains the substring. The expected output is given below I've written the following code which achieves the desired outcome Is there

Accepted Answer

Combining suggestions in the comments and @ZabielskiGrabriel&#8217;s answer, you can do it by first sorting the list and then comparing each element in the sorted list with those that follow it in a list comprehension:my_list = sorted(my_list, key=len)[(x, y) for i, x in enumerate(my_list, 1) for y in my_list[i:] if x in y]Benchmarks (with supplied test list):%timeit op(my_list)%timeit zabiel(my_list)%timeit nin17(my_list)Output:3.92 µs ± 31 ns per loop (mean ± std. dev. of 7 runs, 100,000 loops each)2.76 µs ± 34.6 ns per loop (mean ± std. dev. of 7 runs, 100,000 loops each)2.25 µs ± 7.75 ns per loop (mean ± std. dev. of 7 runs, 100,000 loops each)

Advertisement

Answer