Skip to content
Advertisement

Python concatenate (multiple) strings to number

I have a python list ['a1', 'b1', 'a2', 'b2','a3', 'b3']. Set m=3 and I want get this list using loops, because here m=3 could be a larger number such as m=100.

Since we can have

JavaScript

then I try to get ['a1', 'b1', 'a2', 'b2','a3', 'b3'] using

JavaScript

and have TypeError: can only concatenate list (not "str") to list

Then I try

JavaScript

and I still get errors as UFuncTypeError: ufunc 'add' did not contain a loop with signature matching types (dtype('<U1'), dtype('<U1')) -> None.

How can I fix the problem? And even more, how to get something like ['a1', 'b1', 'c1', 'a2', 'b2','c2','a3', 'b3', 'c3'] through similar ways?

Advertisement

Answer

A simple combined list comprehension would work as pointed out in the @j1-lee’s answer (and later in other answers).

JavaScript

Similarly, one could use itertools.product(), as evidenced in Nick’s answer, to obtain substantially the same:

JavaScript

However, it is possible to write a NumPy-vectorized approach, making use of the fact that if the dtype is object, the operations do follow the Python semantics.

JavaScript

Note that the final numpy.ndarray.tolist() could be avoided if whatever will consume the output is capable of dealing with the NumPy array itself, thus saving some relatively small but definitely appreciable time.


Inspecting Output

The following do indicate that the functions are equivalent:

JavaScript
JavaScript

Benchmarks

For larger inputs, this is substantially faster, as evidenced by these benchmarks:

JavaScript

to be plotted with:

JavaScript

benchmarks

These show that the explicitly looped versions (letter_number_loop() and letter_number_it()) are somewhat comparable, while the NumPy-vectorized (letter_number_np()) fares much better relatively quickly for larger inputs, up to ~2x speed-up.

User contributions licensed under: CC BY-SA
4 People found this is helpful
Advertisement