Skip to content
Advertisement

python creating new list using a “template list”

Suppose i have:

JavaScript

and:

JavaScript

with the same shape

now i want to “put x2 ontop of x1” and sum up all the numbers of x1 corresponding to the numbers of x2

so the end result is:

JavaScript

this is a naive implementation using list to further clarify the question

JavaScript

so my question: is there is a way to implement this in numpy without using loops or in general just faster?

Advertisement

Answer

In this particular example (and, in general, for unique, duplicated, and groupby kinds of operations), pandas is faster than a pure numpy solution:

A pandas way, using Series (credit: very similar to @mcsoini’s answer):

JavaScript

A pure numpy way, using np.unique and some fancy indexing:

JavaScript

Note: a better pure numpy way is inspired by @Woodford’s answer:

JavaScript

Yet another pure numpy way is inspired by a comment from @mapf about using argsort(). That in itself already takes 45ms, but we may try something based on np.argpartition(x2, len(x2)-1) instead, since that takes only 7.5ms by itself on the benchmark below:

JavaScript

(Slightly modified) example

JavaScript

Speed

JavaScript

Going via pandas is faster, in part because of numpy issue 11136.

User contributions licensed under: CC BY-SA
9 People found this is helpful
Advertisement