I am wondering if anyone knows some of the key differences between the parakeet and the Numba jit? I am curious, because I was comparing Numexpr to Numba and parakeet, and for this particular expression (which I expected to perform very very well on Numexpr, because it was the one that is mentioned in its documentation)
So the results are
and the functions I tested (via timeit - minimum of 3 repetitions and 10 loops per function)
import numpy as np
import numexpr as ne
from numba import jit as numba_jit
from parakeet import jit as para_jit
def numpy_complex_expr(A, B):
return(A*B-4.1*A > 2.5*B)
def numexpr_complex_expr(A, B):
return ne.evaluate('A*B-4.1*A > 2.5*B')
@numba_jit
def numba_complex_expr(A, B):
return A*B-4.1*A > 2.5*B
@para_jit
def parakeet_complex_expr(A, B):
return A*B-4.1*A > 2.5*B
I you can also grab the IPython nb if you'd like to double-check the results on your machine.
If someone is wondering if Numba is installed correctly... I think so, it performed as expected in my previous benchmark:
As of the current release of Numba (which you are using in your tests), there is incomplete support for ufuncs with the
@jit
function. On the other hand you can use@vectorize
and it faster:Timing results:
If you want to leverage numba to its fullest, then you want to unroll any vectorized operations:
Also note that if you set the number of threads that numexpr uses to 1, then you'll see that its main speed advantage is that it's parallelized:
By default numexpr uses
ne.detect_number_of_cores()
as the number of threads. For my original timing on my machine, it was using 8.