I need to generate many hundreds of millions of random numbers for a clustering analysis. I am using numpy.random and was wondering if anyone knows the maximum number of pseudo-random numbers that numpy.random can generate before the sequence begins to repeat. A quick look in the numpy documentation didn't help.

I know I can generate numbers in chunks using different seeds, but I'm curious as to the maximum number.

Best answer

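It is, I believe, the Mersenne Twister (MT19937), which has a period of 2^19937 - 1, so a few hundred million draws will not come anywhere near repeating the sequence.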

https://docs.scipy.org/doc/numpy-1.15.1/reference/generated/numpy.random.set_state.html
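
If you want to check this yourself, here is a minimal sketch. It assumes you are using the legacy global numpy.random functions (the ones set_state applies to); the draws_needed figure is a made-up workload, not something from the question.

```python
import numpy as np

# The first entry of the legacy global state names the underlying algorithm.
algorithm = np.random.get_state()[0]
print(algorithm)

# Period of MT19937, held exactly in a Python arbitrary-precision int.
period = 2**19937 - 1

# Hypothetical workload: "many hundreds of millions" of draws.
draws_needed = 500_000_000

# The period is a 19937-bit number; the workload fits in about 29 bits.
print(period.bit_length(), draws_needed.bit_length())
print(draws_needed < period)
```

The comparison is done with plain integers, so there is no floating-point overflow to worry about even though the period is far too large for a float.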