Unfortunately, Intel for some reason decided to set the return value of PILAENV (the recommended block size) for their ScaLAPACK implementation to 512. This value gives less than ideal performance (on Intel's own CPUs!) and I am confused why they did it. Is there a way to overwrite this value, e.g. by an environment variable?
1
There are 1 best solutions below
Related Questions in INTEL-MKL
- 3D FFT with data larger than cache
- Can't open mkl_intel_s_dll.lib in 64bit Visual Studio environment
- Restrict MKL optimized scipy to single thread
- Win7, dependency error in 64bit visual studio
- Makefiile with many flags
- On entry to NIT parameter number 9 had an illegal value
- Error linking clang++ with dlib and intel mkl
- Confused with pdpotrf arguments
- Complex to real IFFT using MKL
- How to link Intel Math Kernel Library (MKL) with GNU Octave on Windows
- Getting max FLOPS for dense matrix multiplication with the Xeon Phi Knights Landing
- How save memory for a solving a symmetric (or upper traingular) matrix?
- MKL init when static linking MKL
- 3D Convolution using Intel MKL
- Anaconda mkl not licensed properly when called from R
Related Questions in SCALAPACK
- How to initialize two distinct blacs contexts?
- On entry to NIT parameter number 9 had an illegal value
- Confused with pdpotrf arguments
- Blacs: processor grids that do not use all MPI cores
- Calling BLACS with more processes than used
- scalapack matrix diagonalization (pdsyevd)
- Problems compiling example scalapack application
- Writing a configurable scalapack linear system solver that prints execution time
- Adapt dgemm example code to use sgemm (scalapack)
- Scalapack Routine PDGESVD_ does not return when called
- Algorithm to balance a set of differently sized matrix blocks between processes
- What is the blocking factor used to distribute the array?
- How to link library scalapack and blacs while compilation
- MKL ScaLapack problems
- ScaLapack operation, word and message counts
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
pilaenv returns the positive integer value of the logical blocking size. This value is machine and precision specific. This version provides a logical blocking size which should give good though not optimal performance on many of the currently available distributed-memory concurrent computers. You are encouraged to modify this subroutine to set this tuning parameter for your particular machine. You are encouraged to modify this subroutine to set the tuning parameters for your particular machine using the option and problem size information in the arguments.