I was learning about cache line, and the effect of loop stride on the cache. I came across this page which shows the execution time of a loop vs the loop stride. According to the benchmark, increasing the loop stride decreases the execution time which is very confusing to me. As I understand if the cache line is 64 bytes, and lets assume if in the first case the loop stride is just 1 which means the loop goes over the array element sequentially then that should have the least execution time because 16 integers (4byte x 16 = 64bytes) are loaded into the cache. The execution time should be lowest up to a stride of 16 because all 16 elements are loaded into the same cache line. When the stride is increased above 16 that should increase the execution time because the array element won't be in the cache line, but the graph on the page is completely opposite.
Loop stride and cache line
1.4k Views Asked by zer0c00l At
1
There are 1 best solutions below
Related Questions in ARRAYS
- Media files end up in in a pycharm subdirectory when uploading
- Android Studio (Intellij) keeps adding second line to modules.xml
- Visual Studio 2013 CE keeps overwriting Task list
- Vim: last word on user specific settings?
- C# - Settings.Default.Save() is very slow
- Ivy Settings - Changing the way conflicts are resolved
- How to check if drop down menu of ActionBar is active
- how to permanently save options(dplyr.width = Inf) in Rstudio?
- change the settings file items in .Net console application
- Eclipse error: "Path must include project and resource name" on invoking clean
Related Questions in CACHING
- Media files end up in in a pycharm subdirectory when uploading
- Android Studio (Intellij) keeps adding second line to modules.xml
- Visual Studio 2013 CE keeps overwriting Task list
- Vim: last word on user specific settings?
- C# - Settings.Default.Save() is very slow
- Ivy Settings - Changing the way conflicts are resolved
- How to check if drop down menu of ActionBar is active
- how to permanently save options(dplyr.width = Inf) in Rstudio?
- change the settings file items in .Net console application
- Eclipse error: "Path must include project and resource name" on invoking clean
Related Questions in CPU-ARCHITECTURE
- Media files end up in in a pycharm subdirectory when uploading
- Android Studio (Intellij) keeps adding second line to modules.xml
- Visual Studio 2013 CE keeps overwriting Task list
- Vim: last word on user specific settings?
- C# - Settings.Default.Save() is very slow
- Ivy Settings - Changing the way conflicts are resolved
- How to check if drop down menu of ActionBar is active
- how to permanently save options(dplyr.width = Inf) in Rstudio?
- change the settings file items in .Net console application
- Eclipse error: "Path must include project and resource name" on invoking clean
Related Questions in CPU-CACHE
- Media files end up in in a pycharm subdirectory when uploading
- Android Studio (Intellij) keeps adding second line to modules.xml
- Visual Studio 2013 CE keeps overwriting Task list
- Vim: last word on user specific settings?
- C# - Settings.Default.Save() is very slow
- Ivy Settings - Changing the way conflicts are resolved
- How to check if drop down menu of ActionBar is active
- how to permanently save options(dplyr.width = Inf) in Rstudio?
- change the settings file items in .Net console application
- Eclipse error: "Path must include project and resource name" on invoking clean
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
Popular # Hahtags
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
In that example the Length is constant, so the larger the stride - the less elements you go through.
The interesting phenomena is that it doesn't apply below a cache line, and that's because you can't bring parts of a line. So below 16, you pay the same penalty of fetching all cache lines. Above 16, you start skipping some lines. above 32 for example (128B) you fetch every other line - hence +/- half the time (assuming your execution time is dominated by memory latency)