I want an estimative of, for example, how much more expensive is a division in relation to addition, with IEEE 754 doubles on intel processors. I need that in order to estimate which operations could lead to faster performance in an specific inner loop.
How expensive are IEEE 754 double operations in respect to each other on Intel I7 chips?
179 Views Asked by MaiaVictor At
1
There are 1 best solutions below
Related Questions in X86
- Why do we need AX instead of MOV DS, data directly with a segment?
- Drawing circles of increasing radius
- Assembly print on screen using pop ecx
- How to add values from vector to each other
- Intel x64 instructions CMPSB/CMPSW/CMPSD/CMPSQ
- Compact implementation of logical AND in x86 assembly
- Can feenableexcept hurt a program performance?
- How do I display the result and remainder in ax and dx in Assembly (tasm)
- ASM : Trouble using int21h on real machine
- jmp instruction *%eax
- What steps are needed to load a second stage bootloader by name on a FAT32 file system in x86 Assembly?
- Assembly code to print a new line string
- Write System Call Argument Registers
- How to jump to an address saved in a register in intel assembly?
- Find middle value of a list
Related Questions in DOUBLE
- How to change Boolean array to double array
- Issues with adding doubles. Arithmetic Coding
- How do I convert a double into an n-character string using exponential notation?
- Addition of floating point, Why the First code work
- parseDouble exception on Android while parsing a valid Double value
- Add data fromEeditText to double
- Function to convert double to string with given options
- Convert DoubleBinding value to IntegerBinding
- Java: I want to get an Integer output in double datatype
- Can std::uniform_real_distribution<double>(0,1) return a value greater than 0.99999999999999994?
- Decoding an IEEE double precision float (8 byte)
- >= not working, R
- Screen flickers when setting background
- Java issue converting String to double
- print double in scientific format with no integer part
Related Questions in BIT-MANIPULATION
- Understanding ~ Operator
- Getting four bits from the right only in a byte using bit shift operations
- How this bitshift to build the number works?
- A + B without arithmetic operators, Python vs C++
- Faster way of adding negative signed to unsigned
- Setting a bit in hexadecimal number
- reverse a number's bits
- php synatax $b = (6 << 1); clarification
- Turning off a single GPIO pin on ARM9 (LPC3141)
- Toggle a given range of bits of an unsigned int in C
- javascript shifting >32-bit to get 64-bit Int
- Setting bits in a bit stream
- Efficient comparison of small integer vectors
- Perform integer division using multiplication
- Bitwise (Bitshift) operations on 64-bit integers in C++
Related Questions in INTEL
- How can I compile *without* various instruction sets enabled?
- Restrict MKL optimized scipy to single thread
- Why is genymotion running so slowly?
- Intel VT-X not found
- Intel Edison with Kinect
- Formatting a MicroSD card within OSX
- Can I run Cuda or OpenCl on Intel processor graphics I7 (3rd or 4rd generation)
- Contrast reduction - intel x86
- x86 assembly fading bmp with linear interpolation
- Why I'm getting "error expected an expression" while compile cilk program
- Intel HAXM's intelhaxm-android.exe is not running
- Cordova - Media Plugin - Intel XDK - IOS build fail
- intel xdk: my links are not working
- running a python script that requires matplotlib gives: ImportError: undefined symbol: __libm_sse2_sincos
- To which cache a function pointer belongs to?
Related Questions in IEEE
- How to prevent C99 floating point code from changing results with optimization level
- Casting between int, float and double in C
- How expensive are IEEE 754 double operations in respect to each other on Intel I7 chips?
- Standard Data Types
- IEEE single precision floating-point format
- Fast compare IEEE float greater than zero by cheating
- IEEE Interface Specification Standard
- Last-page column equalization for latex IEEEtran class?
- decimal32 vs float32, which is better for storage
- Bookmark error when using latex to generate pdf file
- Why are acknowledgments used in WLAN but not in IEEE 802.3?
- Why \markboth command doesn't work in my overleaf?
- IEEEtran: spaces between words in title
- How can I output bit pattern of infinity and NaN in C++?(IEEE standard)
- How to float long table in latex?
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
From the Intel 64 and IA-32 Architectures Optimization Reference Manual, pages 14-14 and 14-15,
ADDSDhas a latency of 5 cycles and a throughput of 1/cycle.DIVSDhas a latency of 62 cycles and a throughput of 1/61 cycles.So, if your code has tough data dependency problems, addition is about 12 times faster than division. With careful optimization, however, you can make addition 61 times faster than division. Other factors will probably make the gap larger.