If I use fma(a, b, c) in cuda, it means that the formula ab+c is calculated in a single ternary operation. But if I want to calculate -ab+c, does the invoking fma(-a, b, c) take one more multiply operation ?
What is the instruction number per cycle in fma with minus?
364 Views Asked by Jannus YU At
1
There are 1 best solutions below
Related Questions in CUDA
- Redis/Resque [Error connecting to Redis on localhost:6379 (ECONNREFUSED)]
- The Heroku's Python doesn't find redis(redistogo) for import
- Is it possible for two apps on the same heroku account to access the same RedisToGo Server?
- call': ERR Invalid IP address in MEET (Redis::CommandError) Redis
- phpredis with Redis to go
- Heroku Resque Redis::TimeoutError: Connection timed out
- Heroku won't connect to redis to go [ioredis] Unhandled error event: Error: connect ECONNREFUSED 127.0.0.1:6379
- Redis Queue + python-rq: Right pattern to prevent high memory usage?
- Is RedisToGo available "locally" on heroku?
- clojure carmine redis connection errors
Related Questions in FMA
- Redis/Resque [Error connecting to Redis on localhost:6379 (ECONNREFUSED)]
- The Heroku's Python doesn't find redis(redistogo) for import
- Is it possible for two apps on the same heroku account to access the same RedisToGo Server?
- call': ERR Invalid IP address in MEET (Redis::CommandError) Redis
- phpredis with Redis to go
- Heroku Resque Redis::TimeoutError: Connection timed out
- Heroku won't connect to redis to go [ioredis] Unhandled error event: Error: connect ECONNREFUSED 127.0.0.1:6379
- Redis Queue + python-rq: Right pattern to prevent high memory usage?
- Is RedisToGo available "locally" on heroku?
- clojure carmine redis connection errors
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
Popular # Hahtags
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
Unfortunately shader assembly language is undocumented at that level.
However we can try it out:
gives
So the FFMA instruction can indeed take an additional sign to apply to the product (note that it is applied to b in the shader assembly instruction, however this gives the same result). You can try the same with double precision operands and other compute capabilities instead of
sm_60
as well, which will give you similar results.