I'm trying to identify valgrind's support for different Programing languages, I just want to find the valgrind's support for the SYCL applications, if supports how to profile the SYCL Application, If not why?
I tried finding the documents related to SYCL profiling and I found that SYCL has its own profiler and also found a blog related to debugging SYCL using Valgrind, but I didn't get the documents related to profiling using Valgrind.
No, Valgrind doesn't support any form of partitioned execution.
The component that executes on the CPU should be OK to run in Valgrind. But Valgrind contains no code to instrument the part that runs on GPU/FPGA/DSP. There is also a major conceptual difference between the execution models. On CPUs Valgrind runs with a global lock and behaves as if there is just one CPU whilst GPUs are massively parallel. If you could only use one GPU element at a time I imagine that it would be unfeasibly slow.