How can I use nsys profile to profile vLLM?
I run nsys profile -t cuda,nvtx,cudnn,cublas,
but only the default CUDA stream shows up in the report file.
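
For reference, here is a minimal sketch of the invocation. Only the trace list is what I actually pass; the output name `vllm_report` and the script `my_vllm_script.py` are placeholders, not real paths from my setup. The snippet only assembles and prints the command, so it can be pasted without nsys installed.

```shell
# Trace list as used in the question.
TRACE="cuda,nvtx,cudnn,cublas"

# Placeholder output name and entry point (assumptions, not real paths).
OUTPUT="vllm_report"
TARGET="python my_vllm_script.py"

# Assemble the full command line and print it as a dry run.
CMD="nsys profile -t $TRACE -o $OUTPUT $TARGET"
echo "$CMD"
```

Running the printed command is what produces the report in which I only see the default stream.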