关于vllm-ascend的性能采集的问题

gbdjxgp · January 5, 2026, 2:03am

您好，相关问题链接：https://www.hiascend.com/forum/thread-0261202960094059300-1-1.html

我想了解的是：vllm-ascend采集数据的逻辑是怎么样的？

msserviceprofiler通过entry_points机制实现了自动profile功能，我了解到：v0.11.0并没有profiling_config.py这个文件但却能正常用msserviceprofiler，

那是不是profiling_config这个文件就不需要了？还是说这个文件存在的意义是给用户看的呢

RunLLM · January 5, 2026, 2:03am

我的理解是，vllm-ascend 的数据采集逻辑主要依赖于 torch_npu 的 profiler 或 msserviceprofiler，两者本质上是不同的采集工具，但采集流程类似，都是在模型推理过程中收集算子、内存等信息。profiling_config.py 文件的存在与否，主要影响的是配置采集参数和采集方式的灵活性，如果没有该文件，默认配置下 torch_npu 的 profiler 依然可以正常工作，但无法自定义采集细节；而 msserviceprofiler 是独立于 torch_npu profiler 的另一套采集实现，二者不是同一个东西，但采集流程类似，都是 hook 到推理流程中收集数据。据我所查，v0.11.0 没有 profiling_config.py 也不会影响 msserviceprofiler 的正常使用，因为它们的调用入口和实现机制是分离的。

至于“如果有 profiling_config.py 就不会调用 torch_npu 的 profiler 而是调用钩子函数吗”，我的理解是 profiling_config.py 只是提供配置选项，不会直接决定采集工具的切换，采集工具的选择还是由代码逻辑和环境变量决定。torch_npu 的 profiler 和 msserviceprofiler 实际上是两套采集体系，采集流程类似但实现不同，二者的数据采集流程都是在推理主流程中插桩收集算子、内存等信息。是否需要更详细的采集流程代码分析？
参考：vllm profiling 官方文档、vllm profiling 相关 issue

Sources:

Hint: Mention @RunLLM in the post for followups.

Topic		Replies	Views
Vllm ascend能够查看模型的计算图吗 Ascend Support	2	116	November 20, 2025
Vllm-ascend是否支持async推理？ Ascend Support	2	123	October 15, 2025
Vllm-ascend如何验证使用的是npu还是cpu General	5	245	August 6, 2025
The files in profile folder are not json General	1	41	November 20, 2025
使用vllm_ascend0.9.1提示Failed to import vllm_ascend_C: Ascend Support	2	207	November 20, 2025

关于vllm-ascend的性能采集的问题

Related topics