
MCP vLLM Benchmarking Tool

by Eliovp-BV

This is a proof of concept showing how MCP can be used to benchmark vLLM interactively.

We are no strangers to benchmarking; see our blog post:

Benchmarking vLLM

This is only an exploration of what is possible with MCP.

Usage

  1. Clone the repository
  2. Add it to your MCP servers:
    {
      "mcpServers": {
        "mcp-vllm": {
          "command": "uv",
          "args": [
            "run",
            "/Path/TO/mcp-vllm-benchmarking-tool/server.py"
          ]
        }
      }
    }
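
If your MCP client already has other servers configured, the entry above has to be merged into the existing `mcpServers` object rather than replacing it. The sketch below shows one way to do that merge in Python. The helper name `add_server_entry` and the `other-tool` entry are hypothetical, and where the client keeps its configuration file is left to you, since that varies by client.

```python
import json
from pathlib import Path


def add_server_entry(config: dict, repo_dir: str) -> dict:
    """Return a copy of `config` with an "mcp-vllm" entry pointing at server.py.

    Hypothetical helper: it mirrors the JSON snippet from the usage steps
    and leaves any other configured servers untouched.
    """
    merged = dict(config)
    servers = dict(merged.get("mcpServers", {}))
    servers["mcp-vllm"] = {
        "command": "uv",
        "args": ["run", str(Path(repo_dir) / "server.py")],
    }
    merged["mcpServers"] = servers
    return merged


if __name__ == "__main__":
    # Example input: a config that already lists another (made-up) server.
    existing = {"mcpServers": {"other-tool": {"command": "node", "args": ["index.js"]}}}
    updated = add_server_entry(existing, "/Path/TO/mcp-vllm-benchmarking-tool")
    print(json.dumps(updated, indent=2))
```

Replace `/Path/TO/...` with the directory where you cloned the repository before writing the result back to your client's configuration.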

Then you can prompt like this:

Do a vllm benchmark for this endpoint: http://10.0.101.39:8888 benchmark the following model: deepseek-ai/DeepSeek-R1-Distill-Llama-8B run the benchmark 3 times with each 32 num prompts, then compare the results, but ignore the first iteration as that is just a warmup.
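
The prompt asks the model to discard the first iteration as warmup and compare the rest. As a rough illustration of that comparison (not the tool's own code), the sketch below averages each numeric metric over the non-warmup runs. The metric name `request_throughput` and the sample values are assumptions for the example; use whatever fields your benchmark results actually contain.

```python
from statistics import mean


def summarize_runs(runs: list[dict], warmup: int = 1) -> dict:
    """Summarize numeric metrics across runs, skipping the first `warmup` runs."""
    measured = runs[warmup:]
    if not measured:
        raise ValueError("no runs left after discarding warmup iterations")

    def is_number(value) -> bool:
        return isinstance(value, (int, float)) and not isinstance(value, bool)

    # Keep only metrics that are present and numeric in every measured run.
    names = [
        name
        for name in measured[0]
        if all(is_number(run.get(name)) for run in measured)
    ]
    return {
        name: {
            "mean": mean(run[name] for run in measured),
            "min": min(run[name] for run in measured),
            "max": max(run[name] for run in measured),
        }
        for name in names
    }


if __name__ == "__main__":
    # Made-up numbers: three runs, the first one being the warmup.
    runs = [
        {"request_throughput": 1.0},
        {"request_throughput": 4.0},
        {"request_throughput": 6.0},
    ]
    print(summarize_runs(runs))
```

With three runs and `warmup=1`, only the second and third runs contribute to the summary.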

Todo:

  • Because vLLM sometimes emits stray output, the tool may report that it found invalid JSON. I have not looked into this in depth yet.
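
One possible workaround for the invalid-JSON issue, offered as an untested idea rather than the author's fix, is to scan the raw output for JSON objects and ignore everything else. The sketch below uses the standard library's `json.JSONDecoder.raw_decode`; the function name `extract_json_objects` and the sample log text are hypothetical.

```python
import json


def extract_json_objects(text: str) -> list[dict]:
    """Return every JSON object that can be decoded from noisy text."""
    decoder = json.JSONDecoder()
    found = []
    pos = 0
    while True:
        start = text.find("{", pos)
        if start == -1:
            break
        try:
            # raw_decode parses one JSON value starting at `start` and
            # reports the index where that value ends.
            obj, end = decoder.raw_decode(text, start)
        except json.JSONDecodeError:
            pos = start + 1  # not valid JSON here; try the next brace
            continue
        found.append(obj)
        pos = end
    return found


if __name__ == "__main__":
    # Made-up output mixing log lines with one valid result object.
    noisy = 'INFO loading {bad json\n{"request_throughput": 3.5, "completed": 32}\ndone'
    print(extract_json_objects(noisy))
```

This only recovers objects that are intact somewhere in the output; a result that is truncated or interleaved with log lines would still be lost.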

An interactive tool that lets users benchmark vLLM endpoints through MCP, enabling performance testing of LLMs with customizable parameters.
