---
id: 20260302-T0-02
title: "HumanMCP数据集发布：模拟人类查询评估MCP工具检索性能"
title_en: "HumanMCP Dataset: Evaluating MCP Tool Retrieval Performance"
url: https://ai.daily.yangsir.net/daily/20260302-T0-02
issue_date: 2026-03-02
publish_date: 2026-03-02T05:00:00.000Z
source_name: "arXiv cs.AI"
source_url: https://arxiv.org/abs/2602.23367
---

# HumanMCP数据集发布：模拟人类查询评估MCP工具检索性能

研究人员发布HumanMCP数据集，专门用于评估模型上下文协议（MCP）工具检索性能。该数据集包含数千个模拟真实人类用户查询的样本，解决了现有评估数据集缺乏真实人类交互场景的问题，为MPC服务器工具检索提供了更精准的测试基准。

## English Version

**HumanMCP Dataset: Evaluating MCP Tool Retrieval Performance**

Researchers released the HumanMCP dataset specifically for evaluating Model Context Protocol (MCP) tool retrieval performance. Containing thousands of samples simulating real human user queries, this dataset addresses the lack of authentic human interaction scenarios in existing evaluation datasets, providing a more precise testing benchmark for MCP server tool retrieval.

---

**来源**：[arXiv cs.AI](https://arxiv.org/abs/2602.23367)

**详情页**：https://ai.daily.yangsir.net/daily/20260302-T0-02

---

*智语观潮 · Daily — https://ai.daily.yangsir.net/llms.txt*