---
id: 20260602-T0-07
title: "EHRBench Builds a Benchmark for Evaluating Medical LLM Decision-Making"
title_en: "EHRBench: LLM Clinical Decision Benchmark"
url: https://ai.daily.yangsir.net/daily/20260602-T0-07
issue_date: 2026-06-02
publish_date: 2026-06-01T04:00:00.000Z
category: research
source_name: "arXiv cs.AI"
source_url: https://arxiv.org/abs/2605.30637
---

# EHRBench Builds a Benchmark for Evaluating Medical LLM Decision-Making

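Researchers have introduced EHRBench, a benchmark that automatically evaluates how LLMs perform on clinical decision-making tasks based on electronic health records. The framework assesses diagnosis, treatment selection, and health outcome prediction, filling a gap in evaluation.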

## English Version

**EHRBench: LLM Clinical Decision Benchmark**

EHRBench is an automated benchmark for evaluating LLMs on clinical decision-making tasks using electronic health records. It measures diagnosis, treatment selection, and health outcome prediction capabilities.

---

**Source**: [arXiv cs.AI](https://arxiv.org/abs/2605.30637)

**Details page**: https://ai.daily.yangsir.net/daily/20260602-T0-07

---

*智语观潮 · Daily — https://ai.daily.yangsir.net/llms.txt*