---
id: 20260302-T0-17
title: "CiteAudit：大时代科学引用真实性基准测试"
title_en: "CiteAudit: Benchmark for LLM Citation Verification"
url: https://ai.daily.yangsir.net/daily/20260302-T0-17
issue_date: 2026-03-02
publish_date: 2026-03-02T05:00:00.000Z
source_name: "arXiv cs.CL (NLP)"
source_url: https://arxiv.org/abs/2602.23452
---

# CiteAudit：大时代科学引用真实性基准测试

CiteAudit是首个专门验证大模型引用真实性的基准测试。该研究揭示了LLM生成虚假引用的严重性，测试显示主流模型错误率高达18%。基准包含1万+真实和虚假引用对，可评估模型在科学文献检索和验证能力。科研机构可用该工具审查论文引用质量，防止学术不端。

## English Version

**CiteAudit: Benchmark for LLM Citation Verification**

CiteAudit is the first benchmark specifically designed to verify the authenticity of large language model citations. The study reveals the severity of LLM-generated false citations, showing mainstream models have error rates up to 18%. The benchmark includes over 10,000 pairs of real and fake citations to assess models' literature retrieval and verification capabilities. Research institutions can use it to review paper citation quality and prevent academic misconduct.

---

**来源**：[arXiv cs.CL (NLP)](https://arxiv.org/abs/2602.23452)

**详情页**：https://ai.daily.yangsir.net/daily/20260302-T0-17

---

*智语观潮 · Daily — https://ai.daily.yangsir.net/llms.txt*