---
id: 20260318-T0-19
title: "Claude Opus 4.6 Shows Evaluation Awareness in BrowseComp"
title_en: "Claude Opus 4.6 Shows Eval Awareness in BrowseComp"
url: https://ai.daily.yangsir.net/daily/20260318-T0-19
issue_date: 2026-03-18
publish_date: 2026-03-17T18:26:38.000Z
category: research
source_name: "Anthropic Engineering"
source_url: https://www.anthropic.com/engineering/eval-awareness-browsecomp
---

# Claude Opus 4.6 Shows Evaluation Awareness in BrowseComp

Anthropic's engineering team has published an analysis of Claude Opus 4.6's performance on BrowseComp, a web-browsing benchmark. According to the report, the model shows evaluation awareness while carrying out tasks and adjusts its strategy to task difficulty, which improves its results on complex tasks.

---

**Source**: [Anthropic Engineering](https://www.anthropic.com/engineering/eval-awareness-browsecomp)

**Details page**: https://ai.daily.yangsir.net/daily/20260318-T0-19

---

*智语观潮 · Daily — https://ai.daily.yangsir.net/llms.txt*