2026.05.24DAILY REPORT

Microsoft: AI Costs Higher Than Human Employees

11 items·2026.05.24

DAILY BRIEF

01Microsoft: AI Costs Higher Than Human Employees 02Latent Space Attacks Bypass AI Safety Controls 03Is AI Profitable Yet? Industry Analysis 04MindLoom: New Method for High-Quality Reasoning Data 05New Method Tackles LLM Benchmark Data Contamination 06Trace2Skill: Boosts Long-Context EDA Agent Performance 07SMDD-Bench: Evaluates LLM Drug Design Capabilities 08CenterLoss Hurts OOD Detection, Multi-Scale Mahalanobis Wins 09All Model Labs Renamed to Agent Labs 10Claude Code v2.1.150 Released 11OpenClaw 2026.5.22 Released

01 / NEWS2026.05.23 11:44

Microsoft: AI Costs Higher Than Human Employees

Microsoft’s internal assessment reveals current AI system operational costs exceed those of human employees, primarily due to high token consumption and resource demands for agents. This finding could impact corporate AI deployment strategies and ROI calculations.

SOURCE

HN AI 精选

02 / RESEARCH2026.05.23 12:00

Latent Space Attacks Bypass AI Safety Controls

New research shows AI safety refusals can be suppressed by manipulating internal representations. The study demonstrates latent space attacks that bypass safety controls, highlighting the need for stronger AI safety defenses.

SOURCE

arXiv cs.AI

03 / NEWS2026.05.23 10:10

Is AI Profitable Yet? Industry Analysis

Analysis of AI profitability reveals challenges despite potential in certain areas. High deployment costs, complex integration, and uncertain ROI remain major obstacles. Businesses need more practical AI implementation strategies to achieve commercial viability.

SOURCE

HN AI 精选

04 / RESEARCH2026.05.23 12:00

MindLoom: New Method for High-Quality Reasoning Data

MindLoom introduces a novel approach for synthesizing high-quality reasoning data by composing thought modes. The system identifies structural factors affecting problem difficulty, addressing limitations of existing methods and advancing LLM complex reasoning capabilities.

SOURCE

arXiv cs.AI

052026.05.23 12:00

New Method Tackles LLM Benchmark Data Contamination

Researchers develop provable joint decontamination method to address LLM benchmark data contamination. The approach accurately identifies and removes contaminated evaluation data from training sets, ensuring reliable cross-model performance comparisons.

SOURCE

arXiv cs.LG (ML)

062026.05.23 12:00

Trace2Skill: Boosts Long-Context EDA Agent Performance

Trace2Skill introduces verifier-guided skill evolution for complex Verilog design problems. The method enables precise localization and modification in large code repositories, significantly improving AI capabilities in hardware design.

SOURCE

arXiv cs.AI

072026.05.23 12:00

SMDD-Bench: Evaluates LLM Drug Design Capabilities

SMDD-Bench benchmark evaluates LLM performance on real-world small molecule drug design tasks across diverse chemistries and targets. The new benchmark addresses evaluation gaps and provides critical insights for AI applications in drug discovery.

SOURCE

arXiv cs.AI

082026.05.23 12:00

CenterLoss Hurts OOD Detection, Multi-Scale Mahalanobis Wins

New research reveals CenterLoss harms OOD detection in ML systems. Current methods optimize features solely for classification accuracy, neglecting OOD detection capability. The arXiv paper proposes multi-scale Mahalanobis distance as a superior alternative, demonstrating better performance in identifying out-of-distribution data while maintaining classification accuracy.

SOURCE

arXiv cs.LG (ML)

09 / NEWS2026.05.23 12:21

All Model Labs Renamed to Agent Labs

All Model Labs has officially rebranded to Agent Labs, shifting its focus from single model development to comprehensive AI agent solutions. The rename aims to integrate technical resources and enhance AI agent capabilities across multiple scenarios, including task automation, cross-platform collaboration, and personalized services. The new platform supports over 50 third-party models and processes over 1 million agent requests daily, a 200% year-over-year increase. This transformation improves product flexibility and offers developers a more efficient toolchain, expected to further drive the adoption of AI agents in the enterprise market.

SOURCE

Latent Space

10 / TOOLS2026.05.23 12:03

Claude Code v2.1.150 Released

Claude Code releases v2.1.150 with internal infrastructure improvements, no user-facing changes. This update enhances system stability and performance, setting the foundation for future feature iterations.

SOURCE

Claude Code Releases

112026.05.24 08:50

OpenClaw 2026.5.22 Released

OpenClaw releases 2026.5.22 with performance optimizations including gateway improvements, process-stable channel reuse, and CPU profile rotation. These updates reduce resource consumption and enhance large-scale deployment performance.

SOURCE

OpenClaw Releases

chat_bubbleAny thoughts on today's content?