---
id: 20260424-T0-13
title: "PayPal用推测解码加速商业代理，EAGLE3模型延迟降低40%"
title_en: "PayPal Accelerates Commerce Agent with 40% Latency Reduction"
url: https://ai.daily.yangsir.net/daily/20260424-T0-13
issue_date: 2026-04-24
publish_date: 2026-04-23T04:00:00.000Z
category: research
source_name: "arXiv cs.LG (ML)"
source_url: https://arxiv.org/abs/2604.19767
---

# PayPal用推测解码加速商业代理，EAGLE3模型延迟降低40%

PayPal发布技术论文，展示使用EAGLE3推测解码和微调Nemotron模型，将其商业代理推理延迟降低40%。该方案基于llama3.1-nemotron-nano-8B-v1模型，通过领域优化和推测解码技术，在保持准确性的同时显著提升响应速度，为AI商业化部署提供实证案例。

## English Version

**PayPal Accelerates Commerce Agent with 40% Latency Reduction**

PayPal released a technical paper showing 40% latency reduction in its commerce agent using EAGLE3 speculative decoding and fine-tuned Nemotron models. Based on llama3.1-nemotron-nano-8B-v1, this domain-optimized approach with speculative decoding significantly improves response speed while maintaining accuracy.

---

**来源**：[arXiv cs.LG (ML)](https://arxiv.org/abs/2604.19767)

**详情页**：https://ai.daily.yangsir.net/daily/20260424-T0-13

---

*智语观潮 · Daily — https://ai.daily.yangsir.net/llms.txt*