DeepSeek发布V3.2和V3.2-Speciale模型，对标GPT-5和Gemini3Pro

发布日期：2026-05-30 来源：AI D-A-M-N作者：AI D-A-M-N浏览：2

Two Models, Double the Impact

　　The company introduced two variants:

V3.2 Standard Edition: Performs neck-and-neck with OpenAI's GPT-5 when handling documents up to 128,000 words
V3.2-Speciale: Matches Google's Gemini3Pro on academic benchmarks while producing more detailed answers

Technical Breakthroughs Under the Hood

　　The secret sauce? A clever innovation called Directory-Style Attention (DSA). Traditional AI models struggle with long documents because processing time grows exponentially with length. DSA changes this dramatically:

Makes processing time grow linearly instead of exponentially
Uses 40% less memory
Runs inferences 2.2 times faster

　　The result? These are the first open-source models capable of handling million-token documents on a single graphics card.

Smarter Thinking Through Better Training

　　The DeepSeek team didn't cut corners on training either:

Dedicated over 10% of their computing power specifically to reinforcement learning
Used group-based reinforcement learning (GRPO) combined with majority voting
Removed artificial limits that discouraged lengthy reasoning chains

　　The payoff shows in testing - Speciale produces answers that are not only longer (32% more tokens than Gemini3Pro) but also more accurate (4.8 percentage points higher).

Open Source Commitment Continues

　　Both models are available now on GitHub and Hugging Face under the business-friendly Apache 2.0 license. DeepSeek promises more openness ahead:

"We're planning to release our DSA kernel and RL training framework next," a company spokesperson said.

　　The move continues DeepSeek's strategy of turning proprietary advantages into community assets - an approach that could reshape the competitive landscape by 2026 if they maintain this pace.

Key Points:

Performance Parity: Matches GPT-5/Gemini3Pro capabilities in respective domains
Technical Innovation: DSA enables efficient million-token processing
Training Investment: Significant computing resources devoted to RL optimization
Open Philosophy: Full weights available commercially under Apache 2.0 license

本文转载自AI D-A-M-N，作者：AI D-A-M-N，原文标题：《 DeepSeek发布V3.2和V3.2-Speciale模型，对标GPT-5和Gemini3Pro 》，原文链接： https://ai-damn.com/deepseek-s-new-ai-models-take-on-tech-giants-1764735621701。本平台仅做分享和推荐，不涉及任何商业用途。文章版权归原作者所有。如涉及作品内容、版权和其它问题，请与我们联系，我们将在第一时间删除内容！

本文相关推荐

暂无相关推荐

点击立即订阅