DeepSeek in Action

by Jing Dai

★★★★☆
4.3 (554)

US$42.50

15% OFF CODE: SAVE15

Description

From fundamental concepts to advanced implementations, this book thoroughly explores the DeepSeek-V3 model, focusing on its Transformer-based architecture, technological innovations, and applications. The book begins with a thorough examination of theoretical foundations, including self-attention, positional encoding, the Mixture of Experts mechanism, and distributed training strategies. It then explores DeepSeek-V3’s technical advancements, including sparse attention mechanisms, FP8 mixed-preci

Explore Related Tags