InfoBedingungenDatenschutzKontakt
 
Wird aktualisiert
WAP: Weekly AI Papers

WAP: Weekly AI Papers

Veröffentlicht: 2025-01-08
© Ankit Sharma
WAP: Weekly AI Papers - QR Code
1 Folge
Audio
Anhören auf Apple Podcasts
1 Folge
Audio
Anhören auf Apple Podcasts
Veröffentlicht: 2025-01-08
© Ankit Sharma
Aktuelle Folge
DeepSeek V3

DeepSeek V3

Länge: 14:10
DeepSeek-V3, a 671B-parameter Mixture-of-Experts large language model. It covers the model's architecture, including Multi-Head Latent Attention and an innovative auxiliary-loss-free load balancing strategy for DeepSeekMoE. The training process, encompassing pre-training on 14.8 trillion tokens and post-training using supervised fine-tuning and reinforcement learning, is described.
paper: https://github.com/deepseek-ai/DeepSeek-V3/blob/main/DeepSeek_V3.pdf
Folgen-ID: 1000683217588
GUID: 3b9860e5-2323-42da-b17b-0fefeedbef33
Erscheinungs­datum: 8.1.2025, 20:00:22

Beschreibung

This show provides an overview of AI papers. The overview is generated using Google Illuminate and NotebookLM. Taking full advantage of the technology era we are living in. Making listening to audio discussions of your favorite papers easy and on the go.

Apple Podcasts: Kundenrezensionen

Kein Eintrag