An in-depth look at DeepSeek: DeepSeekMoE and DeepSeekMLA, cheap V3 training, the US chip ban, “distillation” from other models, impact on Nvidia, AGI, and more (Ben Thompson/Stratechery) 27-01-2025
Ben Thompson / Stratechery: An in-depth look at DeepSeek: DeepSeekMoE and DeepSeekMLA, cheap V3 training, the US chip ban, “distillation” from other models, impact on Nvidia, AGI, and more — It’s Monday, January 27. Why haven’t you written about DeepSeek yet? — I did! I wrote about R1 last Tuesday. Lees verder op Tech Meme