DeepSeek AI Development: $1.6 Billion Spent, Debunking Affordability Myth
DeepSeek's chatbot, which introduced itself with the intriguing line "Hi, I was created so you can ask anything and get an answer that might even surprise you," has quickly emerged as a formidable competitor in the AI market. Its impact was so significant that it led to one of NVIDIA's largest stock price drops. This achievement is rooted in DeepSeek's innovative approach to AI model architecture and training methods.
DeepSeek's model stands out due to its use of several advanced technologies. The first is Multi-token Prediction (MTP), where the model predicts multiple words at once by analyzing different parts of a sentence. This not only improves accuracy but also enhances efficiency. Another key feature is the Mixture of Experts (MoE) architecture, which employs 256 neural networks, activating eight for each token processing task. This accelerates training and boosts performance. Lastly, Multi-head Latent Attention (MLA) focuses on crucial parts of a sentence, repeatedly extracting key details to capture important nuances in the input data.
Image: ensigame.com
DeepSeek, a prominent Chinese startup, claims to have developed this competitive AI model at a minimal cost. They state that they spent only $6 million on training DeepSeek V3, utilizing just 2048 graphics processors. However, analysts from SemiAnalysis have uncovered that DeepSeek operates a vast computational infrastructure, comprising around 50,000 Nvidia Hopper GPUs, including 10,000 H800 units, 10,000 H100s, and additional H20 GPUs. These resources are spread across multiple data centers and used for AI training, research, and financial modeling.
Image: ensigame.com
The company's total investment in servers is approximately $1.6 billion, with operational expenses estimated at $944 million. DeepSeek is a subsidiary of the Chinese hedge fund High-Flyer, which spun off the startup in 2023 to focus on AI technologies. Unlike most startups, DeepSeek owns its data centers, allowing full control over AI model optimization and faster innovation implementation. The company remains self-funded, enhancing its flexibility and decision-making speed.
Image: ensigame.com
Moreover, DeepSeek attracts top talent from leading Chinese universities, with some researchers earning over $1.3 million annually. Despite the company's claim of spending just $6 million on training, this figure only accounts for GPU usage during pre-training and excludes research expenses, model refinement, data processing, and infrastructure costs. Since its inception, DeepSeek has invested over $500 million in AI development. Its compact structure enables it to implement AI innovations actively and effectively.
Image: ensigame.com
DeepSeek's example demonstrates that a well-funded, independent AI company can compete with industry leaders. However, experts note that the company's success is due to significant investments, technical breakthroughs, and a strong team, rather than a "revolutionary budget" for developing AI models. Despite this, DeepSeek's costs remain lower than those of its competitors; for instance, while DeepSeek spent $5 million on R1, ChatGPT4o cost $100 million to train.
-
Jul 02,22Isophyne Debuts as Original Character in Marvel Contest of Champions Kabam introduces a brand-new original character to Marvel Contest of Champions: Isophyne. This unique champion, a fresh creation from Kabam's developers, boasts a striking design reminiscent of the film Avatar, incorporating copper-toned metallic accents. Isophyne's Role in the Contest Isophyne ent
-
Dec 13,24Genshin Impact Flops into S.E.A Aquarium for Aquatic Adventure Get ready for a "fin-tastic" adventure! S.E.A. Aquarium and Genshin Impact are joining forces for the Teyvat S.E.A. Exploration event, running from September 12th to October 28th, 2024. This unique collaboration marks the first time Genshin Impact has partnered with an aquarium, offering an unforge
-
Jan 27,25Roblox: Bike Obby Codes (January 2025) Bike Obby: Unlock Awesome Rewards with These Roblox Codes! Bike Obby, the Roblox cycling obstacle course, lets you earn in-game currency to upgrade your bike, buy boosters, and customize your ride. Mastering the various tracks requires a top-tier bike, and thankfully, these Bike Obby codes deliver
-
May 18,24Acolyte Joins Grimguard Tactics in Content Update Grimguard Tactics, the story-driven dark fantasy RPG, receives a major content update on November 28th! A month after its release on Android and iOS, players can look forward to exciting new additions: The Acolyte, a brand-new support hero class, joins the fray. This blood-bending character wields