DeepSeek: Everything you need to know about the AI chatbot app
DeepSeek has gone viral.
Chinese AI lab DeepSeek broke into the mainstream consciousness this week after its chatbot app rose to the top of the Apple App Store charts. DeepSeek's AI models, which were trained using compute-efficient techniques, have led Wall Street analysts and technologists to question whether the United States can maintain its lead in the AI race and whether demand for AI chips will hold up.
But where did DeepSeek come from, and how did it rise to international fame so quickly?
DeepSeek's trader origins
DeepSeek is backed by High-Flyer Capital Management, a Chinese quantitative hedge fund that uses AI to inform its trading decisions.
AI enthusiast Liang Wenfeng co-founded High-Flyer in 2015. Liang, who reportedly began dabbling in trading while a student at Zhejiang University, launched High-Flyer Capital Management as a hedge fund in 2019, focused on developing and deploying AI algorithms.
In 2023, High-Flyer started DeepSeek as a lab dedicated to researching AI tools separate from its financial business. With High-Flyer as one of its investors, the lab spun off into its own company, also called DeepSeek.
From day one, DeepSeek built its own data center clusters for model training. But like other AI companies in China, DeepSeek has been affected by U.S. export bans on hardware. To train one of its more recent models, the company was forced to use Nvidia H800 chips, a less powerful version of the H100 chip available to U.S. companies.
DeepSeek's technical team is said to skew young. The company reportedly recruits doctorate-level AI researchers aggressively from top Chinese universities. DeepSeek also hires people without any computer science background to help its technology better understand a wide range of subjects, according to The New York Times.
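DeepSeek's strong models
DeepSeek unveiled its first set of models, DeepSeek Coder, DeepSeek LLM, and DeepSeek Chat, in November 2023. But it was not until the company released its next-generation DeepSeek-V2 family of models that the AI industry started paying attention.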
DeepSeek-V2, a general-purpose text- and image-analyzing system, performed well in various AI benchmarks and was far cheaper to run than comparable models at the time. It forced DeepSeek's domestic competitors, including ByteDance and Alibaba, to cut the usage prices for some of their models and make others completely free.
DeepSeek-V3, launched in December 2024, only added to DeepSeek's notoriety.
According to DeepSeek's internal benchmark testing, DeepSeek-V3 outperforms both downloadable, openly available models such as Meta's Llama and “closed” models that can only be accessed through an API, such as OpenAI's GPT-4o.
Equally impressive is DeepSeek's R1 “reasoning” model. DeepSeek claims that R1, released in January, performs as well as OpenAI's o1 model on key benchmarks.
As a reasoning model, R1 effectively fact-checks itself, which helps it avoid some of the pitfalls that normally trip up models. Reasoning models take a little longer than a typical non-reasoning model to arrive at solutions, usually seconds to minutes longer. The upside is that they tend to be more reliable in domains such as physics, science, and math.
There is a downside to R1, DeepSeek-V3, and DeepSeek's other models, however. Because they are Chinese-developed AI, they are subject to benchmarking by China's internet regulator to ensure that their responses “embody core socialist values.” In DeepSeek's chatbot app, for example, R1 will not answer questions about Tiananmen Square or Taiwan's autonomy.
A disruptive approach
If DeepSeek has a business model, it is not clear what that model is. The company prices its products and services well below market value and gives others away for free.
The way DeepSeek tells it, efficiency breakthroughs have enabled it to maintain extreme cost competitiveness. Some experts, however, dispute the figures the company has supplied.
Whatever the case, developers have taken to DeepSeek's models, which are not open source as the phrase is commonly understood but are available under permissive licenses that allow commercial use. According to Clem Delangue, the CEO of Hugging Face, one of the platforms hosting DeepSeek's models, developers on Hugging Face have created more than 500 “derivative” models of R1 that have racked up 2.5 million downloads combined.
DeepSeek's success against larger and more established rivals has been described as “upending AI” and ushering in “a new era of AI brinkmanship.” The company's success was at least partly responsible for Nvidia's stock price falling 18% on Monday, and it drew a public response from OpenAI CEO Sam Altman.
As for what DeepSeek's future holds, it is unclear. Improved models are a given. However, the U.S. government appears to be growing wary of what it perceives as harmful foreign influence.