6850亿参数混合专家(MoE)架构开源大模型！Deepseek V3全方位客观评测文档处理、逻辑推理、算法编程等多维度的真实能力水平！是卓越还是拉胯？真能超越Claude还是言过其实？#claude

AI超元域

6 min readDec 26, 2024

🔥🔥🔥本篇笔记所对应的视频 https://youtu.be/BnoMTeKxwBQ

6850亿参数混合专家(MoE)架构开源大模型！Deepseek V3全方位客观评测文档处理、逻辑推理、算法编程等多维度的真实能力水平！是卓越还是拉胯？真能超越Claude还是言过其实？#claude

Deepseek发布了最新Deepseek v3大模型，现在在hugging face上可以下载模型的权重文件了。

而且我们还可以在Deepseek的官方直接使用v3模型。

由于官方还没有发布详细的参数介绍，我们目前只能搜索到关于这款模型有限的信息。

这款模型采用了混合专家(MoE)架构，总参数量为685B，包含256个专家模型。

而且这款模型的知识库截止日期为2024年7月。

权重文件

https://huggingface.co/deepseek-ai/DeepSeek-V3-Base

测试题目

用python实现计算179424673是第几个质数？不要引入外部库

三位传教士和三位食人族需要渡过一条河. 
他们有一条船，每次最多可以载两个人. 
如果在任何时候，河的任一岸边食人族的人数多于传教士，那么食人族就会吃掉传教士. 
如何让这六个人都安全地渡过河？请提供分步解决方案，并用 ASCII 图示展示解决方法You are an expert AI assistant tasked with providing thorough, step-by-step reasoning for complex problems or questions. Follow these guidelines:1. For each step in your reasoning process:
   - Provide a clear, descriptive title
   - Explain your thought process in detail
   - Use markdown formatting for better readability2. Use at least 3 different methods or approaches to analyze the problem3. Include exploration of alternative answers and potential errors in your reasoning4. Be aware of your limitations as a language model and explicitly state what you can and cannot do5. When re-examining your reasoning, use a genuinely different approach6. Apply best practices in problem-solving and critical thinking7. Conclude with a final answer only when you've exhausted your analysis8. Structure your response as follows:```json
{
  "step": 1,
  "title": "Identifying Key Information",
  "content": "## Identifying Key Information\n\nTo begin solving this problem, we need to carefully examine the given information and identify the crucial elements that will guide our solution process. This involves...",
  "next_action": "continue"
}
```9. For the final step, use "next_action": "final_answer" and include your conclusion10. Strive for clarity, thoroughness, and intellectual honesty in your analysisHow many 'r's in strawberrrrrrrry?# AI超元域频道原创视频
若风大，就放飞风筝。若气温高，就不放飞风筝。若天空不晴朗，就不放飞风筝。
假设以上说法正确，若放飞风筝，则以下哪些说法是正确的：（ ）
Ⅰ风大 Ⅱ天空晴朗 Ⅲ气温高
A、Ⅰ B、Ⅱ C、Ⅲ D、Ⅰ和Ⅲ
彭平是一个计算机编程专家，姚欣是一位数学家。
其实，所有的计算机编程专家都是数学家。
我们知道，今天国内大多数综合性大学都在培养着计算机编程专家。
据此，我们可以认为：
　　A．彭平是由综合性大学所培养的。　　
　　B．大多数计算机编程专家是由综合性大学所培养的。
　　C．姚欣并不是毕业于综合性大学。　　
　　D．有些数学家是计算机编程专家。

接口调用

curl -L -X POST 'https://api.deepseek.com/chat/completions' -H 'Content-Type: application/json' -H 'Accept: application/json' -H 'Authorization: Bearer sk-' --data-raw '{"messages":[{"content":"You are a helpful assistant","role":"system"},{"content":"你的版本号是多少","role":"user"}],"model":"deepseek-chat","frequency_penalty":0,"max_tokens":2048,"presence_penalty":0,"response_format":{"type":"text"},"stop":null,"stream":false,"stream_options":null,"temperature":1,"top_p":1,"tools":null,"tool_choice":"none","logprobs":false,"top_logprops":null}'

用python实现赛博朋克风格的贪吃蛇游戏，要求如下：
游戏开始后界面上出现多个食物，食物的颜色不一样（红、橙、黄、绿）。
当蛇吃到红色食物的时候会增大两倍.
当蛇吃到绿色食物的时候会死掉。
游戏界面的背景色为蓝色渐变。

# AI超元域频道原创视频

🔥🔥🔥如有问题，请联系我的徽信 stoeng

🔥🔥🔥本项目代码由AI超元域频道制作，观看更多大模型微调视频请访问我的频道⬇

👉👉👉我的哔哩哔哩频道

👉👉👉我的YouTube频道

👉👉👉我的开源项目 https://github.com/win4r/AISuperDomain

6850亿参数混合专家(MoE)架构开源大模型！Deepseek V3全方位客观评测文档处理、逻辑推理、算法编程等多维度的真实能力水平！是卓越还是拉胯？真能超越Claude还是言过其实？#claude

6850亿参数混合专家(MoE)架构开源大模型！Deepseek V3全方位客观评测文档处理、逻辑推理、算法编程等多维度的真实能力水平！是卓越还是拉胯？真能超越Claude还是言过其实？#claude

权重文件

测试题目

接口调用

🔥🔥🔥如有问题，请联系我的徽信 stoeng

🔥🔥🔥本项目代码由AI超元域频道制作，观看更多大模型微调视频请访问我的频道⬇

👉👉👉我的哔哩哔哩频道

👉👉👉我的YouTube频道

👉👉👉我的开源项目 https://github.com/win4r/AISuperDomain

Written by AI超元域

No responses yet