We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
请问run_search_o1.py代码中函数replace_recent_steps()的触发条件和作用分别是什么?我仔细阅读了代码,并查看在使用deepseek-r1:8b模型时seq['output']的输出,暂未发现包含step的部分
The text was updated successfully, but these errors were encountered:
你好,在Reason-in-Documents模块输入的当前推理链会优先选择搜索和返回结果的历史,要是拼接完整推理链的话可能会过长而影响效果,因此这里进行了筛选。
对于DeepSeek-R1-Distill系列模型,由于其指令遵循能力较差,直接prompt可能调用搜索比较少。因此建议优先选择QwQ模型进行实验。若希望使用小参数的R1模型,可以尝试我们最新的方法WebThinker,其中有训练的DeepSeek-R1-7B模型,有更好的调用搜索等工具的能力。
Sorry, something went wrong.
No branches or pull requests
请问run_search_o1.py代码中函数replace_recent_steps()的触发条件和作用分别是什么?我仔细阅读了代码,并查看在使用deepseek-r1:8b模型时seq['output']的输出,暂未发现包含step的部分
The text was updated successfully, but these errors were encountered: