Skip to content

replace_recent_steps()的作用 #36

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
mhdpr opened this issue Apr 25, 2025 · 1 comment
Open

replace_recent_steps()的作用 #36

mhdpr opened this issue Apr 25, 2025 · 1 comment

Comments

@mhdpr
Copy link

mhdpr commented Apr 25, 2025

请问run_search_o1.py代码中函数replace_recent_steps()的触发条件和作用分别是什么?我仔细阅读了代码,并查看在使用deepseek-r1:8b模型时seq['output']的输出,暂未发现包含step的部分

@sunnynexus
Copy link
Owner

你好,在Reason-in-Documents模块输入的当前推理链会优先选择搜索和返回结果的历史,要是拼接完整推理链的话可能会过长而影响效果,因此这里进行了筛选。

对于DeepSeek-R1-Distill系列模型,由于其指令遵循能力较差,直接prompt可能调用搜索比较少。因此建议优先选择QwQ模型进行实验。若希望使用小参数的R1模型,可以尝试我们最新的方法WebThinker,其中有训练的DeepSeek-R1-7B模型,有更好的调用搜索等工具的能力。

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants