Skip to content

Conversation

@cdliang11
Copy link
Collaborator

@cdliang11 cdliang11 commented Sep 2, 2025

  • refine dataset
  • readme
  • test result

@cdliang11 cdliang11 requested a review from robin1001 September 2, 2025 06:53
@cdliang11 cdliang11 marked this pull request as draft September 2, 2025 06:53
@cdliang11 cdliang11 marked this pull request as ready for review September 2, 2025 07:01
@cdliang11 cdliang11 marked this pull request as draft September 2, 2025 07:20
@cdliang11 cdliang11 marked this pull request as ready for review September 9, 2025 10:51
@cdliang11 cdliang11 requested a review from robin1001 September 9, 2025 10:52
try:
x['txt'] = x['txt'].decode('utf8')
x['wav'] = io.BytesIO(x['wav'])
if "messages" in x.keys():
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

这块我们不加了,

对于 SFT 的数据,我们系统上设计只支持 jsonl 的

if 'messages' in item: # OpenAI role-content based SFT data
# OpenAI role-content based SFT data
# At least one pair of "user" and "assistant"
if 'messages' in item and len(item["messages"]) >= 2:
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

这个直接在下面 assert len(item['messages']) >= 2

@@ -0,0 +1,82 @@
#!/usr/bin/env python3
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

文件命名上我们都是用下划线形式compute_acc_of_contain.py

python west/bin/decode.py \
--data_path $data/chinese_qa.jsonl \
--model_dir $mdir \
--result_path $mdir/result.txt
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

现在 main 分支中输出该 jsonl 了

@cdliang11 cdliang11 marked this pull request as draft September 10, 2025 06:47
@robin1001 robin1001 marked this pull request as ready for review September 10, 2025 06:48
@robin1001 robin1001 merged commit 5b47db3 into main Sep 10, 2025
1 check passed
@robin1001 robin1001 deleted the qa-recipe branch September 10, 2025 06:48
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants