Skip to main content
zhangshengdong
Log in
Sign up
张胜东
zhangshengdong
BZ
Teams
bz-zhangshengdong
Profile
Activity
Mon
Wed
Fri
Nov
Dec
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Runs
1-10
of 13
Name
Project
State
Created
instruct模型+用难数据集
OpenR1
Finished
7 months ago
base模型+用难数据集
OpenR1
Killed
7 months ago
加长length+增加思考过程的reward
OpenR1
Finished
8 months ago
优化ref_model_prompt+滑窗target_length
OpenR1
Killed
8 months ago
中文数据集+优化ref_model_prompt
OpenR1
Finished
8 months ago
ref_model_accuracy奖励函数与x*x_length_reward
OpenR1
Killed
8 months ago
length与accuracy奖励函数解耦
OpenR1
Finished
8 months ago
自定义length_reward+no_warmup
OpenR1
Finished
8 months ago
自定义length_reward+NuminaMathTIR
OpenR1
Killed
8 months ago
自定义length_reward+NuminaMathTIR
OpenR1
Killed
8 months ago
Loading...