Tian Li
RicardoLee
AI & ML interests
Natural Language Procesing, Automatic Speech Recognition, Reinforcement Learning
Organizations
None yet
RicardoLee's activity
13b的context len多大以及batch?
5
#1 opened over 1 year ago
by
lucasjin
此时不应降低学习率,warmup 等超参,而是应该放大到Pretrain 规模
3
#2 opened over 1 year ago
by
daner
关于train_sft.py中coati包
2
#3 opened over 1 year ago
by
BatmanBill
那这个怎么调用呢
4
#1 opened over 1 year ago
by
yjianchun
那这个怎么调用呢
4
#1 opened over 1 year ago
by
yjianchun