Hojin Lee
hjlee1371
AI & ML interests
None yet
Recent Activity
upvoted a paper 2 days ago
Kanana: Compute-efficient Bilingual Language Models
new activity
5 months ago
nvidia/Llama-3.1-Minitron-4B-Width-Base:Teacher correction training hyperparameters
new activity
10 months ago
HuggingFaceFW/ablation-model-fineweb-v1:Reproducing evaluation results
Organizations
None yet
hjlee1371's activity
Teacher correction training hyperparameters
2
#13 opened 5 months ago by hjlee1371
Reproducing evaluation results
1
#3 opened 10 months ago by hjlee1371
BOS token prepending?
5
#9 opened 11 months ago by hjlee1371