Emergent Abilities of Large Language Models under Continued
Collection
Models trained using exponential moving average of weights (EMA), work done in Paper (todo: link the paper).
•
4 items
•
Updated
No model card