Commit History

Export small model at 850 epochs
ea296b2

Andrew DalPino commited on

Update model card
9c2b560

Andrew DalPino commited on

Use SmolTalk dataset for instruction-tuning
111c2a3

Andrew DalPino commited on

Add library metadata
8b2ddb8

Andrew DalPino commited on

Export small at 400 epochs
98bb233

Andrew DalPino commited on

Fix training forward pass
a0ce4f0

Andrew DalPino commited on

Fix unsqueeze missing dimension argument
a1fecbb

Andrew DalPino commited on

Optimize next-token prediction
c7028d4

Andrew DalPino commited on

Export LightGPT small at 150 epochs
98369b4

Andrew DalPino commited on

Improve the README
8d59dc3

Andrew DalPino commited on

Fix model card
253dffc

Andrew DalPino commited on

Blanket optimizations
e431f0f

Andrew DalPino commited on

A little nicer
fc4824e

Andrew DalPino commited on

Configurable feed-forward ratio
624da87

Andrew DalPino commited on

Fix README
d5dd39c

Andrew DalPino commited on

More scaling law stuff
873b967

Andrew DalPino commited on

Merge branch 'master'
c00d012

Andrew DalPino commited on

Compensate for git issues
f4f6bf0

Andrew DalPino commited on

Initial commit
ad56477

Andrew DalPino commited on

Broad improvements
ab12a97

Andrew DalPino commited on

Use Fineweb instead of Openwebtext
3325763

Andrew DalPino commited on

Add MFU estimation for Ampere GPUs
0cc4ecd

Andrew DalPino commited on

Compile before wrapping with FSDP
19b8dfb

Andrew DalPino commited on

Add FSDP
f28a628

Andrew DalPino commited on

Cleanup checkpointing history
160e81f

Andrew DalPino commited on

A little nicer
8c4359f

Andrew DalPino commited on

Merge branch 'main' of https://huggingface.co./andrewdalpino/LightGPT
ec4551f

Andrew DalPino commited on

Remove old license info
dc00b6b

Andrew DalPino commited on

Add metadata to README
c50fdd6
verified

andrewdalpino commited on

Merge branch 'main' of https://huggingface.co./andrewdalpino/LightGPT
305cc73

Andrew DalPino commited on

Initial commit
cac8fe7

Andrew DalPino commited on