Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
andrewdalpino
/
LightGPT
like
2
Text Generation
PyTorch
TensorBoard
ONNX
Safetensors
HuggingFaceFW/fineweb
HuggingFaceTB/smoltalk
English
NoPE
License:
apache-2.0
Model card
Files
Files and versions
Metrics
Training metrics
Community
main
LightGPT
2 contributors
History:
40 commits
Andrew DalPino
Compensate for poor HuggingFace UX
a174dbc
5 days ago
checkpoints
Compensate for git issues
27 days ago
datasets
Compensate for git issues
27 days ago
exports
Add runs data
5 days ago
runs
Add runs data
5 days ago
.gitattributes
1.52 kB
Initial commit
about 1 month ago
.gitignore
136 Bytes
Blanket optimizations
25 days ago
README.md
15 kB
Export small model at 850 epochs
17 days ago
beam_search.py
2.82 kB
Use SmolTalk dataset for instruction-tuning
22 days ago
config.json
193 Bytes
Push model using huggingface_hub.
5 days ago
data.py
7.38 kB
Use SmolTalk dataset for instruction-tuning
22 days ago
export_model.ipynb
5.98 kB
Add runs data
5 days ago
generate.py
2.84 kB
Export small model at 850 epochs
17 days ago
instruction-tune.py
6.62 kB
Use SmolTalk dataset for instruction-tuning
22 days ago
model.py
16.4 kB
Add runs data
5 days ago
model_sizing.ipynb
79.8 kB
Use SmolTalk dataset for instruction-tuning
22 days ago
pretrain.py
11.6 kB
Export small model at 1280 epochs
11 days ago
requirements.txt
206 Bytes
Add runs data
5 days ago