Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Muadil
/
Llama-3.2-1B-Instruct_sum_PPO_Skywork_20.0k_2_3ep
like
0
Follow
Muadil AI R&D Team
4
Text Generation
Transformers
Safetensors
llama
llama-factory
conversational
text-generation-inference
Inference Endpoints
arxiv:
1910.09700
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
Llama-3.2-1B-Instruct_sum_PPO_Skywork_20.0k_2_3ep
/
special_tokens_map.json
Commit History
Upload tokenizer
595e9cf
verified
SAadettin-BERber
commited on
15 days ago