lunahr
/

thea-pro-2b-100r

Text Generation

text-generation-inference

Model card Files Files and versions Metrics Training metrics Community

Model Description

An uncensored reasoning EXAONE 3.5 model trained on reasoning data. Now with a full epoch!

It has been trained using improved training code, and gives an improved performance. Here is what inference code you should use:

# DEBUGGING IN PROGRESS, check later

Trained by: Piotr Zalewski
License: exaone
Finetuned from model: LGAI-EXAONE/EXAONE-3.5-2.4B-Instruct
Dataset used: KingNish/reasoning-base-20k

This Llama model was trained faster than Unsloth using custom training code.

Visit https://www.kaggle.com/code/piotr25691/distributed-hf-training-with-2xt4 to find out how you can finetune your models using BOTH of the Kaggle provided GPUs.

Downloads last month: 64

Safetensors

Model size

2.41B params

Tensor type

BF16

·

Inference Providers NEW

Text Generation

This model is not currently available via any of the supported Inference Providers.

The model cannot be deployed to the HF Inference API: The HF Inference API does not support model that require custom code execution.

Model tree for lunahr/thea-pro-2b-100r

Base model

LGAI-EXAONE/EXAONE-3.5-2.4B-Instruct

Finetuned

(13)

this model

Dataset used to train lunahr/thea-pro-2b-100r

Collection including lunahr/thea-pro-2b-100r

Thea

A family of compact reasoning models, based off of the best 2B and 3B models, trained using improved DDP training code, no Unsloth. • 5 items • Updated Jan 22 • 1