File size: 1,599 Bytes
9848e69
 
6b04195
9b48520
 
 
 
 
 
 
9848e69
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
6b04195
 
 
9848e69
 
 
6b04195
9848e69
6b04195
9848e69
6b04195
 
 
 
9848e69
6b04195
9848e69
6b04195
 
 
 
 
 
9848e69
6b04195
 
 
 
9848e69
6b04195
 
 
 
 
 
9848e69
 
 
 
 
6b04195
9848e69
 
 
9b48520
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
---
library_name: transformers
license: apache-2.0
datasets:
- mlabonne/orpo-dpo-mix-40k
language:
- en
base_model:
- meta-llama/Llama-3.1-8B-Instruct
pipeline_tag: text-generation
---

# Model Card for Model ID

<!-- Provide a quick summary of what the model is/does. -->



## Model Details

### Model Description

<!-- Provide a longer summary of what this model is. -->

This is the model card of a 🤗 transformers model that has been pushed on the Hub. This model card has been automatically generated.

- **Developed by:** Yeonwoo Sung
- **License:** apache 2.0
- **Finetuned from model:** meta-llama/Llama-3.1-8B-Instruct

### Model Sources [optional]

Trained from [meta-llama/Llama-3.1-8B-Instruct](https://huggingface.co./meta-llama/Llama-3.1-8B-Instruct).

## How to Get Started with the Model

You could use this model with huggingface transformer by using code below:
```python
import transformers
import torch

model_id = "BlackBeenie/Neos-Llama-3.1-8B"

pipeline = transformers.pipeline(
    "text-generation",
    model=model_id,
    model_kwargs={"torch_dtype": torch.bfloat16},
    device_map="auto",
)

messages = [
    {"role": "system", "content": "You are a pirate chatbot who always responds in pirate speak!"},
    {"role": "user", "content": "Who are you?"},
]

outputs = pipeline(
    messages,
    max_new_tokens=256,
)
print(outputs[0]["generated_text"][-1])
```

## Training Details

### Training Data

Trained on [mlabonne/orpo-dpo-mix-40k](https://huggingface.co./datasets/mlabonne/orpo-dpo-mix-40k).

### Training Procedure

This model is finetuned with ORPO trainer.