Update README.md
Browse files
README.md
CHANGED
@@ -1,3 +1,162 @@
|
|
1 |
---
|
2 |
license: cc-by-nc-4.0
|
|
|
|
|
|
|
|
|
|
|
3 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
---
|
2 |
license: cc-by-nc-4.0
|
3 |
+
datasets:
|
4 |
+
- bertin-project/alpaca-spanish
|
5 |
+
language:
|
6 |
+
- es
|
7 |
+
inference: false
|
8 |
---
|
9 |
+
|
10 |
+
|
11 |
+
# Model Card for Model ID
|
12 |
+
|
13 |
+
This model is the Llama-2-13b-hf fine-tuned with an adapter on the Spanish Alpaca dataset.
|
14 |
+
|
15 |
+
## Model Details
|
16 |
+
|
17 |
+
### Model Description
|
18 |
+
|
19 |
+
This is a Spanish chat model fine-tuned on a Spanish instruction dataset.
|
20 |
+
|
21 |
+
The model expect a prompt containing the instruction, with an option to add an input (see examples below).
|
22 |
+
|
23 |
+
|
24 |
+
|
25 |
+
- **Developed by:** 4i Intelligent Insights
|
26 |
+
- **Model type:** Chat model
|
27 |
+
- **Language(s) (NLP):** Spanish
|
28 |
+
- **License:** cc-by-nc-4.0 (inhereted from the alpaca-spanish dataset),
|
29 |
+
- **Finetuned from model :** Llama 2 13B ([license agreement](https://ai.meta.com/resources/models-and-libraries/llama-downloads/))
|
30 |
+
|
31 |
+
|
32 |
+
## Uses
|
33 |
+
|
34 |
+
The model is intended to be used directly without the need of further fine-tuning.
|
35 |
+
|
36 |
+
|
37 |
+
## Bias, Risks, and Limitations
|
38 |
+
|
39 |
+
This model inherits the bias, risks, and limitations of its base model, Llama 2, and of the dataset used for fine-tuning.
|
40 |
+
Note that the Spanish Alpaca dataset was obtained by translating the original Alpaca dataset. It contains translation errors that may have negatively impacted the fine-tuning of the model.
|
41 |
+
|
42 |
+
|
43 |
+
|
44 |
+
## How to Get Started with the Model
|
45 |
+
|
46 |
+
Use the code below to get started with the model for inference. The adapter was directly merged into the original Llama 2 model.
|
47 |
+
|
48 |
+
|
49 |
+
The following code sample uses 4-bit quantization, you may load the model without it if you have enough VRAM. We show results for hyperparameters that we found work well for this set of prompts.
|
50 |
+
|
51 |
+
```py
|
52 |
+
from transformers import AutoTokenizer, AutoModelForCausalLM, BitsAndBytesConfig, TrainingArguments, GenerationConfig
|
53 |
+
import torch
|
54 |
+
model_name = "4i-ai/Llama-2-13b-alpaca-es"
|
55 |
+
|
56 |
+
|
57 |
+
#Tokenizer
|
58 |
+
|
59 |
+
tokenizer = AutoTokenizer.from_pretrained(model_name, use_fast=True)
|
60 |
+
|
61 |
+
def create_and_prepare_model():
|
62 |
+
compute_dtype = getattr(torch, "float16")
|
63 |
+
bnb_config = BitsAndBytesConfig(
|
64 |
+
load_in_4bit=True,
|
65 |
+
bnb_4bit_quant_type="nf4",
|
66 |
+
bnb_4bit_compute_dtype=compute_dtype,
|
67 |
+
bnb_4bit_use_double_quant=True,
|
68 |
+
)
|
69 |
+
model = AutoModelForCausalLM.from_pretrained(
|
70 |
+
model_name, quantization_config=bnb_config, device_map={"": 0}
|
71 |
+
)
|
72 |
+
return model
|
73 |
+
model = create_and_prepare_model()
|
74 |
+
|
75 |
+
def generate(instruction, input=None):
|
76 |
+
#Format the prompt to look like the training data
|
77 |
+
if input is not None:
|
78 |
+
prompt = "### Instruction:\n"+instruction+"\n\n### Input:\n"+input+"\n\n### Response:\n"
|
79 |
+
else :
|
80 |
+
prompt = "### Instruction:\n"+instruction+"\n\n### Response:\n"
|
81 |
+
|
82 |
+
|
83 |
+
inputs = tokenizer(prompt, return_tensors="pt")
|
84 |
+
input_ids = inputs["input_ids"].cuda()
|
85 |
+
|
86 |
+
generation_output = model.generate(
|
87 |
+
input_ids=input_ids,
|
88 |
+
repetition_penalty=1.5,
|
89 |
+
generation_config=GenerationConfig(temperature=0.1, top_p=0.75, top_k=40, num_beams=20), #hyperparameters for generation
|
90 |
+
return_dict_in_generate=True,
|
91 |
+
output_scores=True,
|
92 |
+
max_new_tokens=150, #maximum tokens generated, increase if you want longer asnwer (up to 2048 - the length of the prompt), generation "looks" slower for longer response
|
93 |
+
|
94 |
+
)
|
95 |
+
for seq in generation_output.sequences:
|
96 |
+
output = tokenizer.decode(seq, skip_special_tokens=True)
|
97 |
+
print(output.split("### Response:")[1].strip())
|
98 |
+
|
99 |
+
generate("Háblame de la superconductividad.")
|
100 |
+
print("-----------")
|
101 |
+
generate("Encuentra la capital de España.")
|
102 |
+
print("-----------")
|
103 |
+
generate("Encuentra la capital de Portugal.")
|
104 |
+
print("-----------")
|
105 |
+
generate("Organiza los números dados en orden ascendente.", "2, 3, 0, 8, 4, 10")
|
106 |
+
print("-----------")
|
107 |
+
generate("Compila una lista de 5 estados de EE. UU. ubicados en el Oeste.")
|
108 |
+
print("-----------")
|
109 |
+
generate("Compila una lista de 2 estados de EE. UU. ubicados en el Oeste.")
|
110 |
+
print("-----------")
|
111 |
+
generate("Compila una lista de 10 estados de EE. UU. ubicados en el Este.")
|
112 |
+
print("-----------")
|
113 |
+
generate("¿Cuál es la color de una fresa?")
|
114 |
+
print("-----------")
|
115 |
+
generate("¿Cuál es la color de la siguiente fruta?", "fresa")
|
116 |
+
print("-----------")
|
117 |
+
|
118 |
+
```
|
119 |
+
|
120 |
+
Expected output:
|
121 |
+
|
122 |
+
```
|
123 |
+
La superconductividad es un fenómeno físico en el que los materiales pueden conducir corrientes eléctricas a bajas temperaturas sin pérdida de energía debido a la resistencia. Los materiales superconductores son capaces de conducir corrientes eléctricas a temperaturas mucho más bajas que los materiales normales. Esto se debe a que los electrones en los materiales superconductores se comportan de manera cooperativa, lo que les permite conducir corrientes eléctricas sin pérdida de energía. Los materiales superconductores tienen muchas aplicaciones
|
124 |
+
-----------
|
125 |
+
La capital de España es Madrid.
|
126 |
+
-----------
|
127 |
+
La capital de Portugal es Lisboa.
|
128 |
+
-----------
|
129 |
+
0, 2, 3, 4, 8, 10
|
130 |
+
-----------
|
131 |
+
1. California
|
132 |
+
2. Oregón
|
133 |
+
3. Washington
|
134 |
+
4. Nevada
|
135 |
+
5. Arizona
|
136 |
+
-----------
|
137 |
+
California y Washington.
|
138 |
+
-----------
|
139 |
+
1. Maine
|
140 |
+
2. Nuevo Hampshire
|
141 |
+
3. Vermont
|
142 |
+
4. Massachusetts
|
143 |
+
5. Rhode Island
|
144 |
+
6. Connecticut
|
145 |
+
7. Nueva York
|
146 |
+
8. Nueva Jersey
|
147 |
+
9. Pensilvania
|
148 |
+
10. Delaware
|
149 |
+
-----------
|
150 |
+
La color de una fresa es rosa.
|
151 |
+
-----------
|
152 |
+
Roja
|
153 |
+
-----------
|
154 |
+
```
|
155 |
+
|
156 |
+
|
157 |
+
|
158 |
+
|
159 |
+
|
160 |
+
## Contact Us
|
161 |
+
[4i.ai](https://4i.ai/) provides natural language processing solutions with dialog, vision and voice capabilities to deliver real-life multimodal human-machine conversations.
|
162 |
+
Please contact us at [email protected]
|