---
datasets:
- HuggingFaceH4/ultrachat_200k
base_model:
- google/gemma-2-9b
library_name: transformers, deltazip
---

## google/gemma-2-9b - 4b_2n4m_128bs Compression

This model was compressed with [deltazip](https://github.com/eth-easl/deltazip).

[Paper](https://arxiv.org/abs/2312.05215), [Compression Tool](https://github.com/eth-easl/deltazip), [Inference Engine (Soon)](https://github.com/eth-easl/deltazip).

## Compression Configuration

- Base Model: google/gemma-2-9b
- Compression Scheme: 4b_2n4m_128bs
- Dataset: HuggingFaceH4/ultrachat_200k
- Dataset Split: train_sft
- Max Sequence Length: 2048
- Number of Samples: 256

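The scheme name `4b_2n4m_128bs` is not explained in this card. One plausible reading, which is an assumption and not something stated here, is 4-bit quantization, 2:4 structured sparsity (2 values kept in every group of 4), and a block size of 128. The hypothetical helper `parse_scheme` below sketches that reading in Python; it is not part of deltazip, and the field names are illustrative only.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class CompressionScheme:
    """Assumed interpretation of a scheme name such as '4b_2n4m_128bs'."""

    bits: int        # assumed: quantization bit width
    keep_n: int      # assumed: values kept per group (the "n" in n:m sparsity)
    group_m: int     # assumed: group size (the "m" in n:m sparsity)
    block_size: int  # assumed: block size

    @property
    def sparsity(self) -> float:
        # Fraction of values dropped under the assumed n:m pattern.
        return 1.0 - self.keep_n / self.group_m


# Hypothetical pattern: <bits>b_<n>n<m>m_<block size>bs
_PATTERN = re.compile(r"(\d+)b_(\d+)n(\d+)m_(\d+)bs")


def parse_scheme(name: str) -> CompressionScheme:
    """Split a scheme name into its numeric parts, or raise ValueError."""
    match = _PATTERN.fullmatch(name)
    if match is None:
        raise ValueError(f"unrecognized scheme name: {name!r}")
    bits, keep_n, group_m, block_size = (int(g) for g in match.groups())
    return CompressionScheme(bits, keep_n, group_m, block_size)


scheme = parse_scheme("4b_2n4m_128bs")
print(scheme)
print(scheme.sparsity)
```

Under this reading, half of the values in each group of four would be dropped. The deltazip repository is the authoritative source for what the scheme name actually encodes.
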
## Sample Output

#### Prompt:

```
[{'role': 'user', 'content': 'Who is Alan Turing?'}]
```

#### Output:

```
<bos><start_of_turn>user
Who is Alan Turing?<end_of_turn>
*Alan Turing* (1912-1954) was a British mathematician and computer scientist who is widely considered to be the father of theoretical computer science and artificial intelligence.

Here are some of his key contributions:

* **Turing Machine:** He proposed the concept of a *Turing machine*, a theoretical model of computation that can simulate any algorithm. This laid the foundation for modern computers.
* **Codebreaking during World War II:** Turing played a crucial role in breaking the German Enigma code at Bletchley Park, significantly contributing to the Allied victory
```

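The first two lines of the output above are the prompt rendered in Gemma's turn-based chat format. As a rough illustration, the hypothetical helper `render_gemma_prompt` below rebuilds that prefix in plain Python. It is a hand-written approximation, not the tokenizer's own chat template; with transformers, `tokenizer.apply_chat_template` would normally do this step. The optional `<start_of_turn>model` marker follows Gemma's documented prompt format and is an assumption here, since it does not appear in the sample.

```python
def render_gemma_prompt(messages, add_generation_prompt=True):
    """Approximate Gemma's chat format for a list of {'role', 'content'} dicts."""
    parts = ["<bos>"]
    for message in messages:
        # Gemma names the assistant role "model".
        role = "model" if message["role"] == "assistant" else message["role"]
        content = message["content"].strip()
        parts.append(f"<start_of_turn>{role}\n{content}<end_of_turn>\n")
    if add_generation_prompt:
        # Assumed marker that cues the model to start its reply.
        parts.append("<start_of_turn>model\n")
    return "".join(parts)


messages = [{"role": "user", "content": "Who is Alan Turing?"}]
prefix = render_gemma_prompt(messages, add_generation_prompt=False)
print(prefix)
```

With `add_generation_prompt=False`, the result matches the prefix shown in the sample output.
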
## Evaluation

TODO