---
datasets:
- HuggingFaceH4/ultrachat_200k
base_model:
- google/gemma-2-9b
library_name: transformers, deltazip
---

## google/gemma-2-9b - 4b_2n4m_128bs Compression

This is a compressed model using [deltazip](https://github.com/eth-easl/deltazip).

[Paper](https://arxiv.org/abs/2312.05215), [Compression Tool](https://github.com/eth-easl/deltazip), [Inference Engine (Soon)](https://github.com/eth-easl/deltazip).

## Compression Configuration

- Base Model: google/gemma-2-9b
- Compression Scheme: 4b_2n4m_128bs
- Dataset: HuggingFaceH4/ultrachat_200k
- Dataset Split: train_sft
- Max Sequence Length: 2048
- Number of Samples: 256

## Sample Output

#### Prompt:

```
user Who is Alan Turing?
```

#### Output:

```
user user Who is Alan Turing?

*Alan Turing* (1912-1954) was a British mathematician and computer scientist who is considered the father of theoretical computer science and artificial intelligence.

Here are some key points about his life and work:

**Early Life and Education:**

* Born in London, England, in 1912.
* Showed exceptional mathematical ability from a young age.
* Studied mathematics at King's College, Cambridge.

**World War II Contributions:**

* Played a crucial role in breaking the German Enigma
```

## Evaluation