mmnga has made a llama.cpp (GGUF) version: [webbigdata-ALMA-7B-Ja-gguf](https://huggingface.co/mmnga/webbigdata-ALMA-7B-Ja-gguf). Thank you!

llama.cpp is a tool for running LLMs on CPU, used primarily on Macs, and GGUF is its latest model file format. It can be used without a GPU.
[ALMA-7B-Ja-gguf Free Colab sample](https://github.com/webbigdata-jp/python_sample/blob/main/ALMA_7B_Ja_gguf_Free_Colab_sample.ipynb)
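As a rough sketch of what the Colab sample above does, the GGUF file can be run locally with the `llama-cpp-python` bindings. The quantized file name and the translation prompt format below are assumptions; check the Hugging Face repo and the notebook for the exact values.

```python
def build_prompt(ja_text: str) -> str:
    # ALMA-style translation prompt (assumed format; see the Colab notebook)
    return ("Translate this from Japanese to English:\n"
            f"Japanese: {ja_text}\nEnglish:")

if __name__ == "__main__":
    from llama_cpp import Llama  # pip install llama-cpp-python

    # The file name is hypothetical; pick a real .gguf file from the repo.
    llm = Llama(model_path="webbigdata-ALMA-7B-Ja-q4_0.gguf",
                n_ctx=512,
                n_gpu_layers=0)  # 0 = pure CPU, no GPU required
    out = llm(build_prompt("今日は良い天気ですね。"), max_tokens=64)
    print(out["choices"][0]["text"].strip())
```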
### ALMA-7B-Ja-GPTQ-Ja-En
GPTQ is a quantization method that reduces model size, and ALMA-7B-Ja-GPTQ-Ja-En is a GPTQ-quantized version with reduced model size (3.9GB) and memory usage.
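A hedged sketch of loading the GPTQ checkpoint with `transformers`, which dispatches to `auto-gptq` when the checkpoint carries a GPTQ quantization config. The repo id and the translation prompt format are assumptions, not confirmed by this README.

```python
def alma_prompt(ja_text: str) -> str:
    # ALMA-style translation prompt (assumed format)
    return ("Translate this from Japanese to English:\n"
            f"Japanese: {ja_text}\nEnglish:")

if __name__ == "__main__":
    from transformers import AutoModelForCausalLM, AutoTokenizer

    model_id = "webbigdata/ALMA-7B-Ja-GPTQ-Ja-En"  # assumed repo id
    tok = AutoTokenizer.from_pretrained(model_id)
    # transformers detects the GPTQ config in the checkpoint and uses
    # auto-gptq for the quantized weights (pip install auto-gptq optimum)
    model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

    inputs = tok(alma_prompt("今日は良い天気ですね。"),
                 return_tensors="pt").to(model.device)
    out = model.generate(**inputs, max_new_tokens=64)
    # Decode only the newly generated tokens, skipping the prompt
    print(tok.decode(out[0][inputs["input_ids"].shape[1]:],
                     skip_special_tokens=True).strip())
```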