---
library_name: transformers
license: apache-2.0
language:
- ko
---
# Model Card for Gemma Self-Attention Merged
## Model Details
### Model Description
The Gemma Self-Attention Merged model is a large language model created by merging the self-attention layers of an [English-based Gemma 7B model](https://huggingface.co./google/gemma-1.1-7b-it) and a [Korean-based Gemma 7B model](https://huggingface.co./beomi/gemma-ko-7b). The merge lets the model draw on the capabilities of both parents, yielding a single model that can handle tasks involving both English and Korean text.
The key features of this merged model include:
- Increased self-attention capacity from a doubled attention-head count (see the sketch after this list)
- Ability to handle both English and Korean input
- Potential for improved performance across a range of natural language processing tasks in either language
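Below is a minimal sketch of one way such a head-wise self-attention merge could be implemented. The actual procedure lives in the merge-gemma-attn repository linked under Model Sources; the concatenation scheme and the 0.5 output scaling shown here are assumptions, not the author's confirmed method.

```python
import torch
import torch.nn as nn
from transformers import AutoModelForCausalLM

def widen_linear(lin_a: nn.Linear, lin_b: nn.Linear, dim: int) -> nn.Linear:
    # Concatenate two projection matrices along `dim` (0 = output rows, 1 = input cols).
    w = torch.cat([lin_a.weight.data, lin_b.weight.data], dim=dim)
    merged = nn.Linear(w.shape[1], w.shape[0], bias=False, dtype=w.dtype)
    merged.weight.data.copy_(w)
    return merged

en = AutoModelForCausalLM.from_pretrained("google/gemma-1.1-7b-it", torch_dtype=torch.bfloat16)
ko = AutoModelForCausalLM.from_pretrained("beomi/gemma-ko-7b", torch_dtype=torch.bfloat16)

for en_layer, ko_layer in zip(en.model.layers, ko.model.layers):
    a, b = en_layer.self_attn, ko_layer.self_attn
    # Stack Q/K/V projections head-wise: the merged block attends with the
    # heads of both parent models, doubling the attention-head count.
    a.q_proj = widen_linear(a.q_proj, b.q_proj, dim=0)
    a.k_proj = widen_linear(a.k_proj, b.k_proj, dim=0)
    a.v_proj = widen_linear(a.v_proj, b.v_proj, dim=0)
    # The output projection receives the doubled head dimension as input, so
    # it is widened on the input side; halving keeps the residual stream near
    # a single parent's scale (a heuristic, not a verified choice).
    a.o_proj = widen_linear(a.o_proj, b.o_proj, dim=1)
    a.o_proj.weight.data.mul_(0.5)

# Record the doubled head count in the config; depending on the transformers
# version, attention modules cache these values at init, so saving the model
# and reloading it is the safest way to make the change take effect.
en.config.num_attention_heads *= 2
en.config.num_key_value_heads *= 2
```

The halved `o_proj` makes each merged layer's output roughly the average of the two parents' attention outputs, one simple way to keep activations in range when doubling heads.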
#### Chat template
The model expects prompts with the following role prefixes:
**system:** system message...
**B:** user message...
**A:** assistant message...
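A hypothetical prompt builder for this template is sketched below. Only the role names (`system:`, `B:`, `A:`) come from this card; the newline separators, the trailing `A:` cue, and the `build_prompt` helper itself are assumptions.

```python
def build_prompt(system_message: str, user_message: str) -> str:
    # Role prefixes follow the chat template above; exact whitespace is assumed.
    return f"system: {system_message}\nB: {user_message}\nA: "

prompt = build_prompt(
    "You are a helpful bilingual assistant.",
    "Translate 'good morning' into Korean.",
)
```

Ending the prompt with the assistant prefix `A:` leaves the model to complete the assistant turn during generation.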
### Model Sources
- **Repository:** https://github.com/lcw99/merge-gemma-attn.git