hiroshi-matsuda-rit
/

bert-base-japanese-basic-char-v2

Inference Endpoints

Model card Files Files and versions Community

bert-base-japanese-basic-char-v2 / README.md

hiroshi-matsuda-rit's picture

hiroshi-matsuda-rit

Update README.md

6191180 over 3 years ago

|

469 Bytes

	---
	language: ja
	license: cc-by-sa-3.0
	datasets:
	- wikipedia
	---

	# BERT base Japanese (character-level tokenization with whole word masking, jawiki-20200831)

	This pretrained model is almost the same as [cl-tohoku/bert-base-japanese-char-v2](https://huggingface.co./cl-tohoku/bert-base-japanese-char-v2) but do not need `fugashi` or `unidic_lite`.
	The only difference is in `word_tokenzer_type` property (specify `basic` instead of `mecab`) in `tokenizer_config.json`.