taishi-i committed
Commit 3b4a75e
Parent: d4bfeca

fix README.md

Files changed (1): README.md (+26 -20)
README.md CHANGED
@@ -21,7 +21,7 @@ Python 3.7+ on Linux or macOS is required.
 
 
 ```bash
-$ pip install nagisa_bert
+pip install nagisa_bert
 ```
 
 ## Usage
@@ -29,13 +29,16 @@ $ pip install nagisa_bert
 This model is available in Transformer's pipeline method.
 
 ```python
->>> from transformers import pipeline
->>> from nagisa_bert import NagisaBertTokenizer
+from transformers import pipeline
+from nagisa_bert import NagisaBertTokenizer
 
->>> text = "nagisaで[MASK]できるモデルです"
->>> tokenizer = NagisaBertTokenizer.from_pretrained("taishi-i/nagisa_bert")
->>> fill_mask = pipeline("fill-mask", model='taishi-i/nagisa_bert', tokenizer=tokenizer)
->>> print(fill_mask(text))
+text = "nagisaで[MASK]できるモデルです"
+tokenizer = NagisaBertTokenizer.from_pretrained("taishi-i/nagisa_bert")
+fill_mask = pipeline("fill-mask", model='taishi-i/nagisa_bert', tokenizer=tokenizer)
+print(fill_mask(text))
+```
+
+```python
 [{'score': 0.1385931372642517,
  'sequence': 'nagisa で 使用 できる モデル です',
  'token': 8092,
@@ -61,18 +64,21 @@ This model is available in Transformer's pipeline method.
 Tokenization and vectorization.
 
 ```python
->>> from transformers import BertModel
->>> from nagisa_bert import NagisaBertTokenizer
-
->>> text = "nagisaで[MASK]できるモデルです"
->>> tokenizer = NagisaBertTokenizer.from_pretrained("taishi-i/nagisa_bert")
->>> tokens = tokenizer.tokenize(text)
->>> print(tokens)
-['na', '##g', '##is', '##a', 'で', '[MASK]', 'できる', 'モデル', 'です']
-
->>> model = BertModel.from_pretrained("taishi-i/nagisa_bert")
->>> h = model(**tokenizer(text, return_tensors="pt")).last_hidden_state
->>> print(h)
+from transformers import BertModel
+from nagisa_bert import NagisaBertTokenizer
+
+text = "nagisaで[MASK]できるモデルです"
+tokenizer = NagisaBertTokenizer.from_pretrained("taishi-i/nagisa_bert")
+tokens = tokenizer.tokenize(text)
+print(tokens)
+# ['na', '##g', '##is', '##a', 'で', '[MASK]', 'できる', 'モデル', 'です']
+
+model = BertModel.from_pretrained("taishi-i/nagisa_bert")
+h = model(**tokenizer(text, return_tensors="pt")).last_hidden_state
+print(h)
+```
+
+```python
 tensor([[[-0.2912, -0.6818, -0.4097, ..., 0.0262, -0.3845, 0.5816],
         [ 0.2504, 0.2143, 0.5809, ..., -0.5428, 1.1805, 1.8701],
         [ 0.1890, -0.5816, -0.5469, ..., -1.2081, -0.2341, 1.0215],
@@ -108,4 +114,4 @@ You can find here a list of the notebooks on Japanese NLP using pre-trained mode
 | [Feature-extraction](https://github.com/taishi-i/nagisa_bert/blob/develop/notebooks/feature_extraction-japanese_bert_models.ipynb) | How to use the pipeline function in transformers to extract features from Japanese text. |[![Open in Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/taishi-i/nagisa_bert/blob/develop/notebooks/feature_extraction-japanese_bert_models.ipynb)|
 | [Embedding visualization](https://github.com/taishi-i/nagisa_bert/blob/develop/notebooks/embedding_visualization-japanese_bert_models.ipynb) | Show how to visualize embeddings from Japanese pre-trained models. |[![Open in Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/taishi-i/nagisa_bert/blob/develop/notebooks/embedding_visualization_japanese_bert_models.ipynb)|
 | [How to fine-tune a model on text classification](https://github.com/taishi-i/nagisa_bert/blob/develop/notebooks/text_classification-amazon_reviews_ja.ipynb) | Show how to fine-tune a pretrained model on a Japanese text classification task. |[![Open in Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/taishi-i/nagisa_bert/blob/develop/notebooks/text_classification-amazon_reviews_ja.ipynb)|
-| [How to fine-tune a model on text classification with csv files](https://github.com/taishi-i/nagisa_bert/blob/develop/notebooks/text_classification-csv_files.ipynb) | Show how to preprocess the data and fine-tune a pretrained model on a Japanese text classification task. |[![Open in Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/taishi-i/nagisa_bert/blob/develop/notebooks/text_classification-csv_files.ipynb)|
+| [How to fine-tune a model on text classification with csv files](https://github.com/taishi-i/nagisa_bert/blob/develop/notebooks/text_classification-csv_files.ipynb) | Show how to preprocess the data and fine-tune a pretrained model on a Japanese text classification task. |[![Open in Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/taishi-i/nagisa_bert/blob/develop/notebooks/text_classification-csv_files.ipynb)|
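For reference, the usage change drops the `>>>` REPL prompts so the snippet can be pasted directly into a script, and splits the printed output into its own block. A minimal runnable sketch of the updated fill-mask example (assuming `nagisa_bert` and its `transformers` dependency are installed; the loop over candidates is illustrative and not part of the README):

```python
from transformers import pipeline
from nagisa_bert import NagisaBertTokenizer

text = "nagisaで[MASK]できるモデルです"
tokenizer = NagisaBertTokenizer.from_pretrained("taishi-i/nagisa_bert")
fill_mask = pipeline("fill-mask", model="taishi-i/nagisa_bert", tokenizer=tokenizer)

# The pipeline returns a list of candidate dicts with 'score', 'token',
# 'token_str', and 'sequence' keys; the README output shows '使用' ("use")
# as the top completion for the [MASK] slot.
for candidate in fill_mask(text):
    print(candidate["sequence"], candidate["score"])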
 
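```

The tokenization and vectorization snippet gets the same script-style treatment. A sketch of what the updated block computes, with shape comments added (the hidden size of 768 is an assumption based on a standard BERT-base configuration, not stated in the diff):

```python
import torch
from transformers import BertModel
from nagisa_bert import NagisaBertTokenizer

text = "nagisaで[MASK]できるモデルです"
tokenizer = NagisaBertTokenizer.from_pretrained("taishi-i/nagisa_bert")

# WordPiece tokenization; the README shows "nagisa" split into subwords:
# ['na', '##g', '##is', '##a', 'で', '[MASK]', 'できる', 'モデル', 'です']
print(tokenizer.tokenize(text))

# Encode (adds [CLS]/[SEP]) and run the encoder; last_hidden_state is a
# (batch, sequence_length, hidden_size) tensor of contextual vectors.
model = BertModel.from_pretrained("taishi-i/nagisa_bert")
with torch.no_grad():  # inference only, no gradients needed
    h = model(**tokenizer(text, return_tensors="pt")).last_hidden_state
print(h.shape)  # torch.Size([1, 11, 768]) under the assumed BERT-base config
```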