taishi-i
/

awesome-japanese-nlp-classification-model

Text Classification

Inference Endpoints

Model card Files Files and versions Community

taishi-i commited on Sep 9, 2023

Commit

0df6d6c

·

1 Parent(s): 70edc56

add evaluation script to README.md

Files changed (1) hide show

README.md +44 -0

README.md CHANGED Viewed

@@ -53,6 +53,50 @@ label = pipe(text)
 print(label) # [{'label': '0', 'score': 0.9986791014671326}]
 ```
 # License
 This model was trained from a dataset collected from the GitHub API under [GitHub Acceptable Use Policies - 7. Information Usage Restrictions](https://docs.github.com/en/site-policy/acceptable-use-policies/github-acceptable-use-policies#7-information-usage-restrictions) and [GitHub Terms of Service - H. API Terms](https://docs.github.com/en/site-policy/github-terms/github-terms-of-service#h-api-terms). It should be used solely for research verification purposes. Adhering to GitHub's regulations is mandatory.

 print(label) # [{'label': '0', 'score': 0.9986791014671326}]
 ```
+# Evaluation
+Please install the following library.
+```bash
+pip install evaluate scikit-learn datasets transformers torch
+```
+```python
+import evaluate
+from datasets import load_dataset
+from sklearn.metrics import classification_report
+from transformers import pipeline
+# Evaluation dataset
+dataset = load_dataset("taishi-i/awesome-japanese-nlp-classification-dataset")
+# Text classification model
+pipe = pipeline(
+    "text-classification",
+    model="taishi-i/awesome-japanese-nlp-classification-model",
+)
+# Evaluation metric
+f1 = evaluate.load("f1")
+# Predict process
+predicted_labels = []
+for text in dataset["test"]["text"]:
+    prediction = pipe(text)
+    predicted_label = prediction[0]["label"]
+    predicted_labels.append(int(predicted_label))
+score = f1.compute(
+    predictions=predicted_labels, references=dataset["test"]["label"]
+)
+print(score)
+report = classification_report(
+    y_true=dataset["test"]["label"], y_pred=predicted_labels
+)
+print(report)
+```
 # License
 This model was trained from a dataset collected from the GitHub API under [GitHub Acceptable Use Policies - 7. Information Usage Restrictions](https://docs.github.com/en/site-policy/acceptable-use-policies/github-acceptable-use-policies#7-information-usage-restrictions) and [GitHub Terms of Service - H. API Terms](https://docs.github.com/en/site-policy/github-terms/github-terms-of-service#h-api-terms). It should be used solely for research verification purposes. Adhering to GitHub's regulations is mandatory.