EIStakovskii
commited on
Commit
·
7a127e4
1
Parent(s):
931b2e5
Create README.md
Browse files
README.md
ADDED
@@ -0,0 +1,17 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
language: de # <-- my language
|
3 |
+
widget:
|
4 |
+
- text: "Unberechenbar, gefährlich, ja, auf jeden Fall."
|
5 |
+
- text: "Aber hinterher... oh, oh..."
|
6 |
+
- text: "Nettes Haus, was? - Ja."
|
7 |
+
- text: "Wissqween Sisssasde, adddddqwe12was Mdddilednberg war, 122huh?"
|
8 |
+
- text: "asdaojn;klL:JjJALSJD"
|
9 |
+
- text: "Was dadasdasein erster aaaaEind2ruck"
|
10 |
+
|
11 |
+
license: other
|
12 |
+
---
|
13 |
+
This model was trained for evaluating linguistic acceptability and grammaticality.
|
14 |
+
Label_1 means ACCEPTABLE - the sentence is perfectly understandable by native speakers and has no serious grammatic and syntactic flaws.
|
15 |
+
Label_0 means NOT ACCEPTABLE - the sentence is flawed both orthographically and grammatically.
|
16 |
+
|
17 |
+
The model was trained on 50 thousand German sentences from the news_commentary dataset (https://huggingface.co/datasets/news_commentary). Out of 50 thousand 25 thousand sentences were algorithmically corrupted using the open source Python library https://github.com/eistakovskii/text_corruption_plus. The library was originally developed by aylliote (https://github.com/aylliote/corruption), but it was slightly adapted for the purposes of this model.
|