Commit 7a127e4 by EIStakovskii · 1 Parent(s): 931b2e5

Create README.md

Files changed (1)
  1. README.md +17 -0
README.md ADDED
@@ -0,0 +1,17 @@
+ ---
+ language: de # <-- my language
+ widget:
+ - text: "Unberechenbar, gefährlich, ja, auf jeden Fall."
+ - text: "Aber hinterher... oh, oh..."
+ - text: "Nettes Haus, was? - Ja."
+ - text: "Wissqween Sisssasde, adddddqwe12was Mdddilednberg war, 122huh?"
+ - text: "asdaojn;klL:JjJALSJD"
+ - text: "Was dadasdasein erster aaaaEind2ruck"
+
+ license: other
+ ---
13
+ This model was trained for evaluating linguistic acceptability and grammaticality.
14
+ Label_1 means ACCEPTABLE - the sentence is perfectly understandable by native speakers and has no serious grammatic and syntactic flaws.
15
+ Label_0 means NOT ACCEPTABLE - the sentence is flawed both orthographically and grammatically.
16
+
17
+ The model was trained on 50 thousand German sentences from the news_commentary dataset (https://huggingface.co/datasets/news_commentary). Out of 50 thousand 25 thousand sentences were algorithmically corrupted using the open source Python library https://github.com/eistakovskii/text_corruption_plus. The library was originally developed by aylliote (https://github.com/aylliote/corruption), but it was slightly adapted for the purposes of this model.
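
The two labels described in the README can be mapped to readable verdicts with a small helper. This is only a minimal sketch: the function name `describe_prediction`, the `LABEL_MEANINGS` table, and the assumed result shape (a dict with `label` and `score` keys, as text-classification tools commonly return) are illustrative assumptions and not part of this commit. The example predictions are hard-coded, so no model is loaded.

```python
# Human-readable meanings for the two class labels described in the README.
LABEL_MEANINGS = {
    "label_1": "ACCEPTABLE",
    "label_0": "NOT ACCEPTABLE",
}


def describe_prediction(prediction: dict) -> str:
    """Turn one classifier output dict into a readable verdict.

    Assumes the dict has a "label" key (e.g. "LABEL_1") and a "score" key.
    That shape is an assumption; the README does not specify it.
    """
    # Compare case-insensitively so "Label_1" and "LABEL_1" are both accepted.
    label = str(prediction["label"]).lower()
    if label not in LABEL_MEANINGS:
        raise ValueError(f"unexpected label: {prediction['label']!r}")
    return f"{LABEL_MEANINGS[label]} (score={float(prediction['score']):.2f})"


# Hypothetical outputs, hard-coded so the sketch runs without a model.
print(describe_prediction({"label": "LABEL_1", "score": 0.97}))
print(describe_prediction({"label": "LABEL_0", "score": 0.88}))
```

In practice the dicts would come from whatever inference call is used with the model; the helper only interprets the label names given in the README.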