pszemraj committed on
Commit
57eae71
1 Parent(s): 1622e27

Update README.md

Files changed (1)
  1. README.md +1 -1
README.md CHANGED
@@ -81,7 +81,7 @@ Obviously, this section is quite general as there are many things one can use "g
  - To be investigated further, depending on what model/system is used it _might_ be worth it to apply this after OCR on typed characters.
  2. Correcting/infilling text generated by text generation models to be cohesive/remove obvious errors that break the conversation immersion. I use this on the outputs of [this OPT 2.7B chatbot-esque model of myself](https://huggingface.co/pszemraj/opt-peter-2.7B).
  > TODO add an example
- 3. Somewhat related to #2 above, fixing/correcting so-called [tortured-phrases](https://arxiv.org/abs/2107.06751) that are dead giveaways text was generated by a language model.
+ 3. Somewhat related to #2 above, fixing/correcting so-called [tortured-phrases](https://arxiv.org/abs/2107.06751) that are dead giveaways text was generated by a language model. _Note that _SOME_ of these are not fixed, especially as they venture into domain-specific terminology (i.e. irregular timberland instead of Random Forest)._
 
 
  ## Training and evaluation data