Commit
·
e835bc7
1
Parent(s):
def3eb7
move model link
Browse files
app.py
CHANGED
@@ -324,12 +324,12 @@ This model was developed as part of work by the [Living with Machines](https://l
|
|
324 |
|
325 |
This model is intended to predict, from the title of a book, whether it is 'fiction' or 'non-fiction'. This model was trained on data created from the [Digitised printed books (18th-19th Century)](https://www.bl.uk/collection-guides/digitised-printed-books) book collection.
|
326 |
This dataset is dominated by English language books though it includes books in several other languages in much smaller numbers. This model was originally developed for use as part of the Living with Machines project to be able to 'segment' this large dataset of books into different categories based on a 'crude' classification of genre i.e. whether the title was `fiction` or `non-fiction`.
|
327 |
-
|
328 |
|
329 |
## Training data
|
330 |
|
331 |
The model is trained on a particular collection of books digitised by the British Library. As a result the model may do less well on titles that look different to this data.
|
332 |
-
In particular the training data, was mostly English, and mostly from the 19th Century.
|
333 |
|
334 |
## Model performance
|
335 |
|
|
|
324 |
|
325 |
This model is intended to predict, from the title of a book, whether it is 'fiction' or 'non-fiction'. This model was trained on data created from the [Digitised printed books (18th-19th Century)](https://www.bl.uk/collection-guides/digitised-printed-books) book collection.
|
326 |
This dataset is dominated by English language books though it includes books in several other languages in much smaller numbers. This model was originally developed for use as part of the Living with Machines project to be able to 'segment' this large dataset of books into different categories based on a 'crude' classification of genre i.e. whether the title was `fiction` or `non-fiction`.
|
327 |
+
You can find more information about the model [here]((https://doi.org/10.5281/zenodo.5245175))
|
328 |
|
329 |
## Training data
|
330 |
|
331 |
The model is trained on a particular collection of books digitised by the British Library. As a result the model may do less well on titles that look different to this data.
|
332 |
+
In particular the training data, was mostly English, and mostly from the 19th Century.
|
333 |
|
334 |
## Model performance
|
335 |
|