DaidalosTeam's picture
Update README.md
dc887e4 verified
metadata
license: mit
language:
  - la
base_model:
  - latincy/la_core_web_lg
model-index:
  - name: la_core_web_lg_3.7.4
    results:
      - task:
          type: NER
        dataset:
          name: Herodotos_dataset
          type: Herodotos_dataset
        metrics:
          - name: macro F1
            type: macro F1
            value: 58
        source:
          name: SEFLAG
          url: >-
            https://bibbase.org/network/publication/schulz-deichsler-seflagsystematicevaluationframeworkfornlpmodelsanddatasetsinlatinandancientgreek-2024
      - task:
          type: lemmatization
        dataset:
          name: UD-Latin
          type: UD-Latin
        metrics:
          - name: accuracy
            type: accuracy
            value: 88
        source:
          name: SEFLAG
          url: >-
            https://bibbase.org/network/publication/schulz-deichsler-seflagsystematicevaluationframeworkfornlpmodelsanddatasetsinlatinandancientgreek-2024

la_core_web_lg

Intended Use

  • Primary intended uses: Morphological analysis, POS-Tagging, Lemmatizing, Parsing, NER

  • Primary intended users: Classical Scholars

  • Out-of-scope use cases: unknown

Data, Limitations, and Recommendations

  • Data selection for training: Training data consists of latin UD-Treebanks, Wikipedia and OSCAR sentence data, the CC-100 Latin dataset and the Herodotos Project NER dataset

  • Data selection for evaluation: Evaluation was done according to the spaCy workflow and is documented in the meta.json file found in the repository (https://huggingface.co./latincy/la_core_web_lg/blob/main/meta.json)

  • Limitations: unknown