---
license: cc-by-nc-sa-4.0
datasets:
- Blablablab/ALOE
---
### Model Description
Given a sentence, the model classifies the *appraisal* it expresses. It is trained on the [ALOE](https://huggingface.co./datasets/Blablablab/ALOE) dataset.
**Input:** a sentence
**Labels:** No Label, Pleasantness, Anticipated Effort, Certainty, Objective Experience, Self-Other Agency, Situational Control, Advice, Trope
**Output:** logits, one per label, in the order listed above
**Model architecture:** OpenPrompt + RoBERTa (roberta-large)
**Developed by:** Jiamin Yang
### Model Performance
##### Overall performance
| Macro-F1 | Recall | Precision |
| :------: | :----: | :-------: |
| 0.56 | 0.57 | 0.58 |
##### Per-label performance
| Label | Recall | Precision |
| -------------------- | :----: | :-------: |
| No Label | 0.34 | 0.64 |
| Pleasantness | 0.69 | 0.54 |
| Anticipated Effort | 0.46 | 0.46 |
| Certainty | 0.58 | 0.47 |
| Objective Experience | 0.58 | 0.69 |
| Self-Other Agency | 0.62 | 0.55 |
| Situational Control | 0.31 | 0.55 |
| Advice | 0.72 | 0.66 |
| Trope | 0.80 | 0.67 |
### Getting Started
```python
import torch
from openprompt.plms import load_plm
from openprompt.prompts import ManualTemplate
from openprompt.prompts import ManualVerbalizer
from openprompt import PromptForClassification
from openprompt.data_utils import InputExample
from openprompt import PromptDataLoader
checkpoint_file = 'your_path_to/empathy-appraisal-span.pt'
plm, tokenizer, model_config, WrapperClass = load_plm('roberta', 'roberta-large')
template_text = 'The sentence {"placeholder":"text_a"} has the label {"mask"}.'
template = ManualTemplate(tokenizer=tokenizer, text=template_text)
num_classes = 9
label_words = [['No Label'], ['Pleasantness'], ['Anticipated Effort'], ['Certainty'], ['Objective Experience'], ['Self-Other Agency'], ['Situational Control'], ['Advice'], ['Trope']]
verbalizer = ManualVerbalizer(tokenizer, num_classes=num_classes, label_words=label_words)
prompt_model = PromptForClassification(plm=plm, template=template, verbalizer=verbalizer, freeze_plm=False).to('cuda')
checkpoint = torch.load(checkpoint_file)
state_dict = checkpoint['model_state_dict']
# Depending on the torch/transformers version, this buffer may or may not be
# present in the checkpoint; drop it if it is, since newer versions no longer expect it.
state_dict.pop('prompt_model.plm.roberta.embeddings.position_ids', None)
prompt_model.load_state_dict(state_dict)
# Run inference on two example sentences
dataset = [
    InputExample(
        guid=0,
        text_a="I am sorry for your loss",
    ),
    InputExample(
        guid=1,
        text_a="It's not your fault",
    ),
]
data_loader = PromptDataLoader(dataset=dataset,
                               template=template,
                               tokenizer=tokenizer,
                               tokenizer_wrapper_class=WrapperClass,
                               max_seq_length=512,
                               batch_size=2,
                               shuffle=False,
                               teacher_forcing=False,
                               predict_eos_token=False,
                               truncate_method='head')
prompt_model.eval()
with torch.no_grad():
    for batch in data_loader:
        logits = prompt_model(batch.to('cuda'))
        preds = torch.argmax(logits, dim=-1)
        print(preds)  # tensor([8, 5]) -> Trope, Self-Other Agency
```
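
The predicted indices correspond to positions in the label list above. Below is a minimal decoding sketch, assuming the label order matches the `label_words` list used in the snippet (which follows the **Labels** line in the Model Description):

```python
import torch.nn.functional as F

labels = ['No Label', 'Pleasantness', 'Anticipated Effort', 'Certainty',
          'Objective Experience', 'Self-Other Agency', 'Situational Control',
          'Advice', 'Trope']

# `logits` / `preds` hold the last (and here only) batch from the loop above
probs = F.softmax(logits, dim=-1)
for example, pred, prob in zip(dataset, preds.tolist(), probs):
    print(f"{example.text_a!r} -> {labels[pred]} ({prob[pred].item():.2f})")
# Expected labels for the two examples (preds == [8, 5]):
#   'I am sorry for your loss' -> Trope
#   "It's not your fault"      -> Self-Other Agency
```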