Update README_EN.md
Browse files- README_EN.md +42 -25
README_EN.md
CHANGED
@@ -22,31 +22,44 @@ zjunlp/knowlm-13b-ie samples around 10% of the data from Chinese-English informa
|
|
22 |
|
23 |
|
24 |
# 2. Information Extraction Template
|
25 |
-
The
|
26 |
-
1. Task
|
27 |
-
2.
|
28 |
-
3.
|
29 |
|
30 |
|
31 |
-
Template with specified list of candidate labels
|
32 |
-
```json
|
33 |
-
NER: "You are an expert specialized in entity extraction. With the candidate entity types list: {s_schema}, please extract possible entities from the input below, outputting NAN if a certain entity does not exist. Respond in the format {s_format}."
|
34 |
-
RE: "You are an expert in extracting relation triples. With the candidate relation list: {s_schema}, please extract the possible head entities and tail entities from the input below and provide the corresponding relation triples. If a relation does not exist, output NAN. Please answer in the {s_format} format."
|
35 |
-
EE: "You are a specialist in event extraction. Given the candidate event dictionary: {s_schema}, please extract any possible events from the input below. If an event does not exist, output NAN. Please answer in the format of {s_format}."
|
36 |
-
EET: "As an event analysis specialist, you need to review the input and determine possible events based on the event type directory: {s_schema}. All answers should be based on the {s_format} format. If the event type does not match, please mark with NAN."
|
37 |
-
EEA: "You are an expert in event argument extraction. Given the event dictionary: {s_schema1}, and the event type and trigger words: {s_schema2}, please extract possible arguments from the following input. If an event argument does not exist, output NAN. Please respond in the {s_format} format."
|
38 |
```
|
|
|
39 |
|
|
|
40 |
|
41 |
-
|
42 |
-
|
43 |
-
|
44 |
-
|
45 |
-
|
46 |
-
EET: "Examine the following text content and extract any events you deem significant. Provide your findings in the {s_format} format."
|
47 |
-
EEA: "Please extract possible arguments based on the event type and trigger word {s_schema2} from the input below. Answer in the format of {s_format}."
|
48 |
```
|
49 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
50 |
<details>
|
51 |
<summary><b>Candidate Labels {s_schema}</b></summary>
|
52 |
|
@@ -67,11 +80,15 @@ Template without specifying a list of candidate labels:
|
|
67 |
|
68 |
|
69 |
```json
|
70 |
-
NER: (Entity,Entity Type)
|
71 |
-
|
72 |
-
|
73 |
-
|
74 |
-
|
|
|
|
|
|
|
|
|
75 |
```
|
76 |
|
77 |
</details>
|
@@ -229,13 +246,13 @@ For Event Extraction (EE) tasks:
|
|
229 |
{"OrganizationalRelation-Layoff": ["LayoffParty", "NumberLaidOff", "Time"], "LegalAction-Sue": ["Plaintiff", "Defendant", "Time"], ...} # Dictionary of event types
|
230 |
|
231 |
|
232 |
-
For EET tasks:
|
233 |
["Social Interaction-Thanks", "Organizational Action-OpeningCeremony", "Competition Action-Withdrawal", ...] # List of event types
|
234 |
[] # Empty list
|
235 |
{} # Empty dictionary
|
236 |
|
237 |
|
238 |
-
For Event Extraction
|
239 |
["Social Interaction-Thanks", "Organizational Action-OpeningCeremony", "Competition Action-Withdrawal", ...] # List of event types
|
240 |
["DismissingParty", "TerminatingParty", "Reporter", "ArrestedPerson"] # List of argument roles
|
241 |
{"OrganizationalRelation-Layoff": ["LayoffParty", "NumberLaidOff", "Time"], "LegalAction-Sue": ["Plaintiff", "Defendant", "Time"], ...} # Dictionary of event types
|
|
|
22 |
|
23 |
|
24 |
# 2. Information Extraction Template
|
25 |
+
The `template` is used to construct an `instruction` for the input of model, consisting of three parts:
|
26 |
+
1. **Task Description**: Clearly define the model's function and the task it needs to complete, such as entity recognition, relation extraction, event extraction, etc.
|
27 |
+
2. **Candidate Label List {s_schema} (optional)**: Define the categories of labels that the model needs to extract, such as entity types, relation types, event types, etc.
|
28 |
+
3. **Structured Output Format {s_format}**: Specify how the model should present the structured information it extracts.
|
29 |
|
30 |
|
31 |
+
Template **with specified list of candidate labels**:
|
|
|
|
|
|
|
|
|
|
|
|
|
32 |
```
|
33 |
+
Named Entity Recognition(NER): You are an expert specialized in entity extraction. With the candidate entity types list: {s_schema}, please extract possible entities from the input below, outputting NAN if a certain entity does not exist. Respond in the format {s_format}.
|
34 |
|
35 |
+
Relation Extraction(RE): You are an expert in extracting relation triples. With the candidate relation list: {s_schema}, please extract the possible head entities and tail entities from the input below and provide the corresponding relation triples. If a relation does not exist, output NAN. Please answer in the {s_format} format.
|
36 |
|
37 |
+
Event Extraction(EE): You are a specialist in event extraction. Given the candidate event dictionary: {s_schema}, please extract any possible events from the input below. If an event does not exist, output NAN. Please answer in the format of {s_format}.
|
38 |
+
|
39 |
+
Event Type Extraction(EET): As an event analysis specialist, you need to review the input and determine possible events based on the event type directory: {s_schema}. All answers should be based on the {s_format} format. If the event type does not match, please mark with NAN.
|
40 |
+
|
41 |
+
Event Argument Extraction(EEA): You are an expert in event argument extraction. Given the event dictionary: {s_schema1}, and the event type and trigger words: {s_schema2}, please extract possible arguments from the following input. If an event argument does not exist, output NAN. Please respond in the {s_format} format.
|
|
|
|
|
42 |
```
|
43 |
|
44 |
+
|
45 |
+
<details>
|
46 |
+
<summary><b>Template without specifying a list of candidate labels</b></summary>
|
47 |
+
|
48 |
+
|
49 |
+
```
|
50 |
+
Named Entity Recognition(NER): Analyze the text content and extract the clear entities. Present your findings in the {s_format} format, skipping any ambiguous or uncertain parts.
|
51 |
+
|
52 |
+
Relation Extraction(RE): Please extract all the relation triples from the text and present the results in the format of {s_format}. Ignore those entities that do not conform to the standard relation template.
|
53 |
+
|
54 |
+
Event Extraction(EE): Please analyze the following text, extract all identifiable events, and present them in the specified format {s_format}. If certain information does not constitute an event, simply skip it.
|
55 |
+
|
56 |
+
Event Type Extraction(EET): Examine the following text content and extract any events you deem significant. Provide your findings in the {s_format} format.
|
57 |
+
|
58 |
+
Event Argument Extraction(EEA): Please extract possible arguments based on the event type and trigger word {s_schema2} from the input below. Answer in the format of {s_format}.
|
59 |
+
```
|
60 |
+
</details>
|
61 |
+
|
62 |
+
|
63 |
<details>
|
64 |
<summary><b>Candidate Labels {s_schema}</b></summary>
|
65 |
|
|
|
80 |
|
81 |
|
82 |
```json
|
83 |
+
Named Entity Recognition(NER): (Entity,Entity Type)
|
84 |
+
|
85 |
+
Relation Extraction(RE): (Subject,Relation,Object)
|
86 |
+
|
87 |
+
Event Extraction(EE): (Event Trigger,Event Type,Argument1#Argument Role1;Argument2#Argument Role2)
|
88 |
+
|
89 |
+
Event Type Extraction(EET): (Event Trigger,Event Type)
|
90 |
+
|
91 |
+
Event Argument Extraction(EEA): (Event Trigger,Event Type,Argument1#Argument Role1;Argument2#Argument Role2)
|
92 |
```
|
93 |
|
94 |
</details>
|
|
|
246 |
{"OrganizationalRelation-Layoff": ["LayoffParty", "NumberLaidOff", "Time"], "LegalAction-Sue": ["Plaintiff", "Defendant", "Time"], ...} # Dictionary of event types
|
247 |
|
248 |
|
249 |
+
For Event Type Extraction(EET) tasks:
|
250 |
["Social Interaction-Thanks", "Organizational Action-OpeningCeremony", "Competition Action-Withdrawal", ...] # List of event types
|
251 |
[] # Empty list
|
252 |
{} # Empty dictionary
|
253 |
|
254 |
|
255 |
+
For Event Argument Extraction(EEA) tasks:
|
256 |
["Social Interaction-Thanks", "Organizational Action-OpeningCeremony", "Competition Action-Withdrawal", ...] # List of event types
|
257 |
["DismissingParty", "TerminatingParty", "Reporter", "ArrestedPerson"] # List of argument roles
|
258 |
{"OrganizationalRelation-Layoff": ["LayoffParty", "NumberLaidOff", "Time"], "LegalAction-Sue": ["Plaintiff", "Defendant", "Time"], ...} # Dictionary of event types
|