Minej commited on
Commit
6de45ac
·
1 Parent(s): 2c717fc

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +234 -0
README.md CHANGED
@@ -1,3 +1,237 @@
1
  ---
2
  license: mit
 
 
 
 
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: mit
3
+ language:
4
+ - en
5
+ library_name: transformers
6
+ pipeline_tag: text-classification
7
  ---
8
+
9
+ ### Model Description
10
+
11
+ Transfer Learning for Big Five Personality Prediction
12
+
13
+ In machine learning, training accurate models can be challenging when labeled data is limited. Transfer learning offers a solution by leveraging pre-existing labeled data from a similar task or domain. By transferring knowledge learned from one task to another, we can overcome data scarcity and train more effective models.
14
+
15
+ In this project, we used transfer learning with the BERT BASE UNCASED model to predict Big Five personality traits. The model was fine-tuned on a curated dataset for personality traits, learning patterns between input text and personality characteristics. By applying transfer learning, we improved the accuracy of personality trait predictions.
16
+
17
+ By leveraging transfer learning and fine-tuning BERT BASE UNCASED, we accurately predict an individual's Big Five personality traits based on their input text. This approach addresses the challenges of limited labeled data in personality prediction, providing insights into individuals' personalities.
18
+
19
+ This project showcases the power of transfer learning in machine learning and highlights the effectiveness of BERT BASE UNCASED for predicting Big Five personality traits.
20
+
21
+ - **Developed by:** Minej
22
+ - **Model type:** BERT BASE UNCASED
23
+ - **Language(s) (NLP):** English
24
+ - **License:** MIT
25
+ - **Finetuned from model [optional]:** https://huggingface.co/bert-base-uncased
26
+
27
+
28
+ ## Uses
29
+
30
+ <!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
31
+
32
+ ### Direct Use
33
+
34
+ The personality prediction model can be used directly by individuals who are interested in gaining insights into their own personality traits based on their input text. Users can input text and receive predictions for the Big Five personality traits.
35
+
36
+ ### Downstream Use [optional]
37
+
38
+ This model is not intended for downstream use or fine-tuning for specific tasks. It is designed as a standalone personality prediction model.
39
+
40
+ ### Out-of-Scope Use
41
+
42
+ This model is not suitable for uses beyond personality prediction. It should not be used for making critical decisions or judgments about individuals in areas such as employment, education, or legal matters.
43
+
44
+ ## Bias, Risks, and Limitations
45
+
46
+ The personality prediction model, like any machine learning model, has certain limitations and potential biases that should be taken into account:
47
+
48
+ ####Limited Context:
49
+ The model makes predictions based on input text alone and may not capture the full context of an individual's personality. It is important to consider that personality traits are influenced by various factors beyond textual expression.
50
+
51
+ ####Generalization:
52
+ The model predicts personality traits based on patterns learned from a specific dataset. Its performance may vary when applied to individuals from different demographic or cultural backgrounds not well represented in the training data.
53
+
54
+ ####Ethical Considerations:
55
+ Personality prediction models should be used responsibly, with an understanding that personality traits do not determine a person's worth or abilities. It is important to avoid making unfair judgments or discriminating against individuals based on predicted personality traits.
56
+
57
+ ####Privacy Concerns:
58
+ The model relies on user-provided input text, which may contain sensitive or personal information. Users should exercise caution when sharing personal details and ensure the security of their data.
59
+
60
+ ####False Positives/Negatives:
61
+ The model's predictions may not always align perfectly with an individual's actual personality traits. It is possible for the model to generate false positives (predicting a trait that is not present) or false negatives (missing a trait that is present).
62
+
63
+ ### Recommendations
64
+
65
+ To mitigate risks and limitations associated with personality prediction models, the following recommendations are suggested:
66
+
67
+ ####Awareness and Education:
68
+ Users should be informed about the limitations and potential biases of the model. Promote understanding that personality traits are complex and cannot be fully captured by a single model or text analysis.
69
+
70
+ ####Avoid Stereotyping and Discrimination:
71
+ Users should be cautious about making judgments or decisions solely based on predicted personality traits. Personality predictions should not be used to discriminate against individuals or perpetuate stereotypes.
72
+
73
+ ####Interpret with Context:
74
+ Interpret the model's predictions in the appropriate context and consider additional information about an individual beyond their input text.
75
+
76
+ ####Data Privacy and Security:
77
+ Ensure that user data is handled securely and with respect to privacy regulations. Users should be aware of the information they provide and exercise caution when sharing personal details.
78
+
79
+ ####Promote Ethical Use:
80
+ Encourage responsible use of personality prediction models and discourage misuse or harmful applications.
81
+
82
+ It is important to note that the above recommendations are general guidelines, and further context-specific recommendations should be developed based on the particular use case and ethical considerations.
83
+
84
+ ## How to Get Started with the Model
85
+
86
+ To use the model through Hosted inference API, follow the code snippet provided below:
87
+
88
+ ```python
89
+ from transformers import BertTokenizer, BertForSequenceClassification
90
+
91
+ def personality_detection(text):
92
+ tokenizer = BertTokenizer.from_pretrained("Minej/bert-base-personality")
93
+ model = BertForSequenceClassification.from_pretrained("Minej/bert-base-personality")
94
+
95
+ inputs = tokenizer(text, truncation=True, padding=True, return_tensors="pt")
96
+ outputs = model(**inputs)
97
+ predictions = outputs.logits.squeeze().detach().numpy()
98
+
99
+ label_names = ['Extroversion', 'Neuroticism', 'Agreeableness', 'Conscientiousness', 'Openness']
100
+ result = {label_names[i]: predictions[i] for i in range(len(label_names))}
101
+
102
+ return result
103
+ ```
104
+
105
+ If you would like to download the model files and use them instead of the Hosted inference API, then you can follow the code snippet provided below:
106
+
107
+ ```python
108
+ from transformers import BertForSequenceClassification, BertTokenizer
109
+ import torch
110
+
111
+ # Initialization of the model values
112
+ model = BertForSequenceClassification.from_pretrained(".", num_labels=5)
113
+ tokenizer = BertTokenizer.from_pretrained('.', do_lower_case=True)
114
+ model.config.label2id = {
115
+ "Extroversion": 0,
116
+ "Neuroticism": 1,
117
+ "Agreeableness": 2,
118
+ "Conscientiousness": 3,
119
+ "Openness": 4,
120
+ }
121
+ model.config.id2label = {
122
+ "0": "Extroversion",
123
+ "1": "Neuroticism",
124
+ "2": "Agreeableness",
125
+ "3": "Conscientiousness",
126
+ "4": "Openness",
127
+ }
128
+
129
+ def personality_detection(model_input: str) -> dict:
130
+ '''
131
+ Performs personality prediction on the given input text
132
+
133
+ Args:
134
+ model_input (str): The text conversation
135
+
136
+ Returns:
137
+ dict: A dictionary where keys are speaker labels and values are their personality predictions
138
+ '''
139
+
140
+ if len(model_input) == 0:
141
+ ret = {
142
+ "Extroversion": float(0),
143
+ "Neuroticism": float(0),
144
+ "Agreeableness": float(0),
145
+ "Conscientiousness": float(0),
146
+ "Openness": float(0),
147
+ }
148
+ return ret
149
+ else:
150
+ dict_custom = {}
151
+ preprocess_part1 = model_input[:len(model_input)]
152
+ dict1 = tokenizer.encode_plus(preprocess_part1, max_length=1024, padding=True, truncation=True)
153
+ dict_custom['input_ids'] = [dict1['input_ids'], dict1['input_ids']]
154
+ dict_custom['token_type_ids'] = [dict1['token_type_ids'], dict1['token_type_ids']]
155
+ dict_custom['attention_mask'] = [dict1['attention_mask'], dict1['attention_mask']]
156
+ outs = model(torch.tensor(dict_custom['input_ids']), token_type_ids=None, attention_mask=torch.tensor(dict_custom['attention_mask']))
157
+ b_logit_pred = outs[0]
158
+ pred_label = torch.sigmoid(b_logit_pred)
159
+ ret = {
160
+ "Extroversion": float(pred_label[0][0]),
161
+ "Neuroticism": float(pred_label[0][1]),
162
+ "Agreeableness": float(pred_label[0][2]),
163
+ "Conscientiousness": float(pred_label[0][3]),
164
+ "Openness": float(pred_label[0][4]),
165
+ }
166
+ return ret
167
+
168
+ personality_prediction = personality_detection(text_input)
169
+ ```
170
+
171
+ Make sure you have the required dependencies installed (transformers and torch). This code snippet initializes the model, tokenizer, and configuration. It then defines the personality_detection function, which takes a text conversation as input and returns a dictionary with personality predictions for each speaker.
172
+
173
+ You can call the personality_detection function with your input text to obtain the personality predictions. The personality_prediction variable will hold the resulting dictionary.
174
+
175
+ Please note that this code assumes you have already downloaded the necessary model files (config.json, pytorch_model.bin, special_tokens_map.json, tokenizer_config.json, vocab.txt
176
+ ) and placed them in the current directory (indicated by "."). Adjust the paths and filenames accordingly if needed.
177
+
178
+ #### Result Format [optional]
179
+
180
+ The personality_detection function returns a dictionary containing the predicted personality traits based on the given input text.
181
+
182
+ The dictionary contains the following personality traits with their corresponding predicted values:
183
+
184
+ Extroversion: A value between 0 and 1 representing the predicted extroversion trait.
185
+ Neuroticism: A value between 0 and 1 representing the predicted neuroticism trait.
186
+ Agreeableness: A value between 0 and 1 representing the predicted agreeableness trait.
187
+ Conscientiousness: A value between 0 and 1 representing the predicted conscientiousness trait.
188
+ Openness: A value between 0 and 1 representing the predicted openness trait.
189
+
190
+ ```python
191
+ text_input = "I am feeling excited about the upcoming event."
192
+ personality_prediction = personality_detection(text_input)
193
+
194
+ print(personality_prediction)
195
+ ```
196
+ #####Output:
197
+ ```python
198
+ {
199
+ "Extroversion": 0.535,
200
+ "Neuroticism": 0.576,
201
+ "Agreeableness": 0.399,
202
+ "Conscientiousness": 0.253,
203
+ "Openness": 0.563
204
+ }
205
+ ```
206
+
207
+ Note: The values in the example output are just placeholders and may not reflect the actual predictions.
208
+
209
+ You can modify the example code and the result format to match your specific use case and desired output format.
210
+
211
+ ## Citation [optional]
212
+
213
+ @article{DBLP:journals/corr/abs-1810-04805,
214
+ author = {Jacob Devlin and
215
+ Ming{-}Wei Chang and
216
+ Kenton Lee and
217
+ Kristina Toutanova},
218
+ title = {{BERT:} Pre-training of Deep Bidirectional Transformers for Language
219
+ Understanding},
220
+ journal = {CoRR},
221
+ volume = {abs/1810.04805},
222
+ year = {2018},
223
+ url = {http://arxiv.org/abs/1810.04805},
224
+ archivePrefix = {arXiv},
225
+ eprint = {1810.04805},
226
+ timestamp = {Tue, 30 Oct 2018 20:39:56 +0100},
227
+ biburl = {https://dblp.org/rec/journals/corr/abs-1810-04805.bib},
228
+ bibsource = {dblp computer science bibliography, https://dblp.org}
229
+ }
230
+
231
+
232
+ ## More Information [optional]
233
+
234
+ TBA
235
+
236
+
237
+