sumuks commited on
Commit
2f2bdcf
1 Parent(s): 4df48fd

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +58 -4
README.md CHANGED
@@ -1,11 +1,65 @@
1
  ---
2
  title: README
3
- emoji: 🐨
4
- colorFrom: yellow
5
- colorTo: blue
6
  sdk: static
7
  pinned: false
8
  short_description: Unsupervised Human Preference Learning
9
  ---
10
 
11
- Edit this `README.md` markdown file to author your organization card.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  title: README
3
+ emoji: ⚙️
4
+ colorFrom: blue
5
+ colorTo: purple
6
  sdk: static
7
  pinned: false
8
  short_description: Unsupervised Human Preference Learning
9
  ---
10
 
11
+ # ⚙️ Preference Agents
12
+
13
+ Preference Agents is an organization focused on developing and releasing small language models ("preference agents") that enable efficient personalization of larger language models (LLMs). Our agents learn user preferences and generate natural language rules that guide LLMs to produce tailored content, without requiring extensive fine-tuning of the larger models.
14
+
15
+ ## 🎯 Our Approach
16
+
17
+ We train small, locally deployable language models to act as "steering wheels" for larger, pre-trained LLMs. These agents learn user preferences from small, personalized datasets and encode these preferences into concise natural language rules. These rules are then provided as context to the larger LLM, guiding its output towards the desired personalized style and content.
18
+
19
+ ## 📦 Resources
20
+
21
+ ### Datasets
22
+
23
+ We release three datasets for research on personalized language modeling:
24
+
25
+ * **Enron-42k:** A curated subset of the Enron email corpus, focused on original content creation. It contains approximately 40,240 emails from 191 unique senders.
26
+ * **The New Yorker:** A curated subset of the All The News 2.0 corpus, containing 4000 articles from the New Yorker.
27
+ * **LAMP 3U Subset:** A subset of the LAMP 3U Amazon product reviews dataset, containing 22,500 reviews from 15 users.
28
+
29
+ Both datasets are licensed under the Creative Commons Attribution-NonCommercial 4.0 International (CC BY-NC 4.0) license.
30
+
31
+ ### Models
32
+
33
+ We provide fine-tuned Llama-3-8B-Instruct preference agent models for the Enron-42k and LAMP 3U datasets. These agents can be used to generate personalized emails or product reviews based on user preferences.
34
+
35
+ **Note:** Due to licensing restrictions on the New Yorker dataset, we are unable to release models trained on that data.
36
+
37
+ ## 🚀 How to Use Our Resources
38
+
39
+ ### Datasets
40
+
41
+ ```python
42
+ from datasets import load_dataset
43
+
44
+ enron_dataset = load_dataset("preference-agents/Enron-42k", split="train")
45
+ lamp_dataset = load_dataset("preference-agents/LAMP-3U-Subset", split="train")
46
+ ```
47
+
48
+
49
+
50
+ ## 📊 Evaluation
51
+
52
+ Our approach was evaluated using GPT-4o and human evaluations, demonstrating significant improvements over baselines like zero-shot generation, few-shot learning, and naive fine-tuning.
53
+
54
+
55
+ ## 📜 Citation
56
+
57
+ If you use our resources in your research or applications, please cite our paper:
58
+
59
+ ```bibtex
60
+ pending
61
+ ```
62
+
63
+ ## 🙏 Acknowledgements
64
+
65
+ We thank Meta AI for providing the Llama-3 models, Google AI for access to Gemini 1.5 Pro, and Anthropic for access to Claude 3.5 Sonnet. We also acknowledge the creators of the Enron email corpus and the LAMP 3U dataset for making their valuable resources available to the research community.