Update README.md
README.md
@@ -1,3 +1,46 @@
---
license: apache-2.0
---
# Model Card

### Model Name: Delexa-7b

#### Overview:

**Purpose:** Delexa-7b is our newest large language model, designed for general-purpose language tasks. It is still under development, with ongoing improvements and testing.

**Status:** Active development and refinement. More comprehensive evaluation results will be available soon.

**Skills:** In initial evaluations with llm-judge, Delexa-7b performs well on general tasks.

**Guardrails:** This model allows 18+ and lewd content, but it won't let any illegal content through (unless you jailbreak it).

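**Evaluation:** Preliminary llm-judge results are promising. Delexa-7b's average score of 8.14 is higher than gpt-3.5-turbo (7.94) and claude-v1 (7.90) and lower than gpt-4 (8.99). Its first-turn score (8.70) is stronger than its second-turn score (7.59), which is below both gpt-3.5-turbo and claude-v1. More detailed evaluations will follow.
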
| model                 | first turn score | second turn score | average score |
|-----------------------|------------------|-------------------|---------------|
| gpt-4                 | 8.95625          | 9.0250            | 8.990625      |
| **Delexa-7b**         | **8.70000**      | 7.5875            | **8.143750**  |
| gpt-3.5-turbo         | 8.07500          | 7.8125            | 7.943750      |
| claude-v1             | 8.15000          | 7.6500            | 7.900000      |
| palm-2-chat-bison-001 | 6.71250          | 6.0875            | 6.400000      |
| vicuna-13b-v1.3       | 6.81250          | 5.9625            | 6.387500      |

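The average score column in the table above appears to be the plain mean of the first-turn and second-turn scores. That is an inference from the published numbers, not something the card states. The short sketch below recomputes the averages and the resulting ranking under that assumption; the names `turn_scores` and `average_score` are illustrative and not part of any llm-judge tooling.

```python
# Turn scores copied from the table above: (first turn, second turn).
turn_scores = {
    "gpt-4": (8.95625, 9.0250),
    "Delexa-7b": (8.70000, 7.5875),
    "gpt-3.5-turbo": (8.07500, 7.8125),
    "claude-v1": (8.15000, 7.6500),
    "palm-2-chat-bison-001": (6.71250, 6.0875),
    "vicuna-13b-v1.3": (6.81250, 5.9625),
}


def average_score(first_turn: float, second_turn: float) -> float:
    """Assumed definition: unweighted mean of the two turn scores."""
    return (first_turn + second_turn) / 2


# Recompute the average for every model and rank from highest to lowest.
averages = {model: average_score(*scores) for model, scores in turn_scores.items()}
ranking = sorted(averages, key=averages.get, reverse=True)

for model in ranking:
    print(f"{model:<24}{averages[model]:.6f}")
```

Under this assumption the recomputed averages match the table, which places Delexa-7b second overall even though its second-turn score is lower than that of gpt-3.5-turbo and claude-v1.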
**Intended Use:**

* Exploring the capabilities of new language models.
* Experimentation and learning for AI development enthusiasts.
* Potential applications in areas where STEM reasoning is essential.

**Potential Risks:**

* Like other uncensored large language models, Delexa-7b will generate harmful, biased, or offensive content if asked to. Responsible use and careful monitoring are essential if you deploy this model in production for your business.

**Ethical Considerations:**

* Delexa-7b is in the early stages of development. We are committed to ongoing evaluation to identify potential biases and address them proactively.
* Updates to this model card will ensure transparency as Delexa-7b evolves.

### Additional Notes

Delexa-7b represents an exciting development with the potential to deliver impressive results. We invite the community to explore its capabilities and provide feedback as we continue to refine it.