Update README.md
Browse files
README.md
CHANGED
@@ -1,3 +1,39 @@
|
|
1 |
-
---
|
2 |
-
license: apache-2.0
|
3 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
license: apache-2.0
|
3 |
+
base_model:
|
4 |
+
- meditsolutions/MSH-v1-Bielik-v2.3-Instruct-MedIT-merge
|
5 |
+
- speakleash/Bielik-11B-v2.3-Instruct
|
6 |
+
pipeline_tag: text-generation
|
7 |
+
tags:
|
8 |
+
- medit-lite
|
9 |
+
- model-pruning
|
10 |
+
- text-generation
|
11 |
+
language:
|
12 |
+
- pl
|
13 |
+
- en
|
14 |
+
---
|
15 |
+
|
16 |
+
<div align="center">
|
17 |
+
<img src="https://i.ibb.co/YLfCzXR/imagine-image-c680e106-e404-45e5-98da-af700ffe41f4.png" alt="MSH-Lite" style="border-radius: 10px; box-shadow: 0 4px 8px 0 rgba(0, 0, 0, 0.2), 0 6px 20px 0 rgba(0, 0, 0, 0.19); max-width: 100%; height: auto;">
|
18 |
+
</div>
|
19 |
+
|
20 |
+
# Marsh Harrier Lite (MSH-Lite)
|
21 |
+
|
22 |
+
Marsh Harrier Lite (MSH-Lite) is a compact, efficient version of the [MedIT Solutions MSH-v1-Bielik-v2.3-Instruct-MedIT-merge](meditsolutions/MSH-v1-Bielik-v2.3-Instruct-MedIT-merge) model, reduced to 7 billion parameters using advanced pruning techniques. This pruning retains the core functionality and efficiency of the original model while optimizing for reduced computational resource usage.
|
23 |
+
|
24 |
+
## Key Features:
|
25 |
+
- **Pruned Model**: Reduced from 11B to 7B parameters using the pruning method based on the [MedIT Solutions LLaMA pruning framework](https://github.com/MedITSolutionsKurman/llama-pruning).
|
26 |
+
- **Optimized Performance**: Despite its reduced size, MSH-Lite delivers competitive performance across a wide array of NLP tasks.
|
27 |
+
- **Bilingual Support**: Designed to handle both Polish (pl) and English (en) with high fluency.
|
28 |
+
|
29 |
+
## Technical Details:
|
30 |
+
- **Base Model**: [MSH-v1-Bielik-v2.3-Instruct-MedIT-merge](https://huggingface.co/meditsolutions/MSH-v1-Bielik-v2.3-Instruct-MedIT-merge)
|
31 |
+
- **Parameter Count**: 7 billion
|
32 |
+
- **Architecture**: Derived from Bielik's core architecture, with parameter optimization.
|
33 |
+
- **Model Efficiency**: Ideal for deployments where computational efficiency is paramount.
|
34 |
+
|
35 |
+
## Performance Highlights:
|
36 |
+
To be done.
|
37 |
+
|
38 |
+
### Acknowledgments:
|
39 |
+
Gratitude to the **[SpeakLeash](https://speakleash.org)** project and **[ACK Cyfronet AGH](https://www.cyfronet.pl/)** for their contributions and collaboration.
|