apple
/

DCLM-7B

Transformers

Safetensors

openlm

Inference Endpoints

Model card Files Files and versions Community

vaishaal commited on Jul 18

Commit

4b666fb

•

1 Parent(s): 099daf7

Update README.md

Browse files

Files changed (1) hide show

README.md +26 -0

README.md CHANGED Viewed

@@ -139,6 +139,32 @@ Here are the evaluation results for DCLM-Baseline-7B on various tasks (using [ll
 Note: All scores are presented as decimal values between 0 and 1, representing the proportion of correct answers or the model's performance on each task.
 ## Limitations and Biases
 While DCLM-Baseline-7B demonstrates strong performance across a range of tasks, it's important to note:

 Note: All scores are presented as decimal values between 0 and 1, representing the proportion of correct answers or the model's performance on each task.
+## Comparison
+Below are comparisions of this model with other models in the 7B regime.
+| Model         | Params | Tokens | Open dataset? | CORE     | MMLU     | EXTENDED |
+|---------------|--------|--------|---------------|----------|----------|----------|
+| **Open weights, closed datasets** |        |        |               |          |          |          |
+| Llama2        | 7B     | 2T     | ✗             | 49.2     | 45.8     | 34.1     |
+| DeepSeek      | 7B     | 2T     | ✗             | 50.7     | 48.5     | 35.3     |
+| Mistral-0.3   | 7B     | ?      | ✗             | 57.0     | 62.7     | 45.1     |
+| QWEN-2        | 7B     | ?      | ✗             | 57.5     | **71.9** | 50.5     |
+| Llama3        | 8B     | 15T    | ✗             | 57.6     | 66.2     | 46.3     |
+| Gemma         | 8B     | 6T     | ✗             | 57.8     | 64.3     | 44.6     |
+| Phi-3         | 7B     | ?      | ✗             | **61.0** | 69.9     | **57.9** |
+| **Open weights, open datasets** |        |        |               |          |          |          |
+| Falcon        | 7B     | 1T     | ✓             | 44.1     | 27.4     | 25.1     |
+| OLMo-1.7      | 7B     | 2.1T   | ✓             | 47.0     | 54.0     | 34.2     |
+| MAP-Neo       | 7B     | 4.5T   | ✓             | **50.2** | **57.1** | **40.4** |
+| **Models we trained** |        |        |               |          |          |          |
+| FineWeb edu   | 7B     | 0.14T  | ✓             | 38.7     | 26.3     | 22.1     |
+| FineWeb edu   | 7B     | 0.28T  | ✓             | 41.9     | 37.3     | 24.5     |
+| **DCLM-7B** | 7B     | 2.5T   | ✓             | **56.1** | **63.7** | **43.6** |
 ## Limitations and Biases
 While DCLM-Baseline-7B demonstrates strong performance across a range of tasks, it's important to note: