IntelligentEstate
/

Tangu-3B-Qwenstar-Q8-GGUF

Text Generation

Transformers

GGUF

Model card Files Files and versions Community

fuzzy-mittenz commited on 7 days ago

Commit

9530142

verified ·

1 Parent(s): 430027a

Update README.md

Browse files

![{A2792263-DC35-4ED6-AA74-A64812F2E396}.png](https://cdn-uploads.huggingface.co/production/uploads/6593502ca2607099284523db/7NYXxCVwBZG58gbcQVxz6.png)

Files changed (1) hide show

README.md +11 -0

README.md CHANGED Viewed

@@ -64,3 +64,14 @@ So far not neccisary but may be tuned as needed for suggestions refer to Reasoni
 ### Other models
 This should work well on other UIs the [original model](https://huggingface.co/PowerInfer/SmallThinker-3B-Preview) has usage instructions for them

 ### Other models
 This should work well on other UIs the [original model](https://huggingface.co/PowerInfer/SmallThinker-3B-Preview) has usage instructions for them
+## Benchmark Performance/Without the JavaScript/code_interpreter in GPT4ALL Should easily obtain o1 levals at a reasonable even without GPU Though you may need patience depending on your setup.
+| Model | AIME24 | AMC23 | GAOKAO2024_I | GAOKAO2024_II | MMLU_STEM | AMPS_Hard | math_comp |
+|---------|--------|-------|--------------|---------------|-----------|-----------|-----------|
+| Qwen2.5-3B-Instruct | 6.67 | 45 | 50 | 35.8 | 59.8 | - | - |
+| SmallThinker | 16.667 | 57.5 | 64.2 | 57.1 | 68.2 | 70 | 46.8 |
+| GPT-4o | 9.3 | - | - | - | 64.2 | 57 | 50 |
+Example
+![{A2792263-DC35-4ED6-AA74-A64812F2E396}.png](https://cdn-uploads.huggingface.co/production/uploads/6593502ca2607099284523db/7NYXxCVwBZG58gbcQVxz6.png)