Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
32
1
39
Phil
phil111
Follow
Gargaz's profile picture
AhmedMoneim's profile picture
koungho's profile picture
12 followers
·
14 following
AI & ML interests
None yet
Recent Activity
new
activity
27 days ago
mistralai/Mistral-Small-24B-Instruct-2501:
This Mistral Small has FAR less knowledge than the last.
liked
a model
about 1 month ago
deepseek-ai/DeepSeek-R1
new
activity
about 1 month ago
internlm/internlm3-8b-instruct:
English tests and tasks are absurdly overfit.
View all activity
Organizations
None yet
phil111
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
mistralai/Mistral-Small-24B-Instruct-2501
27 days ago
This Mistral Small has FAR less knowledge than the last.
20
#5 opened 29 days ago by
phil111
liked
a model
about 1 month ago
deepseek-ai/DeepSeek-R1
Text Generation
•
Updated
5 days ago
•
4.63M
•
•
10.5k
New activity in
internlm/internlm3-8b-instruct
about 1 month ago
English tests and tasks are absurdly overfit.
21
#8 opened about 1 month ago by
phil111
New activity in
microsoft/phi-4
about 2 months ago
A heavily filtered corpus simply doesn't work.
4
#19 opened about 2 months ago by
phil111
I Don't Understand This Model
16
#9 opened about 2 months ago by
phil111
New activity in
matteogeniaccio/phi-4
2 months ago
Notably better than Phi3.5 in many ways, but something is wrong.
8
#5 opened 2 months ago by
phil111
liked
a model
2 months ago
deepseek-ai/DeepSeek-V3
Text Generation
•
Updated
5 days ago
•
3.29M
•
•
3.57k
New activity in
deepseek-ai/DeepSeek-V3-Base
2 months ago
Very impressive. Good world knowledge (SimpleQA of 25) despite high math/coding performance.
2
#27 opened 2 months ago by
phil111
liked
a model
2 months ago
deepseek-ai/DeepSeek-V3-Base
Updated
5 days ago
•
557k
•
1.58k
New activity in
NyxKrage/Microsoft_Phi-4
2 months ago
SimpleQA score
2
#1 opened 2 months ago by
frappuccino
New activity in
ibm-granite/granite-3.1-8b-instruct
2 months ago
Exceptional creative writer
5
#1 opened 2 months ago by
SubtleOne
liked
2 models
2 months ago
ibm-granite/granite-3.1-8b-instruct
Text Generation
•
Updated
2 days ago
•
102k
•
152
QuantFactory/granite-3.1-8b-instruct-GGUF
Text Generation
•
Updated
Dec 19, 2024
•
548
•
7
New activity in
tiiuae/Falcon3-7B-Instruct
2 months ago
Very High English MMLU scores, Yet Extremely Low Broad English Knowledge
2
#8 opened 2 months ago by
phil111
New activity in
CohereForAI/c4ai-command-r7b-12-2024
2 months ago
How was r7b?
6
#3 opened 3 months ago by
MRU4913
Add Qwen 2.5 7B & Tulu 3 8B results to OLLM benchmarks
12
#1 opened 3 months ago by
Fizzarolli
New activity in
meta-llama/Llama-3.3-70B-Instruct
2 months ago
local Llama + GPU(cuda)
7
#34 opened 3 months ago by
Luciolla
New activity in
meta-llama/Llama-3.3-70B-Instruct
3 months ago
Base Model?
3
#32 opened 3 months ago by
User8213
New activity in
open-llm-leaderboard/open_llm_leaderboard
3 months ago
Add Hymba-1.5B to the leaderboard
3
#1030 opened 3 months ago by
pmolchanov
liked
a model
3 months ago
lmstudio-community/Llama-3.3-70B-Instruct-GGUF
Text Generation
•
Updated
Dec 6, 2024
•
33.4k
•
46
Load more