Phil's picture

Phil

phil111

·

AI & ML interests

None yet

Recent Activity

new activity 27 days ago

mistralai/Mistral-Small-24B-Instruct-2501:This Mistral Small has FAR less knowledge than the last.

liked a model about 1 month ago

deepseek-ai/DeepSeek-R1

new activity about 1 month ago

internlm/internlm3-8b-instruct:English tests and tasks are absurdly overfit.

View all activity

Organizations

None yet

phil111's activity

New activity in mistralai/Mistral-Small-24B-Instruct-2501 27 days ago

This Mistral Small has FAR less knowledge than the last.

#5 opened 29 days ago by

liked a model about 1 month ago

deepseek-ai/DeepSeek-R1

Text Generation • Updated 5 days ago • 4.63M • • 10.5k

New activity in internlm/internlm3-8b-instruct about 1 month ago

English tests and tasks are absurdly overfit.

#8 opened about 1 month ago by

New activity in microsoft/phi-4 about 2 months ago

A heavily filtered corpus simply doesn't work.

#19 opened about 2 months ago by

I Don't Understand This Model

#9 opened about 2 months ago by

New activity in matteogeniaccio/phi-4 2 months ago

Notably better than Phi3.5 in many ways, but something is wrong.

#5 opened 2 months ago by

liked a model 2 months ago

deepseek-ai/DeepSeek-V3

Text Generation • Updated 5 days ago • 3.29M • • 3.57k

New activity in deepseek-ai/DeepSeek-V3-Base 2 months ago

Very impressive. Good world knowledge (SimpleQA of 25) despite high math/coding performance.

#27 opened 2 months ago by

liked a model 2 months ago

deepseek-ai/DeepSeek-V3-Base

Updated 5 days ago • 557k • 1.58k

New activity in NyxKrage/Microsoft_Phi-4 2 months ago

SimpleQA score

#1 opened 2 months ago by

New activity in ibm-granite/granite-3.1-8b-instruct 2 months ago

Exceptional creative writer

#1 opened 2 months ago by

liked 2 models 2 months ago

ibm-granite/granite-3.1-8b-instruct

Text Generation • Updated 2 days ago • 102k • 152

QuantFactory/granite-3.1-8b-instruct-GGUF

Text Generation • Updated Dec 19, 2024 • 548 • 7

New activity in tiiuae/Falcon3-7B-Instruct 2 months ago

Very High English MMLU scores, Yet Extremely Low Broad English Knowledge

#8 opened 2 months ago by

New activity in CohereForAI/c4ai-command-r7b-12-2024 2 months ago

How was r7b?

#3 opened 3 months ago by

Add Qwen 2.5 7B & Tulu 3 8B results to OLLM benchmarks

#1 opened 3 months ago by

New activity in meta-llama/Llama-3.3-70B-Instruct 2 months ago

local Llama + GPU(cuda)

#34 opened 3 months ago by

New activity in meta-llama/Llama-3.3-70B-Instruct 3 months ago

Base Model?

#32 opened 3 months ago by

New activity in open-llm-leaderboard/open_llm_leaderboard 3 months ago

Add Hymba-1.5B to the leaderboard

#1030 opened 3 months ago by

liked a model 3 months ago

lmstudio-community/Llama-3.3-70B-Instruct-GGUF

Text Generation • Updated Dec 6, 2024 • 33.4k • 46