jiakai

real-jiakai

https://blog.gujiakai.top

AI & ML interests

LLM && Smart QA

Recent Activity

liked a model about 7 hours ago

mistralai/Mistral-Small-24B-Instruct-2501

liked a model about 7 hours ago

mistralai/Mistral-Small-24B-Base-2501

liked a model about 7 hours ago

m-a-p/YuE-s1-7B-anneal-zh-icl

View all activity

Organizations

real-jiakai's activity

liked 3 models about 7 hours ago

upvoted an article about 8 hours ago

Article

Model2Vec: Distill a Small Fast Model from any Sentence Transformer

•

Oct 14, 2024

• 65

liked a model about 16 hours ago

UnfilteredAI/NSFW-gen-v2.1

Text-to-Image • Updated May 16, 2024 • 3.01k • 55

liked a model 1 day ago

huihui-ai/DeepSeek-R1-Distill-Qwen-14B-abliterated-v2

Text Generation • Updated 10 days ago • 1.7k • 36

liked a Space 2 days ago

Running

🤖

CoT-Lab: Human-AI Co-Thinking Laboratory

upvoted a collection 3 days ago

Tulu 3 Models

Collection

All models released with Tulu 3 -- state of the art open post-training recipes. • 10 items • Updated 5 days ago • 83

upvoted a paper 4 days ago

Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate

Paper • 2501.17703 • Published 5 days ago • 45

liked a model 4 days ago

netease-youdao/Confucius-o1-14B

Text Generation • Updated 12 days ago • 182 • 31

liked a Space 4 days ago

Running

363

🐢

Qwen2.5 Max Demo

liked a model 4 days ago

deepseek-ai/DeepSeek-R1-Distill-Llama-70B

Text Generation • Updated 3 days ago • 142k • 438

liked 2 datasets 5 days ago

OnDeviceMedNotes/synthetic-medical-conversations-deepseek-v3

Viewer • Updated 6 days ago • 143k • 181 • 28

promptfoo/CCP-sensitive-prompts

Viewer • Updated 6 days ago • 1.36k • 165 • 25

liked a Space 6 days ago

Running on Zero

193

🏃

JanusFlow 1.3B

Huggingface space for JanusFlow-1.3B

upvoted a paper 6 days ago

Chain-of-Retrieval Augmented Generation

Paper • 2501.14342 • Published 10 days ago • 41

upvoted an article 6 days ago

Article

Train 400x faster Static Embedding Models with Sentence Transformers

20 days ago

• 132

liked a Space 7 days ago

Running on Zero

1.29k

🌍

Chat With Janus-Pro-7B

A unified multimodal understanding and generation model.

liked 2 models 7 days ago

Qwen/Qwen2.5-VL-72B-Instruct

Image-Text-to-Text • Updated 7 days ago • 16.7k • 201

Qwen/Qwen2.5-VL-7B-Instruct

Image-Text-to-Text • Updated 7 days ago • 127k • 288