2 59 225

Blanc Swan

blancsw

https://swan-blanc.fr/

AI & ML interests

ChatBot

Recent Activity

liked a model about 10 hours ago

google/siglip2-base-patch16-224

upvoted a paper 10 days ago

InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU

liked a model 15 days ago

Unbabel/TowerInstruct-Mistral-7B-v0.2

View all activity

Organizations

blancsw's activity

liked a model about 10 hours ago

google/siglip2-base-patch16-224

Zero-Shot Image Classification • Updated 8 days ago • 8.84k • 25

upvoted a paper 10 days ago

InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU

Paper • 2502.08910 • Published 16 days ago • 142

liked a model 15 days ago

Unbabel/TowerInstruct-Mistral-7B-v0.2

Translation • Updated Sep 4, 2024 • 1.45k • 16

liked a dataset 15 days ago

Unbabel/TowerBlocks-v0.1

Viewer • Updated Mar 4, 2024 • 637k • 128 • 28

liked a model 15 days ago

LLaMAX/LLaMAX3-8B

Text Generation • Updated Dec 6, 2024 • 162 • 35

liked a Space 16 days ago

592

Open Deep-Research

🏆

OpenAI's Deep Research, but open

upvoted an article 16 days ago

Article

Open-source DeepResearch – Freeing our search agents

25 days ago

• 1.11k

upvoted 2 papers 16 days ago

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published 18 days ago • 140

TransMLA: Multi-head Latent Attention Is All You Need

Paper • 2502.07864 • Published 17 days ago • 45

updated a model 17 days ago

Infomaniak-AI/smolLM2-135M-Instruct-movie-reco

Updated 17 days ago

published a model 18 days ago

Infomaniak-AI/smolLM2-135M-Instruct-movie-reco

Updated 17 days ago

updated a model 18 days ago

Infomaniak-AI/smolLM2-135M-Instruct-structure-output

Text Generation • Updated 18 days ago • 44

published 2 models 18 days ago

blancsw/SmolLM2-135M-Instruct-structure-output

Updated 18 days ago

Infomaniak-AI/smolLM2-135M-Instruct-structure-output

Text Generation • Updated 18 days ago • 44

upvoted an article 19 days ago

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16, 2024

• 323

liked a dataset 19 days ago

ChristianAzinn/json-training

Viewer • Updated Aug 23, 2024 • 20.6k • 412 • 16

liked a model 20 days ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-32B

Text Generation • Updated 5 days ago • 1.26M • • 1.2k

liked a dataset 21 days ago

IJUN/FakeNews

Viewer • Updated Jan 13 • 362 • 77 • 2

upvoted an article 25 days ago

Article

SmolVLM - small yet mighty Vision Language Model

Nov 26, 2024

• 205

liked a model 27 days ago