Ahmet's picture

Ahmet

atasoglu

·

atasoglu

AI & ML interests

NLP, LLMs.

Recent Activity

liked a dataset 1 day ago

selimc/bilmecebench

upvoted a collection 6 days ago

Feb 14 Releases 💌

reacted to merve's post with ❤️ 6 days ago

Your weekly recap of open AI is here, and it's packed with models! https://huggingface.co./collections/merve/feb-14-releases-67af876b404cc27c6d837767 👀 Multimodal > OpenGVLab released InternVideo 2.5 Chat models, new video LMs with long context > AIDC released Ovis2 model family along with Ovis dataset, new vision LMs in different sizes (1B, 2B, 4B, 8B, 16B, 34B), with video and OCR support > ColQwenStella-2b is a multilingual visual retrieval model that is sota in it's size > Hoags-2B-Exp is a new multilingual vision LM with contextual reasoning, long context video understanding 💬 LLMs A lot of math models! > Open-R1 team released OpenR1-Math-220k large scale math reasoning dataset, along with Qwen2.5-220K-Math fine-tuned on the dataset, OpenR1-Qwen-7B > Nomic AI released new Nomic Embed multilingual retrieval model, a MoE with 500 params with 305M active params, outperforming other models > DeepScaleR-1.5B-Preview is a new DeepSeek-R1-Distill fine-tune using distributed RL on math > LIMO is a new fine-tune of Qwen2.5-32B-Instruct on Math 🗣️ Audio > Zonos-v0.1 is a new family of speech recognition models, which contains the model itself and embeddings 🖼️ Vision and Image Generation > We have ported DepthPro of Apple to transformers for your convenience! > illustrious-xl-v1.0 is a new illustration generation model

View all activity

Organizations

atasoglu's activity

upvoted a collection 6 days ago

Feb 14 Releases 💌

23 items • Updated 6 days ago • 7

upvoted a collection 13 days ago

🧠 Reasoning datasets

Datasets with reasoning traces for math and code released by the community • 12 items • Updated about 11 hours ago • 74

upvoted an article 15 days ago

Article

Open-source DeepResearch – Freeing our search agents

17 days ago

• 1.06k

upvoted a collection 23 days ago

DeepSeek-R1

8 items • Updated about 1 month ago • 519

upvoted a collection 25 days ago

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 3 items • Updated 25 days ago • 356

upvoted a paper 27 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 29 days ago • 325

upvoted a collection 27 days ago

Jan 24 Releases

39 items • Updated 27 days ago • 7

upvoted 3 articles 28 days ago

Article

Visual Document Retrieval Goes Multilingual

Jan 10

• 68

Article

Train 400x faster Static Embedding Models with Sentence Transformers

Jan 15

• 147

Article

Mastering Long Contexts in LLMs with KVPress

By

and 1 other •

28 days ago

• 62

upvoted a collection 28 days ago

SmolVLM 256M & 500M

Collection for models & demos for even smoller SmolVLM release • 12 items • Updated about 6 hours ago • 69

upvoted an article 28 days ago

Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

29 days ago

• 142

upvoted a paper about 1 month ago

Enhancing Human-Like Responses in Large Language Models

Paper • 2501.05032 • Published Jan 9 • 49

upvoted 5 collections about 1 month ago

Human-Like LLMs

Human-Like LLMs series. • 5 items • Updated Jan 20 • 12

Cosmos Tokenizer

A suite of image and video tokenizers • 13 items • Updated Jan 17 • 39

Cosmos

The collection of Cosmos models • 31 items • Updated Jan 17 • 261

Jan 10 Releases 🌨️

38 items • Updated Jan 10 • 12

NeMo Audio Codecs

A series of Neural Audio Codecs • 5 items • Updated Jan 17 • 11

upvoted a collection about 2 months ago

QVQ

QVQ: Qwen models for visual reasoning • 7 items • Updated Jan 1 • 43

upvoted a paper about 2 months ago

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Paper • 2412.10302 • Published Dec 13, 2024 • 17