merve
posted an update 2 days ago
This week in open AI was 🔥 Let's recap! 🤗 merve/january-31-releases-679a10669bd4030090c5de4d
LLMs 💬
> Huge: AllenAI released new Tülu models that outperform DeepSeek R1 using Reinforcement Learning with Verifiable Reward (RLVR) based on Llama 3.1 405B 🔥
> Mistral AI is back to open-source with their "small" 24B models (base & SFT), under the Apache 2.0 license 😱
> Alibaba Qwen released their 1M context length models Qwen2.5-Instruct-1M, great for agentic use, under the Apache 2.0 license 🔥
> Arcee AI released Virtuoso-medium, a 32.8B LLM distilled from DeepSeek V3 with a dataset of 5B+ tokens
> Velvet-14B is a new family of 14B Italian LLMs trained on 10T tokens in six languages
> OpenThinker-7B is a fine-tuned version of Qwen2.5-7B-Instruct on the OpenThoughts dataset

VLMs & vision 👀
> Alibaba Qwen is back with Qwen2.5VL, with amazing new capabilities ranging from agentic computer use to zero-shot localization 🔥
> NVIDIA released a new series of Eagle2 models in 1B and 9B sizes
> DeepSeek released Janus-Pro, a new any-to-any model (image-text generation from image-text input) under the MIT license
> BEN2 is a new background removal model under the MIT license!

Audio 🗣️
> YuE is a new open-source music generation foundation model, lyrics-to-song generation

Codebase 👩🏻‍💻
> We are open-sourcing our SmolVLM training and eval codebase! https://github.com/huggingface/smollm/tree/main/vision
> Open-R1 is an open-source reproduction of R1 by the @huggingface science team https://huggingface.co./blog/open-r1

This week in "Open"-AI was 🔥 Let's recap! 🤗
merve/january-31-releases-679a10669bd4030090c5de4d
LLMs 💬
Huge: AllenAI released new Tülu models that outperform DeepSeek R1 using Reinforcement Learning with Verifiable Reward (RLVR) based on Llama 3.1 405B

Tülu models are based on the proprietary Llama license and thus do not fit into the same category as DeepSeek, which is truly free software and which suddenly helped so many other companies.

You can't compare apples with oranges. Well, you can, but the difference is obviously huge, and the person making the comparison doesn't get the kudos he or she wanted.

Sadly, AllenAI is pretending to be "Open", joining companies like Meta in deceiving and betraying the community in order to enter it. It is a type of deceitful propaganda: words such as "Open" are not protected, and they tend to attract a good portion of the oblivious community.

The proprietary software category cannot be compared to the free software category.

DeepSeek has reached its popularity because it is free software.

Infecting the AI space with proprietary software like Tülu by AllenAI looks to me like US propaganda against China.

I truly liked AllenAI, but now I see how tricky they are, and I feel deeply hurt by that betrayal.

References:

Meta’s LLaMa 2 license is not Open Source – Open Source Initiative:
https://opensource.org/blog/metas-llama-2-license-is-not-open-source

The Open Source Definition – Open Source Initiative:
https://opensource.org/osd

What is Free Software? - GNU Project - Free Software Foundation:
https://www.gnu.org/philosophy/free-sw.html

Word "Open" as in "Open Source" - Words to Avoid (or Use with Care) Because They Are Loaded or Confusing:
https://www.gnu.org/philosophy/words-to-avoid.html#Open
