AI & ML interests

None defined yet.

HuggingFace-CN-community's activity

AdinaYΒ 
posted an update 1 day ago
AdinaYΒ 
posted an update 3 days ago
AdinaYΒ 
posted an update 3 days ago
AdinaYΒ 
posted an update 5 days ago
view post
Post
2549
What happened yesterday in the Chinese AI community? πŸš€

T2A-01-HD πŸ‘‰ https://hailuo.ai/audio
MiniMax's Text-to-Audio model, now in Hailuo AI, offers 300+ voices in 17+ languages and instant emotional voice cloning.

Tare πŸ‘‰ https://www.trae.ai/
A new coding tool by Bytedance for professional developers, supporting English & Chinese with free access to Claude 3.5 and GPT-4 for a limited time.

DeepSeek-R1 Series πŸ‘‰ deepseek-ai/deepseek-r1-678e1e131c0169c0bc89728d
Open-source reasoning models with MIT license by DeepSeek.

Kimi K 1.5 πŸ‘‰ https://github.com/MoonshotAI/Kimi-k1.5 | https://kimi.ai/
An O1-level multi-modal model by MoonShot AI, utilizing reinforcement learning with long and short-chain-of-thought and supporting up to 128k tokens.

And today…

Hunyuan 3D-2.0 πŸ‘‰ tencent/Hunyuan3D-2
A SoTA 3D synthesis system for high-res textured assets by Tencent Hunyuan , with open weights and code!

Stay tuned for more updates πŸ‘‰ https://huggingface.co./zh-ai-community
AdinaYΒ 
posted an update 5 days ago
view post
Post
722
Hunyuan 3D 2.0πŸ”₯ a synthesis system for high-res textured 3D assets released by Tencent Hunyuan

2 key components: Hunyuan3D-DiT (geometry) and Hunyuan3D-Paint (textures) work together, achieving highly realistic 3D results.

Model: tencent/Hunyuan3D-2
Demo coming soon!
AdinaYΒ 
posted an update 6 days ago
view post
Post
2744
BIG release by DeepSeek AIπŸ”₯πŸ”₯πŸ”₯

DeepSeek-R1 & DeepSeek-R1-Zero: two 660B reasoning models are here, alongside 6 distilled dense models (based on Llama & Qwen) for the community!
https://huggingface.co./deepseek-ai
deepseek-ai/DeepSeek-R1

✨ MIT License : enabling distillation for custom models
✨ 32B & 70B models match OpenAI o1-mini in multiple capabilities
✨ API live now! Access Chain of Thought reasoning with model='deepseek-reasoner'
AdinaYΒ 
posted an update 9 days ago
AdinaYΒ 
posted an update 11 days ago
AdinaYΒ 
posted an update 11 days ago
view post
Post
3076
MiniMax, the company behind Hailuo_AI, has joined the open source community by releasing both models and demos of MiniMax-Text-01 & MiniMax-VL-01πŸ”₯
- Model
MiniMaxAI/MiniMax-VL-01
MiniMaxAI/MiniMax-Text-01
- Demo
MiniMaxAI/MiniMax-VL-01
MiniMaxAI/MiniMax-Text-01

✨ MiniMax-text-01:
- 456B with 45.9B activated per token
- Combines Lightning Attention, Softmax Attention, and MoE for optimal performance
- Training context up to 1M tokens, inference handles 4M tokens

✨ MiniMax-VL-01:
- ViT-MLP-LLM framework ( non-transformerπŸ‘€)
- Handles image inputs from 336Γ—336 to 2016Γ—2016
- 694M image-caption pairs + 512B tokens processed across 4 stages
  • 1 reply
Β·
AdinaYΒ 
posted an update 12 days ago
view post
Post
3164
MiniCPM-o2.6 πŸ”₯ an end-side multimodal LLMs released by OpenBMB from the Chinese community
Model: openbmb/MiniCPM-o-2_6
✨ Real-time English/Chinese conversation, emotion control and ASR/STT
✨ Real-time video/audio understanding
✨ Processes up to 1.8M pixels, leads OCRBench & supports 30+ languages
AdinaYΒ 
posted an update 16 days ago
AdinaYΒ 
posted an update 20 days ago
AdinaYΒ 
posted an update about 1 month ago
view post
Post
3604
The Chinese community is shipping 🚒

DeepSeek V3 (685 B MoE) has quietly released on the hub!
Base: deepseek-ai/DeepSeek-V3-Base
Instruct: deepseek-ai/DeepSeek-V3

Can’t wait to see what’s next!
  • 1 reply
Β·
AdinaYΒ 
posted an update about 1 month ago
view post
Post
3021
QvQ-72B-PreviewπŸŽ„ an open weight model for visual reasoning just released by Alibaba_Qwen team
Qwen/qvq-676448c820912236342b9888
✨ Combines visual understanding & language reasoning.
✨ Scores 70.3 on MMMU
✨ Outperforms Qwen2-VL-72B-Instruct in complex problem-solving
AdinaYΒ 
posted an update about 1 month ago
view post
Post
552
Megrez-3B-Omni πŸ”₯ an on-device multimodal LLM by Infinigence AI, another startup emerging from the Tsinghua University ecosystem.
Model: Infinigence/Megrez-3B-Omni
Demo: Infinigence/Megrez-3B-Omni
✨Supports analysis of image, text, and audio modalities
✨Leads in bilingual speech ( English & Chinese ) input, multi-turn conversations, and voice-based queries
✨Outperforms in scene understanding and OCR across major benchmarks
AdinaYΒ 
posted an update about 2 months ago
view post
Post
891
Updates from the Chinese community last week πŸ”₯

LLM:
✨ Sailor 2 , multilingual model supporting 10+ South Asian languages by Sea AI Lab. https://huggingface.co./sailor2

MLLM:
✨InternVL 2.5 , new open multimodal LLM by OpenGVLab
https://huggingface.co./collections/OpenGVLab/internvl-25-673e1019b66e2218f68d7c1c
✨Qwen2-VL 2B/7B/72B base model, the latest iteration of our Qwen-VL model by Alibaba Qwen
Qwen/qwen2-vl-66cee7455501d7126940800d

Video model:
✨HunyuanVideo , 13B open video model by Tencent
tencent/HunyuanVideo

Reasoning model:
✨ LLaMA-O1 πŸ¦™ base & supervised model; pretrain & finetune datasets and demo all released
zh-ai-community/reasoning-models-67409fb3aa1ed78f10087cd7

Audio model:
✨Fish Speech 1.5, Text-to-speech in 13 languages, trained on 1M+ hours of audio by FishAudio
fishaudio/fish-speech-1.5
✨ClearVoice, An advanced voice processing framework by Alibaba Tongyi SpeechAI https://huggingface.co./alibabasglab

More details πŸ‘‰ https://huggingface.co./zh-ai-community
AdinaYΒ 
posted an update about 2 months ago
view post
Post
1588
Sailor 2 🚒 open multilingual model for Southeast Asia by Sea AI LabπŸ”₯
https://huggingface.co./sailor2
sail/Sailor2-20B-Chat

✨ Fully open code & ALL datasets πŸ™Œ
✨ 1B/ 8B/20B base & chat expanded on Qwen2.5
✨ Apache 2.0
✨ Supports 15 languages including English, Chinese, Burmese, Cebuano, Ilocano, Indonesian, Javanese, Khmer, Lao, Malay, Sundanese, Tagalog, Thai, Vietnamese, and WarayπŸ‡¬πŸ‡§πŸ‡¨πŸ‡³πŸ‡±πŸ‡¦πŸ‡²πŸ‡ΎπŸ‡²πŸ‡²πŸ‡»πŸ‡³πŸ‡ΉπŸ‡­
AdinaYΒ 
posted an update about 2 months ago
view post
Post
1487
2023 & 2024 Top Downloaded (all time) Open Models on the hub are both from the Chinese community πŸ‘€

2023 πŸ‘‰ Bge base by BAAI
BAAI/bge-base-en-v1.5
2024 πŸ‘‰ Qwen 2.5 by Alibaba Qwen
Qwen/Qwen2.5-1.5B-Instruct

Can’t wait to see what incredible models the Chinese community will bring in 2025πŸš€

✨ Follow https://huggingface.co./zh-ai-community to get the latest updates from the Chinese community
✨ Explore the 2024 Year in Review huggingface/open-source-ai-year-in-review-2024
AdinaYΒ 
posted an update about 2 months ago
view post
Post
1357
HunyuanVideo πŸ“Ή The new open video generation model by Tencent!
πŸ‘‰ tencent/HunyuanVideo
zh-ai-community/video-models-666afd86cfa4e4dd1473b64c
✨ 13B parameters: Probably the largest open video model to date
✨ Unified architecture for image & video generation
✨ Powered by advanced features: MLLM Text Encoder, 3D VAE, and Prompt Rewrite
✨ Delivers stunning visuals, diverse motion, and unparalleled stability
πŸ”“ Fully open with code & weights
AdinaYΒ 
posted an update about 2 months ago
view post
Post
1127
Zhipu AI, the Chinese generative AI startup behind CogVideo, just launched their first productized AI Agent - AutoGLM πŸ”₯
πŸ‘‰ https://agent.aminer.cn

With simple text or voice commands, it:
✨ Simulates phone operations effortlessly
✨ Autonomously handles 50+ step tasks
✨ Seamlessly operates across apps

Powered by Zhipu's "Decoupled Interface" and "Self-Evolving Learning Framework" to achieve major performance gains in Phone Use and Web Browser Use!

Meanwhile, GLM4-Edge is now on Hugging Face hubπŸš€
πŸ‘‰ THUDM/glm-edge-6743283c5809de4a7b9e0b8b
Packed with advanced dialogue + multimodal models:
πŸ“± 1.5B / 2B models: Built for mobile & in-car systems
πŸ’» 4B / 5B models: Optimized for PCs