Charleno Pires's picture
5 55

Charleno Pires

charleno

AI & ML interests

None yet

Recent Activity

liked a model 8 days ago
perplexity-ai/r1-1776
liked a model 8 days ago
microsoft/Phi-4-multimodal-instruct
liked a model 8 days ago
Wan-AI/Wan2.1-T2V-14B
View all activity

Organizations

None yet

charleno's activity

reacted to AdinaY's post with πŸš€ 25 days ago
view post
Post
3556
InspireMusic 🎡πŸ”₯ an open music generation framework by Alibaba FunAudio Lab
Model: FunAudioLLM/InspireMusic-1.5B-Long
Demo: FunAudioLLM/InspireMusic
✨ Music, songs, audio - ALL IN ONE
✨ High quality audio: 24kHz & 48kHz sampling rates
✨ Long-Form Generation: enables extended audio creation
✨ Efficient Fine-Tuning: precision (BF16, FP16, FP32) with user-friendly scripts
  • 1 reply
Β·
reacted to AdinaY's post with πŸ‘ 28 days ago
view post
Post
1171
Zhipu AI, the Chinese generative AI startup behind CogVideo, just launched their first productized AI Agent - AutoGLM πŸ”₯
πŸ‘‰ https://agent.aminer.cn

With simple text or voice commands, it:
✨ Simulates phone operations effortlessly
✨ Autonomously handles 50+ step tasks
✨ Seamlessly operates across apps

Powered by Zhipu's "Decoupled Interface" and "Self-Evolving Learning Framework" to achieve major performance gains in Phone Use and Web Browser Use!

Meanwhile, GLM4-Edge is now on Hugging Face hubπŸš€
πŸ‘‰ THUDM/glm-edge-6743283c5809de4a7b9e0b8b
Packed with advanced dialogue + multimodal models:
πŸ“± 1.5B / 2B models: Built for mobile & in-car systems
πŸ’» 4B / 5B models: Optimized for PCs
reacted to AdinaY's post with ❀️ about 1 month ago
view post
Post
3242
What happened yesterday in the Chinese AI community? πŸš€

T2A-01-HD πŸ‘‰ https://hailuo.ai/audio
MiniMax's Text-to-Audio model, now in Hailuo AI, offers 300+ voices in 17+ languages and instant emotional voice cloning.

Tare πŸ‘‰ https://www.trae.ai/
A new coding tool by Bytedance for professional developers, supporting English & Chinese with free access to Claude 3.5 and GPT-4 for a limited time.

DeepSeek-R1 Series πŸ‘‰ deepseek-ai/deepseek-r1-678e1e131c0169c0bc89728d
Open-source reasoning models with MIT license by DeepSeek.

Kimi K 1.5 πŸ‘‰ https://github.com/MoonshotAI/Kimi-k1.5 | https://kimi.ai/
An O1-level multi-modal model by MoonShot AI, utilizing reinforcement learning with long and short-chain-of-thought and supporting up to 128k tokens.

And today…

Hunyuan 3D-2.0 πŸ‘‰ tencent/Hunyuan3D-2
A SoTA 3D synthesis system for high-res textured assets by Tencent Hunyuan , with open weights and code!

Stay tuned for more updates πŸ‘‰ https://huggingface.co./zh-ai-community