praveen M's picture
4 8

praveen M

penma
Β·

AI & ML interests

None yet

Recent Activity

liked a model 4 days ago
hooman650/MedQwen3B-Reasoner
upvoted an article 4 days ago
1 Billion Classifications
liked a Space 4 days ago
nanotron/ultrascale-playbook
View all activity

Organizations

MLX Community's profile picture

penma's activity

upvoted an article 4 days ago
view article
Article

1 Billion Classifications

β€’ 39
upvoted 2 articles 8 months ago
view article
Article

Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models

β€’ 188
reacted to qnguyen3's post with ❀️ 11 months ago
view post
Post
5589
πŸŽ‰ Introducing nanoLLaVA, a powerful multimodal AI model that packs the capabilities of a 1B parameter vision language model into just 5GB of VRAM. πŸš€ This makes it an ideal choice for edge devices, bringing cutting-edge visual understanding and generation to your devices like never before. πŸ“±πŸ’»

Model: qnguyen3/nanoLLaVA πŸ”
Spaces: qnguyen3/nanoLLaVA (thanks to @merve )

Under the hood, nanoLLaVA is based on the powerful vilm/Quyen-SE-v0.1 (my Qwen1.5-0.5B finetune) and Google's impressive google/siglip-so400m-patch14-384. 🧠 The model is trained using a data-centric approach to ensure optimal performance. πŸ“Š

In the spirit of transparency and collaboration, all code and model weights are open-sourced under the Apache 2.0 license. 🀝
  • 1 reply
Β·
upvoted an article 11 months ago
view article
Article

Synthetic data: save money, time and carbon with open source

β€’ 61