Celina's picture

Celina

celinah

AI & ML interests

inference, on-device and image generation

Recent Activity

Organizations

Hugging Face's profile picture Hugging Face OSS Metrics's profile picture Blog-explorers's profile picture Hugging Face for Computer Vision's profile picture MLX Community's profile picture Social Post Explorers's profile picture open/ acc's profile picture DDUF's profile picture

celinah's activity

upvoted an article 7 days ago
upvoted an article 12 days ago
view article
Article

Welcome to Inference Providers on the Hub šŸ”„

ā€¢ 293
updated a model 17 days ago
published a model 17 days ago
New activity in Salesforce/SFR-Embedding-Code-2B_R 20 days ago

troubleshooting a bug

#6 opened 20 days ago by
celinah
upvoted an article 23 days ago
view article
Article

MiniMax-01 is Now Open-Source: Scaling Lightning Attention for the AI Agent Era

By MiniMax-AI ā€¢
ā€¢ 40
reacted to AdinaY's post with šŸ”„ 26 days ago
view post
Post
3186
MiniCPM-o2.6 šŸ”„ an end-side multimodal LLMs released by OpenBMB from the Chinese community
Model: openbmb/MiniCPM-o-2_6
āœØ Real-time English/Chinese conversation, emotion control and ASR/STT
āœØ Real-time video/audio understanding
āœØ Processes up to 1.8M pixels, leads OCRBench & supports 30+ languages
upvoted an article 27 days ago
view article
Article

Mastering Tensor Dimensions in Transformers

By not-lain ā€¢
ā€¢ 42
reacted to merve's post with ā¤ļø 30 days ago
view post
Post
3632
What a beginning to this year in open ML šŸ¤ 
Let's unwrap! merve/jan-10-releases-677fe34177759de0edfc9714

Multimodal šŸ–¼ļø
> ByteDance released SA2VA: a family of vision LMs that can take image, video, text and visual prompts
> moondream2 is out with new capabilities like outputting structured data and gaze detection!
> Dataset: Alibaba DAMO lab released multimodal textbook ā€” 22k hours worth of samples from instruction videos šŸ¤Æ
> Dataset: SciCap captioning on scientific documents benchmark dataset is released along with the challenge!

LLMs šŸ’¬
> Microsoft released Phi-4, sota open-source 14B language model šŸ”„
> Dolphin is back with Dolphin 3.0 Llama 3.1 8B šŸ¬šŸ¬
> Prime-RL released Eurus-2-7B-PRIME a new language model trained using PRIME alignment
> SmallThinker-3B is a new small reasoning LM based on Owen2.5-3B-Instruct šŸ’­
> Dataset: QWQ-LONGCOT-500K is the dataset used to train SmallThinker, generated using QwQ-32B-preview šŸ“•
> Dataset: @cfahlgren1 released React Code Instructions: a dataset of code instruction-code pairs šŸ“•
> Dataset: Qwen team is on the roll, they just released CodeElo, a dataset of code preferences šŸ‘©šŸ»ā€šŸ’»

Embeddings šŸ”–
> @MoritzLaurer released zero-shot version of ModernBERT large šŸ‘
> KaLM is a new family of performant multilingual embedding models with MIT license built using Qwen2-0.5B

Image/Video Generation āÆļø
> NVIDIA released Cosmos, a new family of diffusion/autoregressive World Foundation Models generating worlds from images, videos and texts šŸ”„
> Adobe released TransPixar: a new text-to-video model that can generate assets with transparent backgrounds (a first!)
> Dataset: fal released cosmos-openvid-1m Cosmos-tokenized OpenVid-1M with samples from OpenVid-1M

Others
> Prior Labs released TabPFNv2, the best tabular transformer is out for classification and regression
> Metagene-1 is a new RNA language model that can be used for pathogen detection, zero-shot embedding and genome understanding
New activity in celinah/openai_records_d8a1e2c4 30 days ago

Add first file

#1 opened 30 days ago by
celinah