Xi's picture

Xi

xi0v

·

AI & ML interests

Diffusion Model Merging, LLM Merging, Model Editing and Vision/Multimodal Model Fine-tuning.

Recent Activity

upvoted a paper about 6 hours ago

Search-o1: Agentic Search-Enhanced Large Reasoning Models

liked a model about 13 hours ago

nejumi/phi-4-GPTQ-Int8-calib-ja-1k

upvoted a paper about 19 hours ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

View all activity

Organizations

xi0v's activity

upvoted a paper about 6 hours ago

Search-o1: Agentic Search-Enhanced Large Reasoning Models

Paper • 2501.05366 • Published 2 days ago • 44

upvoted a paper about 19 hours ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published 3 days ago • 176

upvoted a paper 2 days ago

DeMo: Decoupled Momentum Optimization

Paper • 2411.19870 • Published Nov 29, 2024 • 5

upvoted a paper 15 days ago

Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching

Paper • 2412.17153 • Published 20 days ago • 34

upvoted an article 18 days ago

Article

Deriving DPO's Loss

By

•

19 days ago

• 26

upvoted a paper 23 days ago

Autoregressive Video Generation without Vector Quantization

Paper • 2412.14169 • Published 24 days ago • 14

upvoted a collection 23 days ago

ModernBERT

Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated 23 days ago • 122

upvoted a paper about 1 month ago

ACDiT: Interpolating Autoregressive Conditional Modeling and Diffusion Transformer

Paper • 2412.07720 • Published Dec 10, 2024 • 30

upvoted a collection about 1 month ago

EXAONE-3.5

EXAONE 3.5 language model series including instruction-tuned models of 2.4B, 7.8B, and 32B. • 10 items • Updated Dec 10, 2024 • 87

upvoted a paper about 1 month ago

Evaluating Language Models as Synthetic Data Generators

Paper • 2412.03679 • Published Dec 4, 2024 • 46

upvoted 2 collections about 1 month ago

Toxic Commons

Tools for de-toxifying public domain data, especially multilingual and historical text data and data with OCR errors. • 3 items • Updated Oct 31, 2024 • 5

Common Models

The first generation of models pretrained on Common Corpus. • 5 items • Updated Dec 5, 2024 • 28

upvoted 2 articles about 1 month ago

Article

They Said It Couldn’t Be Done

By

•

Dec 5, 2024

• 76

Article

EuroLLM-9B

By

•

Dec 2, 2024

• 105

upvoted a paper about 1 month ago

Low-Bit Quantization Favors Undertrained LLMs: Scaling Laws for Quantized LLMs with 100T Training Tokens

Paper • 2411.17691 • Published Nov 26, 2024 • 11

upvoted a paper about 2 months ago

TÜLU 3: Pushing Frontiers in Open Language Model Post-Training

Paper • 2411.15124 • Published Nov 22, 2024 • 58

upvoted a collection about 2 months ago

ESFT

models for paper expert-specialized fine-tuning • 15 items • Updated Aug 16, 2024 • 5

upvoted a paper about 2 months ago

Natural Language Reinforcement Learning

Paper • 2411.14251 • Published Nov 21, 2024 • 28

upvoted a collection about 2 months ago

HyenaDNA Models

HyenaDNA models usable directly with Hugging Face classes like AutoModel. • 8 items • Updated Nov 14, 2023 • 16

upvoted a paper about 2 months ago

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Paper • 2411.10440 • Published Nov 15, 2024 • 113