Multimodal Latent Language Modeling with Next-Token Diffusion Paper • 2412.08635 • Published Dec 11, 2024 • 44
On Domain-Specific Post-Training for Multimodal Large Language Models Paper • 2411.19930 • Published Nov 29, 2024 • 25
E5-V: Universal Embeddings with Multimodal Large Language Models Paper • 2407.12580 • Published Jul 17, 2024 • 40
Instruction Pre-Training: Language Models are Supervised Multitask Learners Paper • 2406.14491 • Published Jun 20, 2024 • 87
MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning Paper • 2405.12130 • Published May 20, 2024 • 47
Retentive Network: A Successor to Transformer for Large Language Models Paper • 2307.08621 • Published Jul 17, 2023 • 171
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits Paper • 2402.17764 • Published Feb 27, 2024 • 608