Yumin Kim

YuminKim

Yu-billie

AI & ML interests

NLG, Data Augmentation with LLMs

Recent Activity

upvoted a paper 3 days ago

Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Thought

upvoted a paper 3 days ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

upvoted a paper 3 days ago

GeAR: Generation Augmented Retrieval

YuminKim's activity

upvoted 9 papers 3 days ago

Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Thought

Paper • 2501.04682 • Published 3 days ago • 64

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published 4 days ago • 177

GeAR: Generation Augmented Retrieval

Paper • 2501.02772 • Published 6 days ago • 16

Auto-RT: Automatic Jailbreak Strategy Exploration for Red-Teaming Large Language Models

Paper • 2501.01830 • Published 9 days ago • 14

Personalized Graph-Based Retrieval for Large Language Models

Paper • 2501.02157 • Published 8 days ago • 24

BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning

Paper • 2501.03226 • Published 5 days ago • 33

liked a dataset 13 days ago

toxigen/toxigen-data

Viewer • Updated Jun 17, 2024 • 319k • 2.9k • 49

updated a dataset 17 days ago

YuminKim/KoCoSa

Viewer • Updated 17 days ago • 12.8k • 24

upvoted a paper 4 months ago

The Mamba in the Llama: Distilling and Accelerating Hybrid Models

Paper • 2408.15237 • Published Aug 27, 2024 • 38

upvoted 8 papers 5 months ago

Training Language Models on the Knowledge Graph: Insights on Hallucinations and Their Detectability

Paper • 2408.07852 • Published Aug 14, 2024 • 16

The ShareLM Collection and Plugin: Contributing Human-Model Chats for the Benefit of the Community

Paper • 2408.08291 • Published Aug 15, 2024 • 11

InfinityMATH: A Scalable Instruction Tuning Dataset in Programmatic Mathematical Reasoning

Paper • 2408.07089 • Published Aug 9, 2024 • 14

Adapting General Disentanglement-Based Speaker Anonymization for Enhanced Emotion Preservation

Paper • 2408.05928 • Published Aug 12, 2024 • 6

Design Proteins Using Large Language Models: Enhancements and Comparative Analyses

Paper • 2408.06396 • Published Aug 12, 2024 • 8

FuxiTranyu: A Multilingual Large Language Model Trained with Balanced Data

Paper • 2408.06273 • Published Aug 12, 2024 • 10

MovieSum: An Abstractive Summarization Dataset for Movie Screenplays

Paper • 2408.06281 • Published Aug 12, 2024 • 9

OpenResearcher: Unleashing AI for Accelerated Scientific Research

Paper • 2408.06941 • Published Aug 13, 2024 • 31