Yi Cui

onekq

https://onekq.ai

AI & ML interests

Benchmark, Code Generation Model

Recent Activity

posted an update about 12 hours ago

In my experience, these are the pain points for reasoning model adoption.

(1) They are expensive and, even worse, slow, because of excessive token output. You need to raise your max output length roughly 10x to avoid clipping the thinking process.

(2) You have to filter out the thinking tokens to retrieve the final output. For mature workflows, this means broad or deep refactoring.

First-party (1p) vendors, both open-source and proprietary, ease these pain points by manipulating their own models. But the problems are exposed when the reasoning model is hosted by third-party (3p) MaaS providers.
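To make the two pain points concrete, here is a minimal sketch of what a workflow has to add when a 3p provider returns the raw stream. It assumes the model wraps its reasoning in `<think>...</think>` tags, as R1-style models commonly do; the names `split_thinking`, `ParsedOutput`, and `reasoning_max_tokens` are hypothetical helpers, not part of any provider's API, and the 10x factor is just the rule of thumb from the post.

```python
from typing import NamedTuple


class ParsedOutput(NamedTuple):
    thinking: str    # text found inside the thinking tags
    answer: str      # everything outside the thinking tags
    truncated: bool  # True if the thinking block was never closed


def reasoning_max_tokens(base_max_tokens: int, factor: int = 10) -> int:
    """Scale an existing output budget for a reasoning model.

    Assumption: the 10x default is the post's rule of thumb, not a measured value.
    """
    return base_max_tokens * factor


def split_thinking(
    raw: str,
    open_tag: str = "<think>",    # assumed tag format; check your model's template
    close_tag: str = "</think>",
) -> ParsedOutput:
    """Separate thinking tokens from the final answer in a raw completion."""
    start = raw.find(open_tag)
    if start == -1:
        # No thinking block at all: treat the whole text as the answer.
        return ParsedOutput("", raw.strip(), False)

    body_start = start + len(open_tag)
    end = raw.find(close_tag, body_start)
    if end == -1:
        # The output was clipped mid-thought, so there is no final answer.
        return ParsedOutput(raw[body_start:].strip(), "", True)

    thinking = raw[body_start:end].strip()
    answer = (raw[:start] + raw[end + len(close_tag):]).strip()
    return ParsedOutput(thinking, answer, False)


if __name__ == "__main__":
    parsed = split_thinking("<think>2 + 2 is 4</think>\nThe answer is 4.")
    print(parsed.answer)
    print(reasoning_max_tokens(1024))
```

The `truncated` flag is the part that ties the two pain points together: if the output budget is too small, the closing tag never arrives and there is nothing to return, so the caller can retry with a larger limit instead of passing half a thought downstream.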

updated a collection about 13 hours ago

R1 Reproduction Works

Organizations

Posts 22

Articles 2

Does Daily Software Engineering Work Need Reasoning Models?

Papers 3

arxiv:2409.13773

arxiv:2409.05177

arxiv:2408.00019

models

None public yet

datasets

None public yet