Layer-Condensed KV Cache for Efficient Inference of Large Language Models — arXiv:2405.10637, published May 17, 2024