Alistarh's picture

7 1

Alistarh

d-alistarh

·

AI & ML interests

NLP

Recent Activity

authored a paper about 2 months ago

Model compression via distillation and quantization

authored a paper about 2 months ago

Sparse Finetuning for Inference Acceleration of Large Language Models

authored a paper about 2 months ago

Towards End-to-end 4-Bit Inference on Generative Large Language Models

View all activity

Organizations

Papers 25

arxiv:2407.10994

arxiv:2405.15593

arxiv:2405.14852

arxiv:2405.03594

models

None public yet

datasets

None public yet