Artidoro Pagnoni

artidoro

https://artidoro.github.io/

AI & ML interests

NLP, generation, factuality, disinformation.

Articles

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

May 24, 2023 • 110

artidoro's activity

commented on a paper about 1 month ago

Byte Latent Transformer: Patches Scale Better Than Tokens

Paper • 2412.09871 • Published Dec 13, 2024 • 89

authored 2 papers about 1 month ago

Socratic Pretraining: Question-Driven Pretraining for Controllable Summarization

Paper • 2212.10449 • Published Dec 20, 2022

Byte Latent Transformer: Patches Scale Better Than Tokens

Paper • 2412.09871 • Published Dec 13, 2024 • 89

upvoted a paper about 1 month ago

Byte Latent Transformer: Patches Scale Better Than Tokens

Paper • 2412.09871 • Published Dec 13, 2024 • 89

updated 3 models about 1 year ago

liked a model over 1 year ago

meta-llama/Llama-2-7b

Text Generation • Updated Apr 17, 2024 • 4.23k

New activity in open-llm-leaderboard/open_llm_leaderboard over 1 year ago

Error trying to submit LLaMA 2 base model

#131 opened over 1 year ago by artidoro

updated a model over 1 year ago

uwnlp/llama-2-70b-qlora-openorca

Updated Jul 26, 2023 • 9

liked 2 models over 1 year ago

TheBloke/llama-2-70b-Guanaco-QLoRA-fp16

Text Classification • Updated Aug 8, 2023 • 832 • 56

timdettmers/guanaco-33b

Updated Jun 13, 2023 • 27

New activity in uwnlp/guanaco-playground-tgi over 1 year ago

did it stop working?

#6 opened over 1 year ago by maschenk

How to fine-tune the Guanaco (7B, 13B) model?

#5 opened over 1 year ago by mvermand

liked a dataset over 1 year ago

timdettmers/openassistant-guanaco

Viewer • Updated May 27, 2023 • 10.4k • 6.27k • 422

New activity in uwnlp/guanaco-playground-tgi over 1 year ago

Guanaco-13B

#3 opened over 1 year ago by dmeight

liked a Space over 1 year ago

📊 Guanaco Playground Tgi

Running • 507

liked a model over 1 year ago

timdettmers/guanaco-65b

Updated Jul 13, 2023 • 86

authored a paper over 1 year ago

QLoRA: Efficient Finetuning of Quantized LLMs

Paper • 2305.14314 • Published May 23, 2023 • 48

liked a model over 2 years ago

Salesforce/mixqg-large

Text2Text Generation • Updated 12 days ago • 91 • 6