Will Held PRO

WillHeld

https://williamheld.com

AI & ML interests

Machine Learning and Natural Language Processing for low-resource languages and language variants

Recent Activity

upvoted an article 12 days ago

Optimizing Pretraining Data Mixes with LLM-Estimated Utility

updated a model about 2 months ago

WillHeld/DiVA-llama-3-v0-8b

updated a Space about 2 months ago

WillHeld/diva-audio-chat

Articles

Optimizing Pretraining Data Mixes with LLM-Estimated Utility

[Talk Arena](https://talkarena.org)

Organizations

WillHeld's activity

upvoted an article 12 days ago

Article

Optimizing Pretraining Data Mixes with LLM-Estimated Utility

12 days ago • 3

updated a model about 2 months ago

WillHeld/DiVA-llama-3-v0-8b

Feature Extraction • Updated Dec 19, 2024 • 603 • 30

updated 2 Spaces about 2 months ago

Running on Zero

Diva Realtime Chat

Running on Zero

Diva Audio

updated a model 3 months ago

WillHeld/debug_llama

Text Generation • Updated Nov 11, 2024 • 137

New activity in allenai/dolma 3 months ago

Llama v.s. OLMo token counts

#43 opened 6 months ago by

updated 4 models 3 months ago

WillHeld/DiVA-llama-3.2-1b

Updated Nov 4, 2024 • 55

WillHeld/DiVA-llama-3.2-3B

Updated Oct 31, 2024 • 15

updated a Space 3 months ago

Running on Zero

Diva Realtime Chat

updated a model 3 months ago

WillHeld/DiVA-llama-3-v0-8b

Feature Extraction • Updated Dec 19, 2024 • 603 • 30

New activity in zero-gpu-explorers/README 4 months ago

RuntimeError: No CUDA GPUs are available

#126 opened 4 months ago by

`.then` does not work inside of Zero-GPU

#124 opened 4 months ago by

New activity in openai/whisper-large-v3-turbo 4 months ago

Make Model Name Consistent with other Whisper Models

#23 opened 4 months ago by

New activity in zero-gpu-explorers/README 4 months ago

⚡ ZeroGPU: New version rolled out! (sept 2024)

#107 opened 5 months ago by

New activity in librarian-bot/dataset_abstracts 4 months ago

Testing Tagging Feature

#2 opened 4 months ago by

upvoted 2 articles 4 months ago

Article

Welcome, Gradio 5

Oct 9, 2024

• 111

Article

AI Apps in a Flash with Gradio's Reload Mode

Apr 16, 2024

• 25

updated a model 4 months ago

WillHeld/DiVA-llama-3-distill-only-8b

Updated Oct 8, 2024 • 4