Kalle Hilsenbek

Bachstelze

AI & ML interests

Combining BERT with instructions for explainable AI: gitlab.com/Bachstelze/instructionbert

Recent Activity

Organizations

None yet

Bachstelze's activity

commented on Announcing AI Energy Score Ratings 17 days ago

Thanks for your work on energy efficiency. You piqued my curiosity!
Why do SmolLM-135M and SmolLM-1.7B have nearly the same score despite a more than tenfold difference in model size? Is the identical context size the main cause?
Could you please enable encoder-decoder models? In theory, they should be more efficient because the input has to be encoded only once and can be reused at every decoding step.

upvoted an article about 1 month ago

Is Attention Interpretable in Transformer-Based Large Language Models? Let’s Unpack the Hype

4
New activity in answerdotai/ModernBERT-base about 1 month ago

ModernBART wen?

6
#38 opened about 2 months ago by Fizzarolli
New activity in Nart/monolingual_ab 2 months ago

Goldfish model

#5 opened 2 months ago by Bachstelze
New activity in HuggingFaceTB/SmolLM2-360M-Instruct 3 months ago
New activity in HuggingFaceTB/SmolLM-135M 4 months ago

Benchmark results

#17 opened 4 months ago by Bachstelze