Kalle Hilsenbek

Bachstelze

AI & ML interests

Combining BERT with instructions for explainable AI: gitlab.com/Bachstelze/instructionbert

Recent Activity

Organizations

None yet

Bachstelze's activity

commented on Announcing AI Energy Score Ratings about 2 months ago
view reply

Thanks for your effort in energy efficiency. You worked up my curiosity!
Why do smolLM-135m and smolLm-1.7B nearly have the same score besides a 10 times model size difference? Does the identical context size mostly cause it?
Could you please enable encoder-decoder models? They should be in theory more efficient because the input has to be encoded only once and can be reused in every decoding step.

upvoted an article about 2 months ago
view article
Article

Is Attention Interpretable in Transformer-Based Large Language Models? Let’s Unpack the Hype

4
New activity in answerdotai/ModernBERT-base 2 months ago

ModernBART wen?

6
#38 opened 3 months ago by
Fizzarolli
New activity in Nart/monolingual_ab 3 months ago

Goldfish model

#5 opened 3 months ago by
Bachstelze
New activity in HuggingFaceTB/SmolLM2-360M-Instruct 4 months ago
New activity in HuggingFaceTB/SmolLM-135M 5 months ago

Benchmark results

#17 opened 5 months ago by
Bachstelze