Update README.md
README.md CHANGED
@@ -47,14 +47,12 @@ Data sampling weights:

 ## Performance

-[INSERT FIGURE: Performance comparison across models]
-
 Key improvements over Gemma-2B baseline:
 - HellaSwag-DE: +71% (47.9% vs 28.0%)
 - ARC-DE: +41% (32.3% vs 22.9%)
 - Average zero-shot: +40% (35.8% vs 25.5%)

-
+BübleLM-2B consistently outperforms both the base Gemma-2B and other German models like LLaMmlein-1B across most tasks.

 <table class="model-comparison">
 <thead>