JanPf commited on
Commit
25ee648
1 Parent(s): f486b26

typo, urls and congrats on the strong model!

Browse files
Files changed (1) hide show
  1. README.md +5 -5
README.md CHANGED
@@ -52,7 +52,7 @@ Key improvements over Gemma-2-2B baseline:
52
  - ARC-DE: +41% (32.3% vs 22.9%)
53
  - Average zero-shot: +40% (35.8% vs 25.5%)
54
 
55
- → BübleLM-2B onsistently outperforms both the base Gemma-2-2B and other German models like LLaMmlein-1B across most tasks.
56
 
57
  <table class="model-comparison">
58
  <thead>
@@ -75,7 +75,7 @@ Key improvements over Gemma-2-2B baseline:
75
  </thead>
76
  <tbody>
77
  <tr>
78
- <td>Gemma-2-2B</td>
79
  <td align="center">22.9</td>
80
  <td align="center">23.1</td>
81
  <td align="center">28.0</td>
@@ -84,7 +84,7 @@ Key improvements over Gemma-2-2B baseline:
84
  <td align="center">25.5</td>
85
  </tr>
86
  <tr>
87
- <td>LLaMmlein-120M</td>
88
  <td align="center">24.7 ↑+8%</td>
89
  <td align="center">-</td>
90
  <td align="center">32.0 ↑+14%</td>
@@ -93,7 +93,7 @@ Key improvements over Gemma-2-2B baseline:
93
  <td align="center">27.2 ↑+7%</td>
94
  </tr>
95
  <tr>
96
- <td>LLaMmlein-1B</td>
97
  <td align="center">30.0 ↑+31%</td>
98
  <td align="center">-</td>
99
  <td align="center"><strong>48.5</strong> ↑+73%</td>
@@ -102,7 +102,7 @@ Key improvements over Gemma-2-2B baseline:
102
  <td align="center">34.0 ↑+33%</td>
103
  </tr>
104
  <tr>
105
- <td>Sauerkraut-Gemma-2B</td>
106
  <td align="center">28.0 ↑+22%</td>
107
  <td align="center">34.6 ↑+50%</td>
108
  <td align="center">37.2 ↑+33%</td>
 
52
  - ARC-DE: +41% (32.3% vs 22.9%)
53
  - Average zero-shot: +40% (35.8% vs 25.5%)
54
 
55
+ → BübleLM-2B consistently outperforms both the base Gemma-2-2B and other German models like LLäMmlein-1B across most tasks.
56
 
57
  <table class="model-comparison">
58
  <thead>
 
75
  </thead>
76
  <tbody>
77
  <tr>
78
+ <td><a href="https://huggingface.co/google/gemma-2-2b" target="_blank">Gemma-2-2B</a></td>
79
  <td align="center">22.9</td>
80
  <td align="center">23.1</td>
81
  <td align="center">28.0</td>
 
84
  <td align="center">25.5</td>
85
  </tr>
86
  <tr>
87
+ <td><a href="https://huggingface.co/LSX-UniWue/LLaMmlein_120M" target="_blank">LLäMmlein-120M</a></td>
88
  <td align="center">24.7 ↑+8%</td>
89
  <td align="center">-</td>
90
  <td align="center">32.0 ↑+14%</td>
 
93
  <td align="center">27.2 ↑+7%</td>
94
  </tr>
95
  <tr>
96
+ <td><a href="https://huggingface.co/LSX-UniWue/LLaMmlein_1B" target="_blank">LLäMmlein-1B</a></td>
97
  <td align="center">30.0 ↑+31%</td>
98
  <td align="center">-</td>
99
  <td align="center"><strong>48.5</strong> ↑+73%</td>
 
102
  <td align="center">34.0 ↑+33%</td>
103
  </tr>
104
  <tr>
105
+ <td><a href="https://huggingface.co/VAGOsolutions/SauerkrautLM-Gemma-2b" target="_blank">Sauerkraut-Gemma-2B</a></td>
106
  <td align="center">28.0 ↑+22%</td>
107
  <td align="center">34.6 ↑+50%</td>
108
  <td align="center">37.2 ↑+33%</td>