evaluation results
Browse files
README.md
CHANGED
@@ -205,7 +205,7 @@ Granite-3.1-8B-Base is based on a decoder-only dense transformer architecture. C
|
|
205 |
</tr>
|
206 |
<tr>
|
207 |
<td style="text-align:left; background-color: #FFFFFF; color: black;">Number of attention heads</td>
|
208 |
-
<td style="text-align:center; background-color: #FFFFFF; color: black;"
|
209 |
<td style="text-align:center; background-color: #DAE8FF; color: black;">32</td>
|
210 |
<td style="text-align:center; background-color: #FFFFFF; color: black;">16</td>
|
211 |
<td style="text-align:center; background-color: #FFFFFF; color: black;">24</td>
|
|
|
205 |
</tr>
|
206 |
<tr>
|
207 |
<td style="text-align:left; background-color: #FFFFFF; color: black;">Number of attention heads</td>
|
208 |
+
<td style="text-align:center; background-color: #FFFFFF; color: black;">32</td>
|
209 |
<td style="text-align:center; background-color: #DAE8FF; color: black;">32</td>
|
210 |
<td style="text-align:center; background-color: #FFFFFF; color: black;">16</td>
|
211 |
<td style="text-align:center; background-color: #FFFFFF; color: black;">24</td>
|