hyx_194
commited on
Commit
·
de49693
1
Parent(s):
9feb397
resize image
Browse files
README.md
CHANGED
@@ -28,7 +28,7 @@ The architecture comprises three key components: an audio encoder that transform
|
|
28 |
|
29 |
Specifically, we fine-tuned the MERaLiON-Whisper encoder from Whisper-large-v2 for the audio encoder and used SEA-LION V3, a localised LLM developed by our partner AI Singapore as the text decoder.
|
30 |
|
31 |
-
<img src="model_architecture.png" alt="model_architecture" width="
|
32 |
|
33 |
## Capabilities
|
34 |
|
|
|
28 |
|
29 |
Specifically, we fine-tuned the MERaLiON-Whisper encoder from Whisper-large-v2 for the audio encoder and used SEA-LION V3, a localised LLM developed by our partner AI Singapore as the text decoder.
|
30 |
|
31 |
+
<img src="model_architecture.png" alt="model_architecture" width="400" style="margin-left:'auto' margin-right:'auto' display:'block'"/>
|
32 |
|
33 |
## Capabilities
|
34 |
|