Update README.md
README.md

```diff
@@ -30,6 +30,9 @@ LLaMA-Omni is a speech-language model built upon Llama-3.1-8B-Instruct. It suppo
 
 ♻️ **Trained in less than 3 days using just 4 GPUs.**
 
+
+<video controls autoplay src="https://cdn-uploads.huggingface.co/production/uploads/65b7573482d384513443875e/dr4XWUxzuVQ52lBuzNBTt.mp4"></video>
+
 ## Install
 
 1. Clone this repository.
@@ -99,6 +102,8 @@ python -m omni_speech.serve.model_worker --host 0.0.0.0 --controller http://loca
 
 4. Visit [http://localhost:8000/](http://localhost:8000/) and interact with LLaMA-3.1-8B-Omni!
 
+**Note: Due to the instability of streaming audio playback in Gradio, we have only implemented streaming audio synthesis without enabling autoplay. If you have a good solution, feel free to submit a PR. Thanks!**
+
 ## Local Inference
 
 To run inference locally, please organize the speech instruction files according to the format in the `omni_speech/infer/examples` directory, then refer to the following script.
```
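The Gradio note concerns streaming synthesis: audio is produced and sent chunk by chunk rather than in one pass, so playback can begin before the full utterance is ready. A minimal sketch of that chunked-synthesis pattern — all names are hypothetical and a placeholder sine wave stands in for the actual model output; this is not the repository's API:

```python
import math
import struct

def synthesize_chunks(text, sample_rate=16000, chunk_ms=200):
    """Hypothetical stand-in for streaming speech synthesis: yield the
    waveform in small chunks as they become available instead of
    returning one complete buffer at the end."""
    duration_s = 0.05 * len(text)  # fake duration: 50 ms of audio per character
    total_samples = int(duration_s * sample_rate)
    chunk_samples = int(sample_rate * chunk_ms / 1000)
    for start in range(0, total_samples, chunk_samples):
        n = min(chunk_samples, total_samples - start)
        # Placeholder sine wave instead of real model output.
        samples = [
            math.sin(2 * math.pi * 220 * (start + i) / sample_rate)
            for i in range(n)
        ]
        yield struct.pack(f"<{n}f", *samples)  # little-endian float32 PCM chunk

chunks = list(synthesize_chunks("Hello"))
```

In a Gradio UI, a generator like this would typically feed a streaming audio output component (`gr.Audio(streaming=True, autoplay=False)`), which matches the behavior described in the note: chunks are synthesized and delivered incrementally, but playback is left for the user to start.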