markoarnauto commited on
Commit
4d7e14e
·
verified ·
1 Parent(s): a9f1c05

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +1 -0
README.md CHANGED
@@ -57,3 +57,4 @@ curl http://localhost:8000/v1/completions -H "Content-Type: application/json
57
  "prompt": "San Francisco is a"
58
  } '
59
  ```
 
 
57
  "prompt": "San Francisco is a"
58
  } '
59
  ```
60
+ ⚡ This model is optimized to handle heavy workloads providing a total throughput of ️**4623 tokens per second** using one NVIDIA L40S ⚡