wxgeorge commited on
Commit
3793467
1 Parent(s): 6c352bd

:wrench: put README content in the right place for easier recreation.

Browse files
Files changed (1) hide show
  1. readme-generator/body.md +13 -3
readme-generator/body.md CHANGED
@@ -1,5 +1,15 @@
1
- Most models aren't useable directly on HuggingFace. Even "small" models (like llama3-8bs) require expensive enough hardware that they aren't stood up to operate for free.
 
2
 
3
- At featherless, we've taken a different approach, and are able to serve models "serverlessly" at an entirely different scale.
 
4
 
5
- This space allows you to test any 15B or smaller model directly within HF, but we have bigger models too. Check out https://featherless.ai to see the full range of supported models.
 
 
 
 
 
 
 
 
 
1
+ # Overview
2
+ Featherless is a serverless platform designed to make it 100x easier to experiment with LLMs. This space brings the magic of featherless as close as possible to the models on Hugging Face.
3
 
4
+ # The Problem with LLM Experimentation
5
+ Most models aren't useable on huggingface.co. Even "small" models (like llama3-8bs) require expensive enough hardware that they aren't stood up to operate for free, and for users to experiment with them requires allocating GPUs. HF's inference API is the simplest way to do it, but you'd be hard pressed to experiment with 10 models in a day with this approach.
6
 
7
+ # Enter Featherless
8
+ Our goal is to make all models on Hugging Face available serverlessly and enable a new kind of experimentation. With over 2,200 supported models available today, we're well on our way. Check out [featherless.ai](https://featherless.ai) to see the full range of supported models.
9
+
10
+ # Why a Hugging Face Space?
11
+ This space is intended to bring some of the magic of Featherless as close to supported model cards as possible. It currently lets you run inference of all the <=15B models supported on Featherless. (subject to a concurrency limit for the API token set in the secrets of this space)
12
+
13
+ You're of course welcome to clone this space but know that it's stock gradio with a call to the featherless API (i.e. `/chat/completions`) through the [openai python package](https://pypi.org/project/openai/) (like many inference providers, the Featherless API is OpenAI compatible) so you'll need a Featherless API key for it to work for you, which you can get at [featherless.ai](https://featherless.ai).
14
+
15
+ Thanks for stopping by! Feedback welcome in the community section of the space, by email to [email protected], or on [our discord](https://featherless.ai/discord)