wxgeorge commited on
Commit
bdde565
1 Parent(s): 5e8c4b7

:pencil2: updating README

Browse files
Files changed (1) hide show
  1. README.md +12 -3
README.md CHANGED
@@ -1976,9 +1976,18 @@ models:
1976
  - mattshumer/Reflection-Llama-3.1-70B
1977
  ---
1978
 
1979
- Most models aren't useable directly on HuggingFace. Even "small" models (like llama3-8bs) require expensive enough hardware that they aren't stood up to operate for free.
 
1980
 
1981
- At featherless, we've taken a different approach, and are able to serve models "serverlessly" at an entirely different scale.
 
1982
 
1983
- This space allows you to test any 15B or smaller model directly within HF, but we have bigger models too. Check out https://featherless.ai to see the full range of supported models.
 
1984
 
 
 
 
 
 
 
 
1976
  - mattshumer/Reflection-Llama-3.1-70B
1977
  ---
1978
 
1979
+ # Overview
1980
+ Featherless is a serverless platform designed to make it 100x easier to experiment with LLMs. This space brings the magic of featherless as close as possible to the models on Hugging Face.
1981
 
1982
+ # The Problem with LLM Experimentation
1983
+ Most models aren't useable on huggingface.co. Even "small" models (like llama3-8bs) require expensive enough hardware that they aren't stood up to operate for free, and for users to experiment with them requires allocating GPUs. HF's inference API is the simplest way to do it, but you'd be hard pressed to experiment with 10 models in a day with this approach.
1984
 
1985
+ # Enter Featherless
1986
+ Our goal is to make all models on Hugging Face available serverlessly and enable a new kind of experimentation. With over 2,200 supported models available today, we're well on our way. Check out [featherless.ai](https://featherless.ai) to see the full range of supported models.
1987
 
1988
+ # Why a Hugging Face Space?
1989
+ This space is intended to bring some of the magic of Featherless as close to supported model cards as possible. It currently lets you run inference of all the <=15B models supported on Featherless. (subject to a concurrency limit for the API token set in the secrets of this space)
1990
+
1991
+ You're of course welcome to clone this space but know that it's stock gradio with a call to the featherless API (i.e. `/chat/completions`) through the [openai python package](https://pypi.org/project/openai/) (like many inference providers, the Featherless API is OpenAI compatible) so you'll need a Featherless API key for it to work for you, which you can get at [featherless.ai](https://featherless.ai).
1992
+
1993
+ Thanks for stopping by! Feedback welcome in the community section of the space, by email to [email protected], or on [our discord](https://featherless.ai/discord)