Appreciate the model drop!

#6
by Nitral-AI - opened

But why is it only 4k? It's 2025, man; those are rookie numbers.

Language Technologies Unit @ Barcelona Supercomputing Center org
edited Jan 23

We understand the demand for longer context windows, and our roadmap includes multiple possible approaches to increasing it. Extending the context length involves trade-offs in training efficiency, memory usage, and model performance, and we are working on how to do it as efficiently as possible.
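(For readers curious what such an extension can look like: one widely used approach, not necessarily the one the BSC team will adopt, is linear positional interpolation of the RoPE angles, which squeezes longer sequences into the position range the model was trained on. A minimal sketch; `rope_angles` and its parameters are hypothetical illustration, not part of any Salamandra code.)

```python
import math

def rope_angles(position, dim=8, base=10000.0, scale=1.0):
    """Rotary-embedding angles for a single position.

    `scale` > 1 applies linear position interpolation: positions are
    divided by `scale`, so a 4x-longer sequence reuses the angle range
    the model saw during training."""
    pos = position / scale
    return [pos / (base ** (2 * i / dim)) for i in range(dim // 2)]

# Trained context 4096; to serve 16384 tokens, interpolate by 4x.
# Position 16384 with scale=4 then lands on the same angles as
# position 4096 did during training:
orig = rope_angles(4096)
interp = rope_angles(16384, scale=4.0)
assert all(math.isclose(a, b) for a, b in zip(orig, interp))
```

In practice this is usually paired with a short fine-tuning run on long sequences, since the interpolated positions are denser than anything the model saw at pre-training time.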

If you need a model with a longer context right now, consider using our instruction-tuned Salamandra-7b; it might be more suitable for you.

mapama247 changed discussion status to closed
