Any plans for YaRN CodeLlama?
#6 · opened by viktor-ferenczi
CodeLlama 13B or 34B YaRN 128k would be a very potent model.
I mean CodeLlama is already 100k, not sure why you need that extra 28k.
I've tested Code Llama up to 16k with vLLM. Which LLM engine can I use to run it at 100k?
Most LLM inference tools/applications support RoPE (which is what CodeLlama uses). It's just that long contexts use way too much RAM/VRAM (so does YaRN). I'm not sure if you can run it.
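To give a sense of why 100k context is so memory-hungry, here is a rough KV-cache size estimate. This is a sketch, not any engine's actual accounting (vLLM's paged allocator adds overhead on top); the model dimensions below are my assumption of the CodeLlama-13B shape (40 layers, 40 heads, head dim 128, fp16), and ignore weights and activations entirely.

```python
def kv_cache_bytes(num_layers, num_kv_heads, head_dim, seq_len, dtype_bytes=2):
    """Approximate KV-cache size: K and V each store
    num_kv_heads * head_dim values per layer per token."""
    return 2 * num_layers * num_kv_heads * head_dim * seq_len * dtype_bytes

# Assumed CodeLlama-13B dims: 40 layers, 40 KV heads, head_dim 128, fp16 (2 bytes)
size = kv_cache_bytes(40, 40, 128, 100_000)
print(f"{size / 2**30:.1f} GiB")  # -> 76.3 GiB for the KV cache alone
```

At ~76 GiB for the cache alone (before the ~26 GB of fp16 weights), a single-GPU 100k-token run is out of reach without quantization, grouped-query attention, or multi-GPU sharding.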