Ambarish Jash
ajash
AI & ML interests
NLP / LLM
Organizations
None yet
ajash's activity
Using the Accelerate API to train models on multiple GPUs
8
#28 opened over 1 year ago
by
ajash
Librarian Bot: Add base_model information to model
#1 opened over 1 year ago
by
librarian-bot
Installing ! pip install git+https://github.com/HazyResearch/flash-attention.git#subdirectory=csrc/rotary but flash_llama still erroring out
4
#25 opened over 1 year ago
by
ajash
Fine tune togethercomputer/LLaMA-2-7B-32K with LoRA
7
#22 opened over 1 year ago
by
ajash
Fix RuntimeError: pad attn scores back to original query sequence length, instead of unpadded sequence length (i.e. no change).
1
#17 opened over 1 year ago
by
Birchlabs