Commit History

Fix for RuntimeError: FlashAttention only support fp16 and bf16 data type during fine tuning.
5b7216f
verified

moidhassan commited on

Resolve - 196 [rank0]: triton.runtime.autotuner.OutOfResources: out of resource: shared memory, Required: 180224, Hardware limit: 101376. Reducing block sizes or `num_stages` may help.
794ffcf
verified

moidhassan commited on

Move flash_attn assert from __init__ into calling func (#32)
ad85cab
verified

nguyenbh rogerxfeng8 commited on

Update README.md
97bc412
verified

nguyenbh commited on

Add link to Phi-3 vision ONNX models
45ff32d
verified

kvaishnavi commited on

Update README.md
985e27b
verified

nguyenbh commited on

Update README
2fe15b2
verified

nguyenbh commited on

Initial commit
7a8ccc0
unverified

ammarawan commited on