Johnson Thomas's picture

Johnson Thomas PRO

Johnyquest7

AI & ML interests

Medical

Recent Activity

updated a Space 5 days ago
OnDeviceMedNotes/README
updated a dataset 30 days ago
Johnyquest7/medtran
published a dataset 30 days ago
Johnyquest7/medtran
View all activity

Organizations

On Device Medical Notes's profile picture

Johnyquest7's activity

updated a Space 5 days ago
updated a collection about 1 month ago
upvoted an article 6 months ago
view article
Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

β€’ 225
reacted to merve's post with πŸ”₯ 6 months ago
view post
Post
5622
I have put together a notebook on Multimodal RAG, where we do not process the documents with hefty pipelines but natively use:
- vidore/colpali for retrieval πŸ“– it doesn't need indexing with image-text pairs but just images!
- Qwen/Qwen2-VL-2B-Instruct for generation πŸ’¬ directly feed images as is to a vision language model with no processing to text!
I used ColPali implementation of the new 🐭 Byaldi library by @bclavie πŸ€—
https://github.com/answerdotai/byaldi
Link to notebook: https://github.com/merveenoyan/smol-vision/blob/main/ColPali_%2B_Qwen2_VL.ipynb