Seed-Music: A Unified Framework for High Quality and Controlled Music Generation Paper • 2409.09214 • Published 6 days ago • 38
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery Paper • 2408.06292 • Published Aug 12 • 114
ViDoRe Captioning (baseline) Collection The original ViDoRe benchmark was passed to Unstructured to partition each page into chunks. Visual chunks are captioned using Claude Sonnet. • 13 items • Updated Jun 18 • 2
ViDoRe Chunk OCR (baseline) Collection The original ViDoRe benchmark was passed to Unstructured to partition each page into chunks. Visual chunks are OCRized with tesseract. • 11 items • Updated Jul 17 • 2
ColPali Paper Resources Collection Main resources for the paper: "ColPali: Efficient Document Retrieval with Vision Language Models" • 3 items • Updated Jul 2 • 4
ViDoRe Benchmark Collection Benchmark for document retrieval using visual features, introduced in "ColPali: Efficient Document Retrieval with Vision Language Models" • 10 items • Updated Jun 18 • 5
PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding Paper • 2312.04461 • Published Dec 7, 2023 • 56
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output Paper • 2407.03320 • Published Jul 3 • 92
Better & Faster Large Language Models via Multi-token Prediction Paper • 2404.19737 • Published Apr 30 • 73
view article Article Training and Finetuning Embedding Models with Sentence Transformers v3 May 28 • 146
LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report Paper • 2405.00732 • Published Apr 29 • 118
Low-rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition Paper • 2309.15223 • Published Sep 26, 2023 • 19
MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions Paper • 2403.19651 • Published Mar 28 • 23
Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models Paper • 2401.04658 • Published Jan 9 • 24
TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones Paper • 2312.16862 • Published Dec 28, 2023 • 30
Mini-GPTs: Efficient Large Language Models through Contextual Pruning Paper • 2312.12682 • Published Dec 20, 2023 • 8