babycommando's picture

1 9 58

babycommando

babycommando

·

AI & ML interests

ai!

Organizations

babycommando's activity

upvoted a paper 5 months ago

NVLM: Open Frontier-Class Multimodal LLMs

Paper • 2409.11402 • Published Sep 17, 2024 • 73

upvoted a paper 7 months ago

Learning to Manipulate Anywhere: A Visual Generalizable Framework For Reinforcement Learning

Paper • 2407.15815 • Published Jul 22, 2024 • 14

upvoted an article 7 months ago

Article

ZebraLogic: Benchmarking the Logical Reasoning Ability of Language Models

By

•

Jul 27, 2024

• 31

upvoted 3 papers 8 months ago

Wavelets Are All You Need for Autoregressive Image Generation

Paper • 2406.19997 • Published Jun 28, 2024 • 30

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

Paper • 2407.03320 • Published Jul 3, 2024 • 93

Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs

Paper • 2406.16860 • Published Jun 24, 2024 • 60

upvoted an article 8 months ago

Article

Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models

Jun 24, 2024

• 188

upvoted a paper 8 months ago

LiveMind: Low-latency Large Language Models with Simultaneous Inference

Paper • 2406.14319 • Published Jun 20, 2024 • 14

upvoted a collection about 1 year ago

Vision Models (GGUF)

How to use: Download a "mmproj" model file + one or more of the primary model files. • 5 items • Updated Dec 22, 2023 • 44