Théo Gigant

gigant

https://giganttheo.github.io/

AI & ML interests

multimodal summarization, generative models

Recent Activity

updated a dataset about 2 hours ago

gigant/tib-bench-mm-filtering-part1-2

updated a dataset about 2 hours ago

gigant/tib-bench-mm-part6

published a dataset about 2 hours ago

gigant/tib-bench-mm-part6

Articles

Design choices for Vision Language Models in 2024

Apr 16, 2024

• 25

gigant's activity

upvoted an article 6 days ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

7 days ago

• 587

upvoted an article 2 months ago

Article

EuroLLM-9B

Dec 2, 2024

• 108

upvoted a paper 4 months ago

EuroLLM: Multilingual Language Models for Europe

Paper • 2409.16235 • Published Sep 24, 2024 • 26

upvoted 2 papers 5 months ago

Contextual Position Encoding: Learning to Count What's Important

Paper • 2405.18719 • Published May 29, 2024 • 5

Building and better understanding vision-language models: insights and future directions

Paper • 2408.12637 • Published Aug 22, 2024 • 124

upvoted 3 papers 6 months ago

MiniCPM-V: A GPT-4V Level MLLM on Your Phone

Paper • 2408.01800 • Published Aug 3, 2024 • 80

Harvesting Textual and Structured Data from the HAL Publication Repository

Paper • 2407.20595 • Published Jul 30, 2024 • 22

MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens

Paper • 2406.11271 • Published Jun 17, 2024 • 21

upvoted an article 6 months ago

Article

Docmatix - a huge dataset for Document Visual Question Answering

Jul 18, 2024

• 72

upvoted 2 articles 7 months ago

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16, 2024

• 302

Article

ColPali: Efficient Document Retrieval with Vision Language Models 👀

Jul 5, 2024

• 196

upvoted 5 papers 7 months ago

PaliGemma: A versatile 3B VLM for transfer

Paper • 2407.07726 • Published Jul 10, 2024 • 68

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

Paper • 2407.03320 • Published Jul 3, 2024 • 93

upvoted 4 articles 8 months ago

Article

An Analysis of Chinese LLM Censorship and Bias with Qwen 2 Instruct

Jun 11, 2024

• 53

Article

Vision Language Models Explained

Apr 11, 2024

• 248

Article

Uncensor any LLM with abliteration

Jun 13, 2024

• 415

Article

Explaining the SDXL latent space

May 20, 2024

• 35