AK

akhaliq

_akhaliq

Recent Activity

liked a Space about 13 hours ago

MakiAi/Pokedex-App-Jp-gpt4-5

commented on a paper about 14 hours ago

Mobius: Text to Seamless Looping Video Generation via Latent Shift

commented on a paper about 14 hours ago

FlexiDiT: Your Diffusion Transformer Can Easily Generate High-Quality Samples with Less Compute

akhaliq's activity

upvoted a collection 2 days ago

Phi-4

Phi-4 family of small language and multi-modal models. • 7 items • Updated about 8 hours ago • 83

upvoted a collection 3 days ago

olmOCR

olmOCR is a document recognition pipeline for efficiently converting documents into plain text. olmocr.allenai.org • 3 items • Updated 2 days ago • 46

upvoted an article 3 days ago

Article

FastRTC: The Real-Time Communication Library for Python

4 days ago

• 97

upvoted a collection 8 days ago

Ovis2

Our latest advancement in multi-modal large language models (MLLMs) • 8 items • Updated 12 days ago • 52

upvoted an article 10 days ago

Article

Introducing Three New Serverless Inference Providers: Hyperbolic, Nebius AI Studio, and Novita 🔥

11 days ago

• 89

upvoted a collection 11 days ago

SkyReels-V1

SkyReels V1 open models collections • 2 items • Updated 11 days ago • 17

upvoted a collection 29 days ago

Tulu 3 Models

All models released with Tulu 3 -- state of the art open post-training recipes. • 11 items • Updated 16 days ago • 91

upvoted an article about 1 month ago

Article

Welcome to Inference Providers on the Hub 🔥

Jan 28

• 400

upvoted a paper about 2 months ago

TransPixar: Advancing Text-to-Video Generation with Transparency

Paper • 2501.03006 • Published Jan 6 • 23

upvoted a collection about 2 months ago

Cosmos

The collection of Cosmos models • 31 items • Updated Jan 17 • 265

upvoted a paper 2 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 347

upvoted a paper 3 months ago

EXAONE 3.5: Series of Large Language Models for Real-world Use Cases

Paper • 2412.04862 • Published Dec 6, 2024 • 50

upvoted 7 collections 3 months ago

Llama 3.3

This collection hosts the transformers and original repos of the Llama 3.3 • 1 item • Updated Dec 6, 2024 • 136

PaliGemma 2 Release

Vision-Language Models available in multiple 3B, 10B and 28B variants. • 23 items • Updated Dec 13, 2024 • 143

QwQ

Qwen with Questions • 2 items • Updated Nov 28, 2024 • 59

AIMv2

A collection of AIMv2 vision encoders that supports a number of resolutions, native resolution, and a distilled checkpoint. • 19 items • Updated Nov 22, 2024 • 74

Insight-V

Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models • 5 items • Updated Nov 22, 2024 • 9

WhisperNER

Collection of WhisperNER models for joint open type NER and ASR • 3 items • Updated Nov 30, 2024 • 6

OpenScholar_V1

The set of models, index, data associated with the paper "OpenScholar: Synthesizing Scientific Literature with Retrieval-Augmented LMs". • 8 items • Updated Nov 22, 2024 • 33

upvoted a paper 3 months ago

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Paper • 2411.10440 • Published Nov 15, 2024 • 114