shrikant lengare

shri210620

AI & ML interests

yes

Organizations

None yet

shri210620's activity

upvoted a paper 3 days ago

Slamming: Training a Speech Language Model on One GPU in a Day

Paper • 2502.15814 • Published 9 days ago • 56

upvoted an article 17 days ago

Fine-tune Deepseek-R1 with a Synthetic Reasoning Dataset

Article • 18 days ago • 44

upvoted a collection 20 days ago

DeepSeek-R1

Collection

8 items • Updated Jan 21 • 545

upvoted a collection 27 days ago

Qwen2-VL

Collection

Vision-language model series based on Qwen2 • 16 items • Updated Dec 6, 2024 • 207

upvoted 2 papers 28 days ago

MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding

Paper • 2501.18362 • Published 29 days ago • 21

o3-mini vs DeepSeek-R1: Which One is Safer?

Paper • 2501.18438 • Published 29 days ago • 22

liked a Space 30 days ago

🌍 Chat With Janus-Pro-7B

A unified multimodal understanding and generation model. • 1.84k

published 2 Spaces about 1 month ago

🏢 Deepseek Ai DeepSeek R1 Distill Qwen 1.5B

Fast

🏢 Deepseek Ai DeepSeek R1 Distill Llama 8B

AI chatbot

liked 2 models about 1 month ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-32B

Text Generation • Updated 5 days ago • 1.26M • 1.2k

deepseek-ai/DeepSeek-R1

Text Generation • Updated 5 days ago • 4.63M • 10.5k

upvoted a paper about 1 month ago

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published Jan 17 • 106

liked a model about 2 months ago

Qwen/Qwen2-Audio-7B

Audio-Text-to-Text • Updated Nov 20, 2024 • 19.8k • 102

upvoted a paper 2 months ago

Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search

Paper • 2412.18319 • Published Dec 24, 2024 • 37

upvoted a paper 3 months ago

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Paper • 2412.09596 • Published Dec 12, 2024 • 94

liked a Space 3 months ago

🏢 TRELLIS

Scalable and Versatile 3D Generation from images • 3.99k

upvoted 3 papers 3 months ago

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published Dec 9, 2024 • 80

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published Dec 6, 2024 • 135

VisionZip: Longer is Better but Not Necessary in Vision Language Models

Paper • 2412.04467 • Published Dec 5, 2024 • 107

upvoted an article 3 months ago

Running Your Custom LoRA Fine-Tuned MusicGen Large Locally

Article • Dec 6, 2024 • 1