atayloraerospace's picture

atayloraerospace

Taylor658

·

atayloraerospace

AI & ML interests

Multimodal Gen AI 🤖 | Agentic AI 🧠🤖 | Computer Vision 🔭 | AI in Healthcare 🩺 | AI in Aerospace 🚀

Recent Activity

liked a Space about 17 hours ago

open-r1/open-r1-eval-leaderboard

upvoted an article about 17 hours ago

Open-R1: Update #1

new activity 3 days ago

Taylor658/synthetic-fine-arts:Update README.md

View all activity

Organizations

Taylor658's activity

upvoted an article about 17 hours ago

Article

Open-R1: Update #1

By

•

2 days ago

• 178

upvoted a paper 4 days ago

GuardReasoner: Towards Reasoning-based LLM Safeguards

Paper • 2501.18492 • Published 4 days ago • 74

upvoted 9 collections 6 days ago

FuseO1-Preview

System-II Reasoning Fusion of LLMs • 10 items • Updated 3 days ago • 15

Meta's Llama2 models

12 items • Updated Dec 13, 2024 • 52

YuE

YuE: Open Full-song Generation Foundation Model • 9 items • Updated 6 days ago • 14

Eagle 2

Eagle 2 is a family of frontier vision-language models with vision-centric design. The model supports 4K HD input, long-context video, and grounding. • 9 items • Updated 11 days ago • 28

Deepseek Papers

Deepseek papers collection • 14 items • Updated Dec 30, 2024 • 38

Qwen2.5-1M

The long-context version of Qwen2.5, supporting 1M-token context lengths • 2 items • Updated 8 days ago • 96

SmolVLM 256M & 500M

Collection for models & demos for even smoller SmolVLM release • 12 items • Updated 11 days ago • 64

DeepSeek-R1

8 items • Updated 14 days ago • 361

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 3 items • Updated 8 days ago • 311

upvoted 9 papers 6 days ago

Visual Generation Without Guidance

Paper • 2501.15420 • Published 9 days ago • 7

Parameters vs FLOPs: Scaling Laws for Optimal Sparsity for Mixture-of-Experts Language Models

Paper • 2501.12370 • Published 13 days ago • 9

CodeMonkeys: Scaling Test-Time Compute for Software Engineering

Paper • 2501.14723 • Published 10 days ago • 7

Mixture-of-Mamba: Enhancing Multi-Modal State-Space Models with Modality-Aware Sparsity

Paper • 2501.16295 • Published 7 days ago • 7

Are Vision Language Models Texture or Shape Biased and Can We Steer Them?

Paper • 2403.09193 • Published Mar 14, 2024 • 9

iFormer: Integrating ConvNet and Transformer for Mobile Application

Paper • 2501.15369 • Published 9 days ago • 10

Emilia: A Large-Scale, Extensive, Multilingual, and Diverse Dataset for Speech Generation

Paper • 2501.15907 • Published 7 days ago • 15

Qwen2.5-1M Technical Report

Paper • 2501.15383 • Published 9 days ago • 49

Baichuan-Omni-1.5 Technical Report

Paper • 2501.15368 • Published 9 days ago • 48