Neon Bohdan's picture

Neon Bohdan

NeonBohdan

·

AI & ML interests

None yet

Organizations

NeonBohdan's activity

upvoted a collection 17 days ago

C4AI Aya Expanse

Aya Expanse is an open-weight research release of a model with highly advanced multilingual capabilities. • 3 items • Updated 17 days ago • 25

upvoted a collection 19 days ago

Granite 3.0 Language Models

A series of language models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 8 items • Updated 6 days ago • 86

upvoted a collection 24 days ago

Llama-3.1-Nemotron-70B

SOTA models on Arena Hard and RewardBench as of 1 Oct 2024. • 6 items • Updated 26 days ago • 129

upvoted 2 collections about 1 month ago

OpenMath-2

A collection of models and datasets introduced in "OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data" • 7 items • Updated 12 days ago • 13

Salamandra 🦎

13 items • Updated 3 days ago • 36

upvoted 3 collections about 2 months ago

Llama 3.2

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated 17 days ago • 453

MagpieLM

Aligning LMs with Fully Open Recipe (data+training configs+logs) • 9 items • Updated Sep 22 • 15

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated Sep 18 • 308

upvoted 2 collections 2 months ago

Qwen1.5

Qwen1.5 is the improved version of Qwen, the large language model series developed by Alibaba Cloud. • 55 items • Updated Sep 18 • 206

Yi-1.5 (2024/05)

10 items • Updated May 20 • 90

upvoted 2 collections 3 months ago

Minitron

A family of compressed models obtained via pruning and knowledge distillation • 9 items • Updated Oct 3 • 59

Llama-3-Instruct-Referecnce-Free-Preference

3 items • Updated Jun 22 • 3

upvoted a collection 4 months ago

Llama 3.1

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Sep 25 • 613

upvoted a collection 5 months ago

Nemotron 4 340B

Nemotron-4: open models for Synthetic Data Generation (SDG). Includes Base, Instruct, and Reward models. • 4 items • Updated 8 days ago • 157