Dmitry Balobin

d0rj

AI & ML interests

NLP and 🥴 tensors. 2GIS 💚

Recent Activity

liked a model 4 days ago
yandex/YandexGPT-5-Lite-8B-pretrain
liked a dataset 5 days ago
lightblue/text_ratings
liked a model 8 days ago
perplexity-ai/r1-1776

Organizations

None yet

d0rj's activity

reacted to kristaller486's post with 🚀 12 days ago
Nebo-T1-Russian

(Probably) the first "longCoT" dataset for the Russian language created via DeepSeek-R1.

- Prompts taken from the Sky-T1 dataset and translated via Llama3.3-70B.
- Answers and reasoning generated by DeepSeek-R1 (685B).
- 16.4K samples in total, ≈12.4K Russian-only (in the rest, either the answer or reasoning is in English).
- Languages in the answers and reasoning are labeled using fasttext.
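The Russian-only subset mentioned above can be separated from the mixed-language rest using the per-sample language labels. Below is a minimal sketch of that split; the field names `answer_lang` and `reasoning_lang` are hypothetical (the post does not name the dataset's columns), and the labels are hard-coded here rather than produced by fasttext.

```python
from typing import Dict, Iterable, List, Tuple

Sample = Dict[str, str]


def is_russian_only(
    sample: Sample,
    answer_key: str = "answer_lang",        # hypothetical column name
    reasoning_key: str = "reasoning_lang",  # hypothetical column name
) -> bool:
    """Return True when both the answer and the reasoning are labeled Russian."""
    return sample.get(answer_key) == "ru" and sample.get(reasoning_key) == "ru"


def split_by_language(samples: Iterable[Sample]) -> Tuple[List[Sample], List[Sample]]:
    """Split samples into (Russian-only, mixed-language) lists."""
    russian_only: List[Sample] = []
    mixed: List[Sample] = []
    for sample in samples:
        if is_russian_only(sample):
            russian_only.append(sample)
        else:
            mixed.append(sample)
    return russian_only, mixed


# Toy records standing in for dataset rows; the labels are illustrative only.
toy_samples = [
    {"id": "a", "answer_lang": "ru", "reasoning_lang": "ru"},
    {"id": "b", "answer_lang": "ru", "reasoning_lang": "en"},
    {"id": "c", "answer_lang": "en", "reasoning_lang": "ru"},
]

russian_only, mixed = split_by_language(toy_samples)
print([s["id"] for s in russian_only], [s["id"] for s in mixed])
```

With the real dataset, the same predicate could be applied to each row once the actual column names are known.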

kristaller486/Nebo-T1-Russian