Konrad Szafer

KonradSzafer

AI & ML interests

Foundation Models, RL, Continual Learning

Organizations

Blog-explorers · hf-qa-bot · Auton Lab · Hugging Face Discord Community

KonradSzafer's activity

posted an update 3 days ago
I've been experimenting with a "Tech Tree" to make ML research more systematic and transparent. It turned out to help me spot hidden interactions between experiments and share progress more easily. I wrote a short blog post with examples and insights! KonradSzafer/tech_tree_blog
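
The post itself doesn't include code, but as a rough illustration of the idea, here is a hypothetical sketch (the class and field names are mine, not from the blog post) of tracking experiments as nodes in a tree so the lineage of every run stays explicit:

```python
from dataclasses import dataclass, field

# Hypothetical sketch of a "tech tree" of experiments: each node records
# a run and its parent, so chains of changes (and their interactions)
# stay explicit instead of living in scattered notes.
@dataclass
class ExperimentNode:
    name: str                                    # e.g. "baseline", "lr-decay"
    config: dict                                 # hyperparameters for this run
    metrics: dict = field(default_factory=dict)  # results once the run finishes
    parent: "ExperimentNode | None" = None
    children: list = field(default_factory=list)

    def branch(self, name: str, **config_changes) -> "ExperimentNode":
        """Create a child experiment that inherits this node's config."""
        child = ExperimentNode(name, {**self.config, **config_changes}, parent=self)
        self.children.append(child)
        return child

    def lineage(self) -> list:
        """Walk back to the root to see every change that led here."""
        node, path = self, []
        while node is not None:
            path.append(node.name)
            node = node.parent
        return list(reversed(path))

root = ExperimentNode("baseline", {"lr": 3e-4, "batch_size": 64})
lr_decay = root.branch("lr-decay", lr_schedule="cosine")
print(lr_decay.lineage())  # ['baseline', 'lr-decay']
```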
updated a Space 3 days ago
published a Space 3 days ago
upvoted an article about 1 month ago

Open-R1: a fully open reproduction of DeepSeek-R1

reacted to gabrielmbmb's post with 🔥 6 months ago
Yesterday @mattshumer released mattshumer/Reflection-Llama-3.1-70B, an impressive model that achieved incredible results on benchmarks like MMLU. The model was fine-tuned using Reflection-Tuning and the dataset used wasn't released, but I created a small recipe with distilabel that allows generating a dataset with a similar output format:

1. We use MagPie 🐦 in combination with https://huggingface.co./meta-llama/Meta-Llama-3.1-70B-Instruct to generate reasoning instructions.
2. We generate a response again using https://huggingface.co./meta-llama/Meta-Llama-3.1-70B-Instruct, but we steer the LLM to generate a specific output format using a custom system prompt. In the system prompt, we instruct the LLM that it will first have to think 💭 and produce reflections that help resolve ambiguities. After that, we instruct the LLM to generate an output based on the previous thinking (a sketch of this steering step follows below).
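
The actual pipeline lives in reflection.py in the dataset repo; as a rough, non-authoritative sketch of step 2 (the real system prompt in the recipe may differ), the steering can be approximated with huggingface_hub's InferenceClient, asking the model to wrap its reasoning, reflections, and final answer in tags like those used by Reflection-Llama:

```python
from huggingface_hub import InferenceClient

# Hypothetical system prompt approximating the recipe's steering step; the
# real one is in reflection.py. The tags follow the Reflection-Llama output
# format: <thinking>, <reflection>, <output>.
SYSTEM_PROMPT = (
    "You are an assistant that reasons before answering. First think inside "
    "<thinking> tags, using <reflection> tags whenever you need to resolve an "
    "ambiguity or correct yourself. Then give the final answer inside "
    "<output> tags."
)

client = InferenceClient(model="meta-llama/Meta-Llama-3.1-70B-Instruct")

def generate_reflection_response(instruction: str) -> str:
    """Step 2 of the recipe: steer the model toward the reflection format."""
    completion = client.chat_completion(
        messages=[
            {"role": "system", "content": SYSTEM_PROMPT},
            {"role": "user", "content": instruction},
        ],
        max_tokens=1024,
    )
    return completion.choices[0].message.content

# The instructions would come from step 1 (MagPie); a hand-written one for demo:
print(generate_reflection_response("How many 'r's are in 'strawberry'?"))
```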

In the dataset gabrielmbmb/distilabel-reflection-tuning you can find 5 rows that I generated with this recipe. You can also find the code of the pipeline in the file called reflection.py.