Nicolay Rusnachenko
nicolay-r
62 followers · 3 following
https://nicolay-r.github.io/
nicolayr_
AI & ML interests
Information Retrievalć»Medical Multimodal NLP (š¼+š) Research Fellow @BU_Researchć»software developer http://arekit.ioć»PhD in NLP
Recent Activity
posted an update about 23 hours ago
For those who wish to launch the distilled DeepSeek R1 for reasoning with a schema, I am sharing a Google Colab notebook: https://github.com/nicolay-r/nlp-thirdgate/blob/master/tutorials/llm_deep_seek_7b_distill_colab.ipynb
It is a wrapper of the Qwen2 transformers provider via the bulk-chain framework.
Model: https://huggingface.co./deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
GPU: T4 (15GB) is nearly enough in float32 mode.
To boost performance, you may set bf16 mode (use_bf16=True).
Powered by bulk-chain: https://github.com/nicolay-r/bulk-chain
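To make the float32 versus bf16 trade-off concrete, here is a rough weights-only estimate. It is a back-of-the-envelope sketch, not a measurement from the notebook: the parameter count of 7e9 is an assumption read off the "7B" in the model name, and `weight_memory_gib` is a hypothetical helper. Real usage also depends on activations, the KV cache, and how the notebook loads the model (for example, offloading), none of which the post describes.

```python
def weight_memory_gib(n_params: float, bytes_per_param: int) -> float:
    """Memory needed to hold the weights alone, in GiB."""
    return n_params * bytes_per_param / 2**30

# Assumption: roughly 7 billion parameters, taken from the "7B" in the model name.
N_PARAMS = 7e9

fp32_gib = weight_memory_gib(N_PARAMS, 4)  # float32 stores 4 bytes per parameter
bf16_gib = weight_memory_gib(N_PARAMS, 2)  # bfloat16 stores 2 bytes per parameter

print(f"float32 weights: {fp32_gib:.1f} GiB")
print(f"bf16 weights:    {bf16_gib:.1f} GiB")
```

Under these assumptions, switching to bf16 halves the memory taken by the weights, which is the main reason the flag matters on a 15GB card.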
upvoted a collection 2 days ago
DeepSeek-R1
posted an update 2 days ago
For those who wish to apply DeepSeek-R1 to tabular / streaming data using a schema of prompts (CoT), OpenRouter AI hosts an API for access: https://openrouter.ai/deepseek/deepseek-r1
The no-strings way to get started with DeepSeek-R1 takes three steps:
1. OpenRouter provider: https://github.com/nicolay-r/nlp-thirdgate/blob/master/llm/open_router.py
2. Bulk-chain for inferring data: https://github.com/nicolay-r/bulk-chain
3. JSON schema for Chain-of-Thought reasoning (see screenshot below)
The screenshot below shows how to quick-start the demo, in which you can test your schema for LLM responses. It first asks you to type in all the parameters needed to complete the requests (`text` in this example).
To apply it to JSONL/CSV data, use the `--src` shell parameter to pass the related file.
As for time, I find OpenRouter relatively slow, at 30~40 seconds per request.
Model: https://huggingface.co./deepseek-ai/DeepSeek-R1
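The idea of a schema of prompts applied record by record can be sketched in plain Python. This is an illustration only and not bulk-chain's implementation or API: the schema layout (`prompt` and `out` keys), the `run_chain` and `iter_jsonl` helpers, and the `echo_llm` stand-in are all hypothetical. A real setup would replace `echo_llm` with a call to a hosted model.

```python
import json

# Hypothetical schema: each step fills a prompt template and stores the
# model's reply under the field named by "out", so later steps can reuse it.
SCHEMA = [
    {"prompt": "Think step by step about: {text}", "out": "steps"},
    {"prompt": "Question: {text}\nReasoning: {steps}\nGive the final answer.",
     "out": "answer"},
]

def run_chain(record, schema, llm):
    """Run every schema step on one record and return the enriched record."""
    result = dict(record)
    for step in schema:
        prompt = step["prompt"].format(**result)
        result[step["out"]] = llm(prompt)
    return result

def echo_llm(prompt):
    # Stand-in for a real model call; returns a deterministic string.
    return f"<reply to {len(prompt)} chars>"

def iter_jsonl(lines):
    """Yield one dict per non-empty JSONL line."""
    for line in lines:
        line = line.strip()
        if line:
            yield json.loads(line)

# Two in-memory JSONL lines; a file opened for reading works the same way.
JSONL = ['{"text": "Is 17 prime?"}', '{"text": "What is 2 + 2?"}']
results = [run_chain(rec, SCHEMA, echo_llm) for rec in iter_jsonl(JSONL)]

for row in results:
    print(json.dumps(row))
```

Each output row keeps the original `text` field and gains one field per schema step, which is what makes the approach convenient for tabular or streaming data.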
Organizations
None yet
nicolay-r's activity
liked a model 2 days ago
deepseek-ai/DeepSeek-R1 • Text Generation • Updated 3 days ago • 189k • 4.38k
liked a Space 14 days ago
Open Medical-LLM Leaderboard • Running on CPU Upgrade • 324
liked a model 14 days ago
johnsnowlabs/JSL-MedLlama-3-8B-v2.0 • Text Generation • Updated Apr 30, 2024 • 11.9k • 29
liked a model 4 months ago
meta-llama/Llama-3.2-3B-Instruct • Text Generation • Updated Oct 24, 2024 • 1.47M • 934
liked a model 6 months ago
hyy-33/hyy33-WASSA-2024-Track-2 • Updated Jul 9, 2024 • 2
liked 2 models 7 months ago
google/gemma-2-9b-it • Text Generation • Updated Aug 27, 2024 • 390k • 638
google/gemma-2-27b-it • Text Generation • Updated Aug 27, 2024 • 156k • 506
liked 4 models 8 months ago
Qwen/Qwen2-7B-Instruct • Text Generation • Updated Aug 21, 2024 • 879k • 611
mistralai/Mistral-7B-Instruct-v0.3 • Text Generation • Updated Aug 21, 2024 • 1.75M • 1.29k
microsoft/Phi-3-small-8k-instruct • Text Generation • Updated Aug 30, 2024 • 23.1k • 160
microsoft/Phi-3-mini-4k-instruct • Text Generation • Updated Sep 20, 2024 • 871k • 1.12k
liked 2 models 9 months ago
xtuner/llava-phi-3-mini-hf • Image-to-Text • Updated Apr 25, 2024 • 5.89k • 48
xtuner/llava-llama-3-8b-v1_1 • Image-Text-to-Text • Updated Apr 28, 2024 • 50 • 120
liked 7 models 10 months ago
AIRI-Institute/OmniFusion • Updated Apr 10, 2024 • 56
google-bert/bert-base-uncased • Fill-Mask • Updated Feb 19, 2024 • 83.7M • 2.08k
google/gemma-1.1-2b-it • Text Generation • Updated Jun 27, 2024 • 85.3k • 154
google/gemma-2b-it • Text Generation • Updated Sep 27, 2024 • 102k • 700
google/gemma-7b-it • Text Generation • Updated Aug 14, 2024 • 67.6k • 1.15k
google/gemma-1.1-7b-it • Text Generation • Updated Jun 27, 2024 • 18.5k • 270
mistralai/Mixtral-8x7B-Instruct-v0.1 • Text Generation • Updated Aug 19, 2024 • 1.75M • 4.28k