Oleg Dmitriev
qilowoq
AI & ML interests
NLP (mainly in Russian)
Recent Activity
liked
a dataset
about 1 month ago
ChicagoHAI/CaseSumm
liked
a dataset
about 1 month ago
google/FACTS-grounding-public
new activity
about 2 months ago
google/gemma-2-9b-it:Sliding window vs. Global Attention
Organizations
qilowoq's activity
Sliding window vs. Global Attention
6
#41 opened 6 months ago
by
tanliboy
Adding `safetensors` variant of this model
#4 opened 3 months ago
by
SFconvertbot
Adding `safetensors` variant of this model
#1 opened 3 months ago
by
SFconvertbot
How can we access the logits from this model output?
5
#3 opened over 1 year ago
by
vishwasprabhub
Methodology questions
2
#2 opened over 1 year ago
by
justinbarton
Different size between tokenizer vocab and embedding
2
#1 opened over 1 year ago
by
demharters