Hugging Face
Akarshan Biswas
qnixsynapse
11 followers · 8 following
AI & ML interests
NLP, models, quantization
Recent Activity
New activity (10 days ago): google/gemma-2-9b-it: Tool calling support in Gemma 2
Liked a Space (10 days ago): webml-community/attention-visualization
Reacted to suayptalha's post (13 days ago):
Introducing the First Hugging Face Integration of minGRU Models from the paper "Were RNNs All We Needed?"

I have integrated next-generation RNNs, specifically minGRU, which offer faster performance compared to Transformer architectures, into HuggingFace. This allows users to leverage the lighter and more efficient minGRU models with the "transformers" library for both usage and training.

I integrated two main tasks: MinGRUForSequenceClassification and MinGRUForCausalLM.

MinGRUForSequenceClassification: You can use this class for Sequence Classification tasks. I also trained a Sentiment Analysis model with stanfordnlp/imdb dataset.

MinGRUForCausalLM: You can use this class for Causal Language Model tasks such as GPT, Llama. I also trained an example model with roneneldan/TinyStories dataset. You can fine-tune and use it!

Links:
Models: https://huggingface.co./collections/suayptalha/mingru-676fe8d90760d01b7955d7ab
GitHub: https://github.com/suayptalha/minGRU-hf
LinkedIn Post: https://www.linkedin.com/posts/suayp-talha-kocabay_mingru-a-suayptalha-collection-activity-7278755484172439552-wNY1

Credits:
Paper Link: https://arxiv.org/abs/2410.01201
I am thankful to Leo Feng, Frederick Tung, Mohamed Osama Ahmed, Yoshua Bengio and Hossein Hajimirsadeghi for their papers.
Organizations
None yet
Models: 14
qnixsynapse/gemma-2-9b-it-SimPO-IQ4_XS-GGUF • Updated Sep 18, 2024 • 7
qnixsynapse/Gemma-V2-2B-Instruct-GGUF • Text Generation • Updated Aug 13, 2024 • 3
qnixsynapse/gemma-2-9b-it-IQ4_XS-GGUF • Text Generation • Updated Jul 16, 2024 • 2
qnixsynapse/Gemma-V2-9B-Instruct-GGUF • Updated Jul 15, 2024 • 14
qnixsynapse/Meta-Llama-3-8B-Instruct-IQ4_XS-GGUF • Text Generation • Updated Jul 12, 2024 • 2
qnixsynapse/Phi-3-mini-4k-instruct-Q4_K_M-GGUF • Text Generation • Updated Jul 2, 2024
qnixsynapse/Mistral-7B-Instruct-v0.3-Q4_K_M-GGUF • Updated Jun 22, 2024 • 4
qnixsynapse/Meta-Llama-3-8B-Instruct-Q4_K_M-GGUF • Text Generation • Updated May 27, 2024 • 4
qnixsynapse/codegemma-7b-it-Q4_K_M-GGUF • Text Generation • Updated Apr 9, 2024 • 3
qnixsynapse/gemma-1.1-2b-it-Q4_K_S-GGUF • Updated Apr 6, 2024 • 4
datasets
None public yet