ShaohanTian

https://shaohanyun.top/

AI & ML interests

Natural Language Processing (NLP), Text-to-Image Generation

ShaohanTian's activity

reacted to schuler's post with 👍 3 days ago

📢 New Research Alert: Making Language Models Smaller & Smarter!

Thrilled to share the latest technical report demonstrating how to reduce language model parameters by 77% while maintaining performance.

The secret? Grouped pointwise convolutions. Yes. We brought a method from computer vision to the transformers arena.
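
As a rough illustration of why grouping shrinks a layer, the sketch below counts the weights of a pointwise (1x1) convolution with and without groups. It is not taken from the linked report or repository: the function name `pointwise_params`, the layer sizes, and the group count are all hypothetical, and the 75% it prints is plain arithmetic for these made-up sizes, not the report's 77% result.

```python
def pointwise_params(c_in: int, c_out: int, groups: int = 1, bias: bool = False) -> int:
    """Count trainable parameters of a pointwise (1x1) convolution.

    With `groups` groups, each group maps c_in/groups input channels to
    c_out/groups output channels, so the weight count is divided by `groups`.
    """
    if groups < 1 or c_in % groups or c_out % groups:
        raise ValueError("groups must be positive and divide both c_in and c_out")
    weights = groups * (c_in // groups) * (c_out // groups)
    return weights + (c_out if bias else 0)


# Hypothetical layer sizes, chosen only for illustration.
dense = pointwise_params(1024, 4096)               # ungrouped, same as a dense layer
grouped = pointwise_params(1024, 4096, groups=4)   # 4 independent groups
reduction = 1 - grouped / dense
print(f"{dense} -> {grouped} weights ({reduction:.0%} fewer)")
```

Grouping alone stops channels in different groups from mixing, so a practical design has to restore that mixing somehow; how the report handles this is described in the paper and code linked in the post.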

🔑 Key Findings:
• 77% parameter reduction.
• Maintained model capabilities.
• Improved generalization.

Paper: https://www.researchgate.net/publication/388835829_SAVING_77_OF_THE_PARAMETERS_IN_LARGE_LANGUAGE_MODELS_TECHNICAL_REPORT
Code: https://github.com/joaopauloschuler/less-parameters-llm

liked a model about 2 months ago

MGE-LLMs/SteelBERT

Fill-Mask • Updated Jun 28, 2024 • 4 • 5

liked a model 6 months ago

stabilityai/stable-diffusion-3-medium

Text-to-Image • Updated Aug 12, 2024 • 29.5k • 4.69k

updated a Space 7 months ago

README

updated a dataset 8 months ago

ShaohanTian/curve-plot

Viewer • Updated Jun 28, 2024 • 10k • 44

updated a model 8 months ago

MGE-LLMs/SteelBERT

Fill-Mask • Updated Jun 28, 2024 • 4 • 5

updated a Space 8 months ago

MGE LLMs SteelBERT

liked a Space over 1 year ago

Open LLM Leaderboard

Track, rank and evaluate open LLMs and chatbots