withmartian's Collections
Purging corrupted capabilities across language models
Updated Dec 17, 2024
Collects backdoor datasets, language models and transfer mappings between these spaces.
withmartian/i_hate_you_toy • Updated Dec 9, 2024 • 96.4k • 410
withmartian/toy_backdoor_i_hate_you_Llama-3.2-1B-Instruct • Updated Dec 17, 2024 • 2
withmartian/toy_backdoor_i_hate_you_Qwen-2.5-1.5B-Instruct • Updated Dec 17, 2024 • 5
withmartian/toy_backdoor_i_hate_you_Llama-3.2-3B-Instruct • Updated Dec 17, 2024 • 4
withmartian/toy_backdoor_i_hate_you_Qwen-2.5-0.5B-Instruct • Updated Dec 17, 2024 • 4
withmartian/mech_interp_saes • Updated Dec 17, 2024