SQFT Collection SQFT Models (SQFT: Low-cost Model Adaptation in Low-precision Sparse Foundation Models) • 56 items • Updated Nov 14
Shears Collection Shears Models (Shears: Unstructured Sparsity with Neural Low-rank Adapter Search) • 14 items • Updated Nov 14
Shears: Unstructured Sparsity with Neural Low-rank Adapter Search Paper • 2404.10934 • Published Apr 16
A Hardware-Aware Framework for Accelerating Neural Architecture Search Across Modalities Paper • 2205.10358 • Published May 19, 2022
Sensi-BERT: Towards Sensitivity Driven Fine-Tuning for Parameter-Efficient BERT Paper • 2307.11764 • Published Jul 14, 2023
InstaTune: Instantaneous Neural Architecture Search During Fine-Tuning Paper • 2308.15609 • Published Aug 29, 2023
A Hardware-Aware Framework for Accelerating Neural Architecture Search Across Modalities Paper • 2205.10358 • Published May 19, 2022
A Hardware-Aware System for Accelerating Deep Neural Network Optimization Paper • 2202.12954 • Published Feb 25, 2022
Accelerating Neural Architecture Exploration Across Modalities Using Genetic Algorithms Paper • 2202.12934 • Published Feb 25, 2022
SimQ-NAS: Simultaneous Quantization Policy and Neural Architecture Search Paper • 2312.13301 • Published Dec 19, 2023
LLaMA-NAS: Efficient Neural Architecture Search for Large Language Models Paper • 2405.18377 • Published May 28 • 18
Shears: Unstructured Sparsity with Neural Low-rank Adapter Search Paper • 2404.10934 • Published Apr 16
Online Continual Learning Without the Storage Constraint Paper • 2305.09253 • Published May 16, 2023 • 2