AWQ search for accurate quantization. Pre-computed AWQ model zoo for LLMs (LLaMA-1&2, OPT, Vicuna, LLaVA; load to generate quantized weights). Memory-efficient 4-bit Linear in PyTorch. Efficient CUDA ...
AWQ search for accurate quantization. Pre-computed AWQ model zoo for LLMs (LLaMA, OPT, Vicuna, LLaVA; load to generate quantized weights). Memory-efficient 4-bit Linear in PyTorch. Efficient CUDA ...
Abstract: Single image super-resolution (SISR) aims to reconstruct a high-resolution image from its low-resolution observation. Recent deep learning-based SISR models show high performance at the ...
Abstract: Directly affecting both error performance and complexity, quantization is critical for MMSE MIMO detection. However, naively pruning quantization levels is ...