AWQ
Quantization Method
Overview
Developed by: MIT Han Lab
License: MIT License
Open source: Yes
Use case: 4-bit weight quantization for large language models
Integrates with: Hugging Face Transformers
Alternative to: GPTQ, SmoothQuant
Knowledge graph stats
Claims: 14
Avg confidence: 92%
Avg freshness: 100%
Last updated: 20 days ago
Trust distribution: 100% unverified
AWQ
concept
AWQ (Activation-aware Weight Quantization) is a weight quantization method that compresses large language models, typically to 4-bit weights, with minimal accuracy loss.
open source
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| true | ○Unverified | High | Fresh | 1 |
published year
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| 2023 | ○Unverified | High | Fresh | 1 |
stands for
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| Activation-aware Weight Quantization | ○Unverified | High | Fresh | 1 |
primary use case
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| 4-bit weight quantization for large language models | ○Unverified | High | Fresh | 1 |
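To make "4-bit weight quantization" concrete, the sketch below quantizes one small group of weights to 4-bit integer codes and reconstructs them. It shows plain asymmetric round-to-nearest quantization only, not AWQ's activation-aware scaling step, and the function names `quantize_group_4bit` and `dequantize_group` are hypothetical, chosen for illustration rather than taken from any AWQ codebase.

```python
from typing import List, Tuple


def quantize_group_4bit(weights: List[float]) -> Tuple[List[int], float, int]:
    """Asymmetric round-to-nearest quantization of one weight group to 4 bits.

    Illustrative baseline only: AWQ additionally rescales weight channels
    based on activation statistics before a step like this one.
    """
    qmax = 15  # 4 bits give integer codes 0..15
    w_min, w_max = min(weights), max(weights)
    scale = (w_max - w_min) / qmax
    if scale == 0.0:
        scale = 1.0  # constant group: any non-zero scale works
    zero_point = round(-w_min / scale)
    # Map each weight to the nearest code and clamp into the 4-bit range.
    codes = [min(qmax, max(0, round(w / scale) + zero_point)) for w in weights]
    return codes, scale, zero_point


def dequantize_group(codes: List[int], scale: float, zero_point: int) -> List[float]:
    """Reconstruct approximate floating-point weights from 4-bit codes."""
    return [(c - zero_point) * scale for c in codes]


group = [-0.8, -0.1, 0.0, 0.3, 0.7]  # made-up example weights
codes, scale, zero_point = quantize_group_4bit(group)
restored = dequantize_group(codes, scale, zero_point)
max_error = max(abs(a - b) for a, b in zip(group, restored))
```

Each group stores its own `scale` and `zero_point`, so smaller groups track the weight distribution more closely at the cost of more metadata per weight.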
reduces
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| model memory footprint | ○Unverified | High | Fresh | 1 |
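The memory reduction can be estimated with simple arithmetic. The sketch below compares weight storage at 16 bits and at 4 bits per weight. The 7-billion-parameter model size, the group size of 128, and the 32 bits of per-group metadata (for example a 16-bit scale plus a 16-bit zero point) are assumptions picked for illustration, and `weight_memory_gib` is a hypothetical helper. The estimate covers weights only and ignores activations, the KV cache, and any layers left unquantized.

```python
def weight_memory_gib(
    num_params: float,
    bits_per_weight: float,
    group_size: int = 0,
    bits_per_group_metadata: int = 0,
) -> float:
    """Estimate weight storage in GiB, optionally with per-group metadata."""
    total_bits = num_params * bits_per_weight
    if group_size:
        # One block of metadata (scale, zero point) per group of weights.
        total_bits += (num_params / group_size) * bits_per_group_metadata
    return total_bits / 8 / 2**30


# Assumed example: a 7B-parameter model.
fp16_gib = weight_memory_gib(7e9, 16)
int4_gib = weight_memory_gib(7e9, 4, group_size=128, bits_per_group_metadata=32)
reduction = fp16_gib / int4_gib

print(f"16-bit weights: {fp16_gib:.2f} GiB")
print(f"4-bit weights:  {int4_gib:.2f} GiB")
print(f"reduction:      {reduction:.2f}x")
```

Under these assumptions the per-group metadata adds 0.25 bits per weight, so the reduction is slightly below the nominal 4x.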
developed by
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| MIT Han Lab | ○Unverified | High | Fresh | 1 |
license type
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| MIT License | ○Unverified | High | Fresh | 1 |
supports model
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| LLaMA | ○Unverified | High | Fresh | 1 |
| OPT | ○Unverified | High | Fresh | 1 |
requires
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| PyTorch | ○Unverified | High | Fresh | 1 |
integrates with
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| Hugging Face Transformers | ○Unverified | Moderate | Fresh | 1 |
maintains
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| model accuracy during quantization | ○Unverified | Moderate | Fresh | 1 |
alternative to
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| GPTQ | ○Unverified | Moderate | Fresh | 1 |
| SmoothQuant | ○Unverified | Moderate | Fresh | 1 |