APTQ: Attention-aware Post-Training Mixed-Precision Quantization for Large Language Models
Data and Resources
- Original Metadata (JSON): The JSON representation of the dataset with its distributions based on DCAT.
Cite this as
Ziyi Guan, Hantao Huang, Yupeng Su, Hong Huang, Ngai Wong, Hao Yu (2024). Dataset: APTQ: Attention-aware Post-Training Mixed-Precision Quantization for Large Language Models. https://doi.org/10.57702/i1dmoyg7
DOI retrieved: December 16, 2024
Additional Info
Field | Value |
---|---|
Created | December 16, 2024 |
Last update | December 16, 2024 |
Defined In | https://doi.org/10.1145/3649329.3658498 |
Author | Ziyi Guan |
More Authors | Hantao Huang, Yupeng Su, Hong Huang, Ngai Wong, Hao Yu |