FP6-LLM: Efficiently Serving Large Language Models Through FP6-Centric Algorithm-System Co-Design
Data and Resources
-
Original MetadataJSON
The json representation of the dataset with its distributions based on DCAT.
Cite this as
Haojun Xia, Zhen Zheng, Xiaoxia Wu, Shiyang Chen, Zhewei Yao, Stephen Youn, Arash Bakhtiari, Michael Wyatt, Yuxiong He, Olatunji Ruwase, Shuaiwen Leon Song (2025). Dataset: FP6-LLM: Efficiently Serving Large Language Models Through FP6-Centric Algorithm-System Co-Design. https://doi.org/10.57702/z4ohr5qr
DOI retrieved: January 3, 2025
Additional Info
Field | Value |
---|---|
Created | January 3, 2025 |
Last update | January 3, 2025 |
Defined In | https://doi.org/10.48550/arXiv.2401.14112 |
Author | Haojun Xia |
More Authors |
|
Homepage | https://github.com/usyd-fsalab/fp6_llm |