[Arxiv 2024] PrefixQuant: Static Quantization Beats Dynamic through Prefixed Outliers in LLMs
ContentsIntroductionMethodExperimentsReferencesIntroduction 作者提出 PrefixQuant,基于 QuaRot,通过在 WA 量化时
7月前470
ContentsIntroductionMethodExperimentsReferencesIntroduction 作者提出 PrefixQuant,基于 QuaRot,通过在 WA 量化时
Yang, S., Liu, J., Zhang, R., Pan, M., Guo, Z., Li, X., Chen, Z., Gao, P., Guo, Y., & Zhang, S. (2023). LiDAR-LLM: E
