publications

Publications

2025

  1. ICLR 2025
    QERA: an Analytical Framework for Quantization Error Reconstruction
    Cheng Zhang, Jeffrey TH Wong, Can Xiao, and 2 more authors
    The Twelfth International Conference on Learning Representations, 2025
  2. A3: an Analytical Low-Rank Approximation Framework for Attention
    Jeffrey TH Wong, Cheng Zhang, Xinye Cao, and 4 more authors
    arXiv preprint arXiv:2505.12942, 2025