Publications & Preprints

publications by categories in reversed chronological order.

2026

  1. arXiv’26
    mobilekernel.png
    MobileKernelBench: Can LLMs Write Efficient Kernels for Mobile Devices?
    Xingze Zou*, Jing Wang*, Yuhua Zheng, and 8 more authors
    arXiv preprint arXiv:2602.11715, 2026
  2. arXiv’26
    dice.png
    DICE: Diffusion Large Language Models Excel at Generating CUDA Kernels
    Haolei Bai, Lingcheng Kong, Xueyi Chen, and 3 more authors
    arXiv preprint arXiv:2602.11715, 2026
  3. CPAL’26
    erc-svd.png
    ERC-SVD: Error-Controlled SVD for Large Language Model Compression
    Haolei Bai, Siyong Jian, Tuo Liang, and 2 more authors
    In Third Conference on Parsimony and Learning, 2026