publications

Publications

2026

  1. ACL
    budget.jpg
    Revitalizing Black-Box Interpretability: Actionable Interpretability for LLMs via Proxy Models
    Junhao Liu, Haonan Yu, and Xin Zhang
    In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL), 2026
  2. Focus-LIME: Surgical Interpretation of Long-Context Large Language Models via Proxy-Based Neighborhood Selection
    Junhao Liu, Haonan Yu, Zhenyu Yan, and Xin Zhang
    In Proceedings of the 35th International Joint Conference on Artificial Intelligence and the 28th European Conference on Artificial Intelligence (IJCAI-ECAI), 2026
  3. TOPLAS
    loris.jpg
    Guiding LLM-based Loop Invariant Synthesis via Feedback on Local Reasoning Errors
    Tianchi Li, Zhenyu Yan*Junhao Liu*, Peng Di, and Xin Zhang
    ACM Transactions on Programming Languages and Systems, 2026
  4. MAnchors: Memorization-Based Acceleration of Anchors via Rule Reuse and Transformation
    Haonan Yu, Junhao Liu, and Xin Zhang
    In Proceedings of the 43rd International Conference on Machine Learning (ICML), 2026

2025

  1. AAAI
    ReX.jpg
    ReX: A framework for incorporating temporal information in model-agnostic local explanation techniques
    Junhao Liu, and Xin Zhang
    In Proceedings of the AAAI Conference on Artificial Intelligence, 2025

Preprints

2026

  1. Preprint
    wasd.jpg
    WASD: Locating Critical Neurons as Sufficient Conditions for Explaining and Controlling LLM Behavior
    Haonan Yu, Junhao Liu, Zhenyu Yan, Haoran Lin, and Xin Zhang
    arXiv preprint arXiv:2603.18474, 2026

2024

  1. Preprint
    conlux.jpg
    Beyond Attribution: Unified Concept-Level Explanations
    Junhao Liu, Haonan Yu, and Xin Zhang
    arXiv preprint arXiv:2410.12439, 2024