publications

2026

  1. Preprint
    Focus-LIME: Surgical Interpretation of Long-Context Large Language Models via Proxy-Based Neighborhood Selection
    Junhao Liu, Haonan Yu, Zhenyu Yan, and Xin Zhang
    arXiv preprint arXiv:2602.04607, 2026
  2. Preprint
    WASD: Locating Critical Neurons as Sufficient Conditions for Explaining and Controlling LLM Behavior
    Haonan Yu, Junhao Liu, Zhenyu Yan, Haoran Lin, and Xin Zhang
    arXiv preprint arXiv:2603.18474, 2026

2025

  1. AAAI
    ReX: A framework for incorporating temporal information in model-agnostic local explanation techniques
    Junhao Liu and Xin Zhang
    In Proceedings of the AAAI Conference on Artificial Intelligence, 2025
  2. Preprint
    MAnchors: Memorization-Based Acceleration of Anchors via Rule Reuse and Transformation
    Haonan Yu, Junhao Liu, and Xin Zhang
    arXiv preprint arXiv:2502.11068, 2025
  3. Preprint
    Revitalizing Black-Box Interpretability: Actionable Interpretability for LLMs via Proxy Models
    Junhao Liu, Haonan Yu, and Xin Zhang
    arXiv preprint arXiv:2505.12509, 2025

2024

  1. Preprint
    Beyond Attribution: Unified Concept-Level Explanations
    Junhao Liu, Haonan Yu, and Xin Zhang
    arXiv preprint arXiv:2410.12439, 2024