Announcement_1
A new paper is available on arXiv: Revitalizing Black-Box Interpretability: Actionable Interpretability for LLMs via Proxy Models.
A new paper is available on arXiv: Revitalizing Black-Box Interpretability: Actionable Interpretability for LLMs via Proxy Models.