The Language Interpretability Tool: Extensible, Interactive Visualizations And Analysis For NLP Models · The Large Language Model Bible Contribute to LLM-Bible

The Language Interpretability Tool: Extensible, Interactive Visualizations And Analysis For NLP Models

Tenney Ian, Wexler James, Bastings Jasmijn, Bolukbasi Tolga, Coenen Andy, Gehrmann Sebastian, Jiang Ellen, Pushkarna Mahima, Radebaugh Carey, Reif Emily, Yuan Ann. Arxiv 2020

[Paper] [Code]    
Applications Ethics And Bias Fine Tuning Has Code Interpretability And Explainability Language Modeling Reinforcement Learning Tools

We present the Language Interpretability Tool (LIT), an open-source platform for visualization and understanding of NLP models. We focus on core questions about model behavior: Why did my model make this prediction? When does it perform poorly? What happens under a controlled change in the input? LIT integrates local explanations, aggregate analysis, and counterfactual generation into a streamlined, browser-based interface to enable rapid exploration and error analysis. We include case studies for a diverse set of workflows, including exploring counterfactuals for sentiment analysis, measuring gender bias in coreference systems, and exploring local behavior in text generation. LIT supports a wide range of models–including classification, seq2seq, and structured prediction–and is highly extensible through a declarative, framework-agnostic API. LIT is under active development, with code and full documentation available at https://github.com/pair-code/lit.

Similar Work