The Language Interpretability Tool: Extensible, Interactive Visualizations And Analysis For NLP Models

Ian Tenney et al.. Arxiv 2020 – 54 citations

[Paper] [Code]
Ethics and Bias Fine-Tuning Tools Interpretability and Explainability Reinforcement Learning Has Code Language Modeling

We present the Language Interpretability Tool (LIT), an open-source platform for visualization and understanding of NLP models. We focus on core questions about model behavior: Why did my model make this prediction? When does it perform poorly? What happens under a controlled change in the input? LIT integrates local explanations, aggregate analysis, and counterfactual generation into a streamlined, browser-based interface to enable rapid exploration and error analysis. We include case studies for a diverse set of workflows, including exploring counterfactuals for sentiment analysis, measuring gender bias in coreference systems, and exploring local behavior in text generation. LIT supports a wide range of models–including classification, seq2seq, and structured prediction–and is highly extensible through a declarative, framework-agnostic API. LIT is under active development, with code and full documentation available at https://github.com/pair-code/lit.

The Large Language Model Bible

The Language Interpretability Tool: Extensible, Interactive Visualizations And Analysis For NLP Models

Ian Tenney et al.. Arxiv 2020 – 54 citations

Similar Work