GitHub - explanare/char-iit: A causal intervention framework to learn robust and interpretable character representations inside subword-based language models (original) (raw)

Skip to content

Sign in

Appearance settings

Provide feedback

We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Saved searches

Use saved searches to filter your results more quickly

Sign in

Sign up

Appearance settings

explanare / char-iit Public

A causal intervention framework to learn robust and interpretable character representations inside subword-based language models

arxiv.org/abs/2212.09897

License

MIT license

1 star 0 forks Branches Tags Activity

Star

Notifications You must be signed in to change notification settings

Additional navigation options

BranchesTags

Folders and files

Name Name Last commit message Last commit date
Latest commitHistory6 Commits
LICENSE LICENSE
README.md README.md
char_iit.ipynb char_iit.ipynb
data.tgz data.tgz
tokenizers.tgz tokenizers.tgz

Repository files navigation

Inducing Character-level Structure in Subword-based Language Models with Type-level Interchange Intervention Training

This repository contains dataset and models for ACL 2023 Findings paper: Inducing Character-level Structure in Subword-based Language Models with Type-level Interchange Intervention Training.

Code to reproduce results in the paper is also available in Colab.

About

A causal intervention framework to learn robust and interpretable character representations inside subword-based language models

arxiv.org/abs/2212.09897

Topics

subword interpretability character-level-language-model causal-intervention

Resources

Readme

License

MIT license

Activity

Stars

1 star

Watchers

1 watching

Forks

0 forks

Report repository

Languages