GitHub - terarachang/MemPi: localize a memorized sequence in LLMs (NAACL 2024)

Python PyTorch transformers GPU

Do Localization Methods Actually Localize Memorized Data in LLMs?

A Tale of Two Benchmarks (NAACL 2024)

Ting-Yun Chang, Jesse Thomason, and Robin Jia
🎞️ https://www.youtube.com/watch?v=V2i8CemZZHQ

📜 https://arxiv.org/abs/2311.09060

Content

INJ Benchmark

Data

Information Injection

$ bash script/ecbd/inject.sh MODEL

Run Localization Methods

$ bash script/ecbd/METHOD_NAME.sh MODEL

DEL Benchmark

Data

Find data memorized by Pythia models from the Pile-dedup

Data memorized by GPT2-XL

Pretrained sequences for perplexity

Run Localization Methods

$ bash script/pile/METHOD_NAME.sh MODEL

$ bash script/manual/METHOD_NAME.sh
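
In the commands above, `METHOD_NAME` and `MODEL` are placeholders. As a minimal sketch for discovering valid `METHOD_NAME` values, the helper below lists the `.sh` files in one of the script directories. It assumes each localization method has its own script there, as the `METHOD_NAME.sh` pattern suggests. The function name `list_method_scripts` and the default exclusion of `inject` (the injection script under `script/ecbd`) are illustrative choices and not part of the repository.

```python
from pathlib import Path


def list_method_scripts(script_dir, exclude=("inject",)):
    """Return sorted script names (without .sh) found directly in script_dir.

    Hypothetical helper: it only inspects file names and assumes that every
    remaining .sh file corresponds to one localization method.
    """
    directory = Path(script_dir)
    if not directory.is_dir():
        # Missing directory (e.g. not run from the repository root).
        return []
    names = (path.stem for path in directory.glob("*.sh"))
    return sorted(name for name in names if name not in exclude)


if __name__ == "__main__":
    # Directories taken from the commands in this README.
    for script_dir in ("script/ecbd", "script/pile", "script/manual"):
        print(script_dir, list_method_scripts(script_dir))
```

Run it from the repository root, then substitute one of the printed names for `METHOD_NAME` in the matching `bash` command.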