arxiv:2406.10209

Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs

Published on Jun 14 · Submitted by ahans1 on Jun 17


Authors: Abhimanyu Hans, Neel Jain

Abstract

Large language models can memorize and repeat their training data, causing privacy and copyright risks. To mitigate memorization, we introduce a subtle modification to the next-token training objective that we call the goldfish loss. During training, a randomly sampled subset of tokens is excluded from the loss computation. The model does not memorize these dropped tokens, which prevents verbatim reproduction of a complete chain of tokens from the training set. We run extensive experiments training billion-scale Llama-2 models, both pre-trained and trained from scratch, and demonstrate significant reductions in extractable memorization with little to no impact on downstream benchmarks.
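The idea of excluding a random subset of tokens from the loss can be illustrated with a minimal, dependency-free sketch. This is not the authors' implementation: the function names, the `drop_prob` parameter, and the toy logits are all assumptions made for illustration. It only shows that dropped positions contribute nothing to the averaged next-token cross-entropy.

```python
import math
import random

def log_softmax(logits):
    """Numerically stable log-softmax over one position's vocabulary logits."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_z for x in logits]

def sample_keep_mask(num_tokens, drop_prob, rng):
    """One bool per token: True means the token contributes to the loss.
    `drop_prob` is a hypothetical knob, not a value taken from the paper."""
    return [rng.random() >= drop_prob for _ in range(num_tokens)]

def masked_next_token_loss(logits, targets, keep_mask):
    """Mean next-token cross-entropy over kept positions only.

    logits:    list of per-position vocabulary logits
    targets:   list of target token ids, one per position
    keep_mask: list of bools; dropped (False) positions add nothing
    """
    total, kept = 0.0, 0
    for pos_logits, target, keep in zip(logits, targets, keep_mask):
        if keep:
            total -= log_softmax(pos_logits)[target]
            kept += 1
    return total / kept if kept else 0.0

# Toy example: 4 positions, vocabulary of 3 tokens (made-up numbers).
rng = random.Random(0)
logits = [[2.0, 0.5, -1.0], [0.1, 0.2, 0.3], [1.5, -0.5, 0.0], [0.0, 0.0, 0.0]]
targets = [0, 2, 1, 1]
keep_mask = sample_keep_mask(len(targets), drop_prob=0.25, rng=rng)
loss = masked_next_token_loss(logits, targets, keep_mask)
```

With an all-`True` mask this reduces to the standard next-token objective, so the change is confined to which positions are averaged. In a deep learning framework the same effect would typically come from masking the per-token loss before reduction.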


Community

ahans1 (paper author, paper submitter), 7 days ago

1. Do next-token prediction.
2. Drop pseudorandom tokens from your loss computation.
3. ????
4. Profit, i.e., mitigate training-data regurgitation.
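The comment says the dropped tokens are pseudorandom but does not say how they are chosen. One plausible way to get a deterministic, repeatable choice is to hash a short window of preceding token ids, so that the same passage drops the same positions wherever it appears. The sketch below assumes that scheme; `hashed_keep_mask`, `k`, and `context_width` are hypothetical names and values, not details taken from this page.

```python
import hashlib

def hashed_keep_mask(token_ids, k=4, context_width=3):
    """Return one bool per token: True = keep in the loss, False = drop.

    A position is dropped when the hash of the `context_width` token ids
    ending at that position is divisible by k (roughly 1 in k positions).
    Both parameters are illustrative assumptions.
    """
    mask = []
    for i in range(len(token_ids)):
        if i + 1 < context_width:
            mask.append(True)  # not enough context yet: always keep
            continue
        window = token_ids[i + 1 - context_width : i + 1]
        digest = hashlib.sha256(",".join(map(str, window)).encode()).digest()
        mask.append(int.from_bytes(digest[:8], "big") % k != 0)
    return mask

# The same passage embedded in two different documents (made-up token ids).
passage = [5, 6, 7, 8, 9, 10]
mask_a = hashed_keep_mask([1, 2] + passage)
mask_b = hashed_keep_mask([3] + passage + [4])
```

Because the decision depends only on local context, positions inside the shared passage that have a full window receive identical decisions in both documents. A mask like this would then be applied to the per-token loss so that dropped positions are never trained on.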

