Lev McKinney committed on
Commit d115284
1 Parent(s): e1b6e5d

App now runs by default and has better documentation

Files changed (1)
  1. app.py +18 -5
app.py CHANGED
@@ -10,13 +10,14 @@ print(f"Using device {device} for inference")
10
  model = AutoModelForCausalLM.from_pretrained("EleutherAI/pythia-410m-deduped")
11
  model = model.to(device)
12
  tokenizer = AutoTokenizer.from_pretrained("EleutherAI/pythia-410m-deduped")
13
- tuned_lens = TunedLens.load("lens/pythia-410m-deduped", map_location=device)
14
  logit_lens = LogitLens(model)
15
 
16
  lens_options_dict = {
17
  "Tuned Lens": tuned_lens,
18
  "Logit Lens": logit_lens,
19
  }
 
20
  statistic_options_dict = {
21
  "Entropy": "entropy",
22
  "Cross Entropy": "ce",
@@ -42,6 +43,12 @@ def make_plot(lens, text, statistic, token_cutoff):
42
  start_pos=max(len(input_ids[0]) - token_cutoff, 0),
43
  statistic=statistic_options_dict[statistic],
44
  )
 
 
 
 
 
 
45
 
46
  return fig
47
 
@@ -54,10 +61,13 @@ A tuned lens allows us to peak at the iterative computations a transformer uses
54
  A lens into a transformer with n layers allows you to replace the last $m$ layers of the model with an [affine transformation](https://pytorch.org/docs/stable/generated/torch.nn.Linear.html) (we call these affine translators).
55
 
56
  This essentially skips over these last few layers and lets you see the best prediction that can be made from the model's representations, i.e. the residual stream, at layer $n - m$. Since the representations may be rotated, shifted, or stretched from layer to layer it's useful to train the len's affine adapters specifically on each layer. This training is what differentiates this method from simpler approaches that decode the residual stream of the network directly using the unembeding layer i.e. the logit lens. We explain this process in [the paper](https://arxiv.org/abs/2303.08112).
57
- """
58
 
59
 
60
- with gr.Blocks() as iface:
 
 
 
 
61
  gr.Markdown(preamble)
62
  with gr.Column():
63
  text = gr.Textbox(
@@ -74,10 +84,13 @@ with gr.Blocks() as iface:
74
  label="Select Statistic",
75
  )
76
  token_cutoff = gr.Slider(
77
- maximum=20, minimum=2, value=10, step=1, label="Token Cut Off"
78
  )
79
  examine_btn = gr.Button(value="Submit")
80
  plot = gr.Plot()
81
  examine_btn.click(make_plot, [lens_options, text, statistic, token_cutoff], plot)
 
 
 
 
82
 
83
- iface.launch()
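The preamble above describes the core idea: take the residual stream at layer $n - m$ and decode it, either directly through the unembedding (the logit lens) or through a per-layer learned affine translator first (the tuned lens). Below is a minimal sketch of that distinction using stand-in tensors; none of these names come from the tuned_lens package.

import torch

# Hypothetical stand-ins: a residual-stream state h at layer n - m, the model's
# final layer norm and unembedding, and one trained affine translator for that layer.
d_model, vocab_size = 512, 50304
h = torch.randn(d_model)                          # residual stream at layer n - m
ln_f = torch.nn.LayerNorm(d_model)                # final layer norm
unembed = torch.nn.Linear(d_model, vocab_size)    # unembedding layer
translator = torch.nn.Linear(d_model, d_model)    # learned affine translator

logit_lens_logits = unembed(ln_f(h))              # decode the raw state directly
tuned_lens_logits = unembed(ln_f(translator(h)))  # translate first, then decode

Per the paper linked in the preamble, each translator is trained so that its layer's prediction matches the model's final output distribution; the TunedLens and LogitLens objects loaded above package this up for every layer.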
 
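The new Usage section names three summary statistics. A small sketch of how each can be computed for a single token position, using hypothetical logits rather than the app's variables; the KL direction follows the wording above (model's prediction against the lens' prediction).

import torch
import torch.nn.functional as F

# Hypothetical logits for one position: the lens prediction at some layer, the
# model's own final-layer prediction, and the id of the true next token.
lens_logits = torch.randn(50304)
final_logits = torch.randn(50304)
target = torch.tensor(42)

log_p = F.log_softmax(lens_logits, dim=-1)   # lens distribution p
log_q = F.log_softmax(final_logits, dim=-1)  # model distribution q

entropy = -(log_p.exp() * log_p).sum()       # H(p)
cross_entropy = -log_p[target]               # -log p(target token)
kl = (log_q.exp() * (log_q - log_p)).sum()   # KL(q || p)

The app plots one such statistic per layer and per token as a heatmap, which is why the make_plot change in this commit also updates the heatmap trace's colorscale.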