wi-lab committed
Commit d75b6d0 · verified · 1 Parent(s): a9c2596

Update app.py

Files changed (1)
  1. app.py +1 -1
app.py CHANGED
@@ -692,7 +692,7 @@ with gr.Blocks(css="""
     ```
     """)
     with gr.Tab("LWM Model and Framework"):
-        gr.Image("images/lwm.PNG")
+        gr.Image("images/lwm_model_v2.png")
         gr.Markdown("This figure depicts the offline pre-training and online embedding generation process for LWM. The channel is divided into fixed-size patches, which are linearly embedded and combined with positional encodings before being passed through a Transformer encoder. During self-supervised pre-training, some embeddings are masked, and LWM leverages self-attention to extract deep features, allowing the decoder to reconstruct the masked values. For downstream tasks, the generated LWM embeddings enhance performance. The right block shows the LWM architecture, inspired by the original Transformer introduced in the [**Attention Is All You Need**](https://arxiv.org/abs/1706.03762) paper.")
 
 # Launch the app
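For context, the description in the diff above outlines LWM's masked-patch pre-training: patchify the channel, linearly embed, add positional encodings, mask some patch embeddings, encode with self-attention, and reconstruct the masked values. Below is a minimal, illustrative PyTorch sketch of that flow. It is not the LWM implementation; every module, dimension, and name here is an assumption chosen for readability.

```python
import torch
import torch.nn as nn

class MaskedChannelAutoencoder(nn.Module):
    """Illustrative stand-in for the masked-patch pre-training described above (not LWM's code)."""
    def __init__(self, patch_dim=32, d_model=64, n_heads=4, n_layers=2, n_patches=16):
        super().__init__()
        self.embed = nn.Linear(patch_dim, d_model)                   # linear patch embedding
        self.pos = nn.Parameter(torch.zeros(1, n_patches, d_model))  # learned positional encoding (assumption)
        self.mask_token = nn.Parameter(torch.zeros(1, 1, d_model))   # placeholder fed in for masked patches
        layer = nn.TransformerEncoderLayer(d_model, n_heads, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, n_layers)        # self-attention over all patches
        self.decoder = nn.Linear(d_model, patch_dim)                 # reconstructs raw patch values

    def forward(self, patches, mask):
        # patches: (batch, n_patches, patch_dim); mask: (batch, n_patches) bool, True = masked
        x = self.embed(patches) + self.pos
        x = torch.where(mask.unsqueeze(-1), self.mask_token.expand_as(x), x)
        return self.decoder(self.encoder(x))

# Toy usage: mask roughly 25% of patches and reconstruct only those.
model = MaskedChannelAutoencoder()
patches = torch.randn(8, 16, 32)               # (batch, n_patches, patch_dim)
mask = torch.rand(8, 16) < 0.25
recon = model(patches, mask)
loss = ((recon - patches)[mask] ** 2).mean()   # MSE computed on masked patches only
```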