An example script for running inference with FlashAttention is coming soon.