# Flash-Attention 2.7.4 Prebuilt Wheels for NVIDIA Blackwell (cu128/cu129) on Windows

This repository provides prebuilt wheels for **Flash-Attention 2.7.4** optimized for NVIDIA Blackwell GPUs (cu128 and cu129) on Windows systems.

These wheels are built for Python 3.10 and 3.11, so Flash-Attention's high-performance attention kernels can be used in deep learning workflows on Windows without compiling from source.
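
If you are not sure which wheel tag your interpreter needs, a minimal check is sketched below (it assumes the `packaging` package is available; pip vendors it, but a bare environment may need `pip install packaging` first):

```python
# Prints the most specific wheel tag accepted by this interpreter,
# e.g. "cp310-cp310-win_amd64" for Python 3.10 on 64-bit Windows.
from packaging.tags import sys_tags

print(next(iter(sys_tags())))
```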

## Available Wheels

- `flash_attn-2.7.4.post1-cp310-cp310-win_amd64.whl` (Python 3.10, PyTorch 2.7, cu128)
- `flash_attn-2.7.4.post1-cp311-cp311-win_amd64.whl` (Python 3.11, PyTorch 2.7, cu128)
- `flash_attn-2.7.4.post1-cp310-cp310-win_amd64.whl` (Python 3.10, PyTorch 2.8, cu129)

Note that the cu128 and cu129 builds for Python 3.10 share the same filename, so double-check which build you are downloading.
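
Once you have picked the matching file, install it with `pip install flash_attn-2.7.4.post1-cp310-cp310-win_amd64.whl` (substituting the wheel you downloaded), then run the sanity check below. This is a minimal sketch: it assumes a CUDA-enabled PyTorch build matching the wheel's CUDA version is already installed, since `flash_attn` imports `torch`.

```python
# Quick sanity check: the wheel imports and PyTorch can see a CUDA GPU.
import torch
import flash_attn

print("flash-attn version:", flash_attn.__version__)
print("CUDA available:", torch.cuda.is_available())
if torch.cuda.is_available():
    # Consumer Blackwell (RTX 50 series) reports compute capability (12, 0);
    # older supported generations report lower values.
    print("Compute capability:", torch.cuda.get_device_capability(0))
```

If the import succeeds and a device is reported, the wheel's ABI matches your interpreter and the CUDA runtime is visible; a mismatch (wrong Python version or a PyTorch build without CUDA) fails at this step rather than deep inside a training run.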

## Compatibility

The prebuilt wheels are designed for NVIDIA Blackwell GPUs but have been tested and confirmed compatible with previous-generation NVIDIA GPUs, including: