Fix/wan2.1 flash attention #153
Conversation
Pull request overview
Updates AMD inference Dockerfiles to adjust FlashAttention build/install behavior (notably for Wan2.1) and expands the supported ROCm arch list for Mochi.
Changes:
- Replaces the pinned/parameterized FlashAttention wheel build in the Wan2.1 Dockerfile with a direct `setup.py install` from an unpinned ROCm/flash-attention clone.
- Adds `gfx950` to the `PYTORCH_ROCM_ARCH` list in the Mochi inference Dockerfile.
Reviewed changes
Copilot reviewed 2 out of 2 changed files in this pull request and generated 3 comments.
| File | Description |
|---|---|
| docker/pyt_wan2.1_inference.ubuntu.amd.Dockerfile | Changes FlashAttention installation steps for Wan2.1 image builds. |
| docker/pyt_mochi_inference.ubuntu.amd.Dockerfile | Updates the ROCm architecture list used when building FlashAttention. |
docker/pyt_wan2.1_inference.ubuntu.amd.Dockerfile (new FlashAttention install block):

```dockerfile
#ARG BUILD_FA="1"
#ARG FA_BRANCH="v3.0.0.r1-cktile"
#ARG FA_REPO="https://github.com/ROCm/flash-attention.git"
#RUN if [ "$BUILD_FA" = "1" ]; then \
#    cd ${WORKSPACE_DIR} \
#    && pip uninstall -y flash-attention \
#    && rm -rf flash-attention \
#    && git clone ${FA_REPO} \
#    && cd flash-attention \
#    && git checkout ${FA_BRANCH} \
#    && git submodule update --init \
#    && GPU_ARCHS=${HIP_ARCHITECTURES} python3 setup.py bdist_wheel --dist-dir=dist \
#    && pip install dist/*.whl \
#    && python -c "import flash_attn; print(f'Flash Attention version == {flash_attn.__version__}')"; \
#    fi

# install flash attention
ENV FLASH_ATTENTION_TRITON_AMD_ENABLE="TRUE"

RUN git clone https://github.com/ROCm/flash-attention.git &&\
    cd flash-attention &&\
    python setup.py install
```
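Note that the new `RUN` step drops the import check that the commented-out block performed. If that check is still wanted, a minimal sketch (a suggestion, not part of this PR) would be to append it as a separate step so a broken install fails the image build rather than surfacing at runtime:

```dockerfile
# Hypothetical follow-up step: reuse the version check from the old build block to
# verify that flash_attn imports correctly inside the image.
RUN python -c "import flash_attn; print(f'Flash Attention version == {flash_attn.__version__}')"
```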
docker/pyt_mochi_inference.ubuntu.amd.Dockerfile:

```diff
 ARG FA_REPO="https://github.com/Dao-AILab/flash-attention.git"
-ARG PYTORCH_ROCM_ARCH=gfx90a;gfx942;gfx1100;gfx1101;gfx1200;gfx1201
+ARG PYTORCH_ROCM_ARCH=gfx950;gfx90a;gfx942;gfx1100;gfx1101;gfx1200;gfx1201
```
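For context, `PYTORCH_ROCM_ARCH` is the variable PyTorch's extension build machinery consults when choosing ROCm compile targets. The sketch below is an illustration of how the expanded list would typically be wired into the FlashAttention build; the `ENV`/`RUN` lines are an assumption, not taken from the Mochi Dockerfile:

```dockerfile
# Illustrative sketch only: export the ARG so the extension compile sees it,
# then build FlashAttention for every listed architecture, now including gfx950.
ENV PYTORCH_ROCM_ARCH=${PYTORCH_ROCM_ARCH}
RUN git clone ${FA_REPO} flash-attention && \
    cd flash-attention && \
    python setup.py install
```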
The lines the review comment below refers to, in docker/pyt_wan2.1_inference.ubuntu.amd.Dockerfile:

```dockerfile
# install flash attention
ENV FLASH_ATTENTION_TRITON_AMD_ENABLE="TRUE"

RUN git clone https://github.com/ROCm/flash-attention.git &&\
```
Please use the FA_BRANCH & FA_REPO arguments; they are meant to let the image be built from whatever branch is needed via build arguments. The existing branch is the latest tag from flash-attention; is there any specific reason to remove it?
@lcskrishna the args were removed as per the steps mentioned in SWDEV-564747; please also refer to the steps mentioned in this repo - https://github.com/Dao-AILab/flash-attention.
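A possible middle ground, sketched here only as a suggestion (the default branch value is copied from the old commented-out block, not from this PR), would keep the build-arg overrides while using the new direct `setup.py install`:

```dockerfile
# Hypothetical compromise: restore ARG-based overrides around the simpler install path.
ARG FA_REPO="https://github.com/ROCm/flash-attention.git"
ARG FA_BRANCH="v3.0.0.r1-cktile"
RUN git clone ${FA_REPO} && \
    cd flash-attention && \
    git checkout ${FA_BRANCH} && \
    python setup.py install && \
    python -c "import flash_attn; print(f'Flash Attention version == {flash_attn.__version__}')"
```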
Motivation
Updated the Wan2.1 Dockerfile with FlashAttention install steps taken from the ROCm flash-attention repo.
Technical Details
Test Plan
Test Result
Submission Checklist