
Introduce the flash attention library via an aten adapter #1034

Open
PanZezhong1725 wants to merge 5 commits into main from issue/1033

Conversation

@PanZezhong1725
Collaborator

No description provided.

@PanZezhong1725
Collaborator Author

PanZezhong1725 commented Feb 28, 2026

Outdated (applies to commit d2aa36d).
Steps to reproduce:

  1. Pull the cutlass and flash_attn sources into the third_party directory (example commands just below)
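A minimal sketch of fetching the sources (the upstream repository URLs are assumptions, not taken from this PR; no specific revisions are pinned):

cd third_party
git clone https://github.com/NVIDIA/cutlass.git cutlass
git clone https://github.com/Dao-AILab/flash-attention.git flash-attention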
  2. Add the following CMakeLists.txt in third_party/flash-attention/csrc/flash_attn, making sure to replace the paths with your own
cmake_minimum_required(VERSION 3.18)
project(flash_attn LANGUAGES CXX CUDA)

set(CMAKE_CXX_STANDARD 17)
set(CMAKE_POSITION_INDEPENDENT_CODE ON)

# Include dirs: flash-attention sources, LibTorch, Python, and CUTLASS (replace with your own paths)
set(TORCH_INCLUDE_DIRS
    /home/panzezhong/Projects/InfiniCore/third_party/flash-attention/csrc/flash_attn/src
    /home/panzezhong/.conda/envs/myenv/lib/python3.13/site-packages/torch/include/torch/csrc/api/include
    /home/panzezhong/.conda/envs/myenv/lib/python3.13/site-packages/torch/include/
    /home/panzezhong/.conda/envs/myenv/include/python3.13/
    /home/panzezhong/Projects/InfiniCore/third_party/cutlass/include
)

# LibTorch libraries
set(TORCH_LIBS
    /home/panzezhong/.conda/envs/myenv/lib/python3.13/site-packages/torch/lib/libtorch.so
    /home/panzezhong/.conda/envs/myenv/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so
    /home/panzezhong/.conda/envs/myenv/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so
    /home/panzezhong/.conda/envs/myenv/lib/python3.13/site-packages/torch/lib/libc10.so
    /home/panzezhong/.conda/envs/myenv/lib/python3.13/site-packages/torch/lib/libc10_cuda.so
    /home/panzezhong/.conda/envs/myenv/lib/python3.13/site-packages/torch/lib/libtorch_python.so
)



# Collect all CUDA source files
file(GLOB FLASH_CU_SOURCES
    "${CMAKE_CURRENT_SOURCE_DIR}/flash_attn/src/*.cu"
)

add_library(flash_attn SHARED
    ${CMAKE_CURRENT_SOURCE_DIR}/flash_attn/flash_api.cpp
    ${FLASH_CU_SOURCES}
)
target_include_directories(flash_attn PRIVATE
    ${TORCH_INCLUDE_DIRS}
    ${CMAKE_CURRENT_SOURCE_DIR}/flash_attn
)

target_link_libraries(flash_attn PRIVATE
    ${TORCH_LIBS}
    /home/panzezhong/.conda/envs/myenv/lib/libpython3.13.so
    /home/panzezhong/.conda/envs/myenv/lib/libpython3.so
)

target_link_options(flash_attn PRIVATE "-Wl,--no-undefined")

set_target_properties(flash_attn PROPERTIES
    CUDA_ARCHITECTURES "80;86;90"
)

target_compile_options(flash_attn PRIVATE
    $<$<COMPILE_LANGUAGE:CUDA>:--expt-relaxed-constexpr>
    $<$<COMPILE_LANGUAGE:CUDA>:--use_fast_math>
)

# Match the C++ ABI used by the libtorch build
target_compile_definitions(flash_attn PRIVATE _GLIBCXX_USE_CXX11_ABI=1)

  3. Build flash_attn (run in the directory containing the CMakeLists.txt added above)
mkdir build
cd build 
cmake .. -DCMAKE_BUILD_TYPE=Release
make -j8

After a successful build, libflash_attn.so is produced in the build directory; the key step is linking it into infinicore.
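Before linking it into infinicore, a quick sanity check on the artifact can help; the commands below are a sketch (the symbol name comes from flash_attn's flash_api.cpp, and exposing the library via LD_LIBRARY_PATH is only one possible approach):

cd build
ldd libflash_attn.so | grep -E 'torch|c10'     # dependencies should resolve against the libtorch listed in TORCH_LIBS
nm -D libflash_attn.so | grep mha_varlen_fwd   # varlen forward entry point exercised by the mha_varlen operator test
export LD_LIBRARY_PATH=$PWD:$LD_LIBRARY_PATH   # let infinicore find libflash_attn.so at runtime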

  4. Build and install infinicore as usual, then run the mha_varlen operator test

@PanZezhong1725
Collaborator Author

PanZezhong1725 commented Mar 5, 2026

Updated the automated build flow (a combined sketch follows the list):

  1. Set the CUTLASS_ROOT environment variable to the cutlass path
  2. At configuration time, enable the --aten switch and set the --flash-attn library location:
    xmake f --nv-gpu=y --ccl=y --cuda=$CUDA_HOME --aten=y --flash-attn=/home/panzezhong/Projects/InfiniCore/third_party/flash-attention -cv
  3. The flash attention library is built and installed together with infinicore_cpp_api
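Putting the steps together, an end-to-end run might look like this (the CUTLASS_ROOT value and the final build/install commands are assumptions based on a plain xmake workflow, not taken from this PR):

export CUTLASS_ROOT=/home/panzezhong/Projects/InfiniCore/third_party/cutlass
xmake f --nv-gpu=y --ccl=y --cuda=$CUDA_HOME --aten=y --flash-attn=/home/panzezhong/Projects/InfiniCore/third_party/flash-attention -cv
xmake           # builds infinicore_cpp_api; with --flash-attn set, the flash attention library is built alongside it
xmake install   # installs both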
