Pull requests: NVIDIA/cutlass

Pull requests list

use compiler macro to imporve the compatibility
#3008 opened Feb 6, 2026 by reed-lau
[CuTeDSL] Add sub_packed_f32x2 operation
#3004 opened Feb 4, 2026 by tridao
Resolve build warnings in C++20
#2998 opened Feb 3, 2026 by Algy
add: add comments to help understand
#2993 opened Feb 2, 2026 by meiniangpp416
Fix/nvfp4 tensor init
#2989 opened Jan 28, 2026 by michael604work
Fix mixed_input_fmha_decode example
#2986 opened Jan 26, 2026 by anakinxc
[CuTeDSL]fix tvm-ffi path in from_dlpack
#2971 opened Jan 22, 2026 by rsmallblue
Update profiler.md with how to use generator.py
#2943 opened Jan 10, 2026 by aidando73
feat(examples/test_run): use runtime sm arch
#2916 opened Dec 31, 2025 by tpoisonooo
Fix finding cuDNN
#2890 opened Dec 19, 2025 by TLescoatTFX
Remove redundant "from" from comment (label: inactive-30d)
#2853 opened Dec 8, 2025 by crcrpar
Remove deprecated newshape argument. (label: inactive-30d)
#2844 opened Dec 4, 2025 by Artem-B