-
Notifications
You must be signed in to change notification settings - Fork 258
Pull requests: ROCm/composable_kernel
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[CK_TILE] Align FMHA BWD Reference with Kernel Implementation
#3486
opened Dec 24, 2025 by
DDEle
Loading…
7 tasks done
Ensure Consistent 64-Bit Unsigned Handling Across OS Platforms
#3483
opened Dec 23, 2025 by
afagaj
Loading…
7 tasks done
[CK profiler] Perform verification on GPU when using GPU reference
#3482
opened Dec 23, 2025 by
johannes-graner
Loading…
3 of 7 tasks
add double buffering support for gemm abquant
#3481
opened Dec 23, 2025 by
kensclin
Loading…
1 of 7 tasks
Replace grouped conv bwd wei wmmaV3 bilin/scale bf16f32bf16 support with bf16bf16bf16
organization: streamhpc
#3470
opened Dec 19, 2025 by
krithalith
Loading…
7 tasks
[CK_TILE] Refactor
UniversalGemm::MakeGemmTensorViews to separate descriptor and view creation
#3467
opened Dec 19, 2025 by
amd-meskelin
•
Draft
7 tasks
Add support for direct store in epilogue and padding support for wave transfer without transpose
organization: streamhpc
#3465
opened Dec 19, 2025 by
EnricoDeg
Loading…
6 of 7 tasks
[CI, CK examples] Disable time_kernel for CI tests and examples
#3464
opened Dec 19, 2025 by
johannes-graner
Loading…
7 tasks
[CK TILE ENGINE] Enable gfx950 GPU architecture for supported combination
#3463
opened Dec 19, 2025 by
ThruptiRajLakshmanaGowda
•
Draft
1 of 7 tasks
[CK_Tile] Support for group size 128 for Preshuffle quant for 2d block scale gemm
#3462
opened Dec 19, 2025 by
amd-khushbu
Loading…
7 tasks
Grouped convolution backward data WMMA v3 implementation
organization: streamhpc
#3460
opened Dec 19, 2025 by
ApoorvaKalyani
Loading…
4 of 7 tasks
[CK_TILE] Allow
UniversalGemmKernel::RunGemm to be called using tensor descriptors
#3457
opened Dec 18, 2025 by
amd-meskelin
•
Draft
7 tasks
[CKTILE] Support A/B Quantization in Blockscale Grouped Gemm
#3452
opened Dec 18, 2025 by
kyle-256
Loading…
1 of 7 tasks
[CK_Tile] Support for various group sizes Preshuffle quant for 2d block scale gemm
#3445
opened Dec 17, 2025 by
amd-khushbu
Loading…
1 of 7 tasks
[FMHA] Batch Prefill Support Improvements: Change KV Cache Layout & Large Page Size Support
#3442
opened Dec 16, 2025 by
Jeff-Huang
Loading…
7 tasks
Merge group related improvements for convolution operations
organization: streamhpc
#3439
opened Dec 16, 2025 by
zsotakal
Loading…
7 tasks
Adding remaining flavors for grouped conv fwd
#3436
opened Dec 16, 2025 by
wj-laskowski
•
Draft
6 of 7 tasks
Implement device_gemm_universal_preshuffle_instance for RDNA4
organization: streamhpc
#3429
opened Dec 15, 2025 by
yungshengtu
Loading…
6 of 7 tasks
[CK_Builder] [testing] Integrate device random generators
#3427
opened Dec 15, 2025 by
kabrahamAMD
Loading…
Previous Next
ProTip!
Exclude everything labeled
bug with -label:bug.