CTI-Inference-Opt/代码
OwnerSunshine530 1083aca9fa feat: make Triton BLOCK_M tunable (triton_block_m, default 64); bench --triton-bm sweeps it
Breakthrough: Triton eval 39.92s/69.72 (vs chunked 47.84/67.998). Keep tuning BLOCK_M to squeeze out more.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
2026-06-17 13:01:50 +08:00