This website requires JavaScript.
Explore
Help
Register
Sign In
Serendipity
/
CTI-Inference-Opt
Watch
1
Star
1
Fork
0
You've already forked CTI-Inference-Opt
Code
Issues
Pull Requests
Actions
Packages
Projects
Releases
Wiki
Activity
Files
e69ba714e54b28b51279d77515a713e8a29e3af9
CTI-Inference-Opt
/
代码
/
code
T
History
Serendipity
e69ba714e5
revert: 移除 2:4 稀疏(PCOC 2.067 + 耗时反增 265s,to_sparse_semi_structured 与 nn.Linear 不兼容)
...
回退到稳定版:FP16 + Flash Attention + inference_mode(57.45 分)
2026-06-13 12:34:29 +08:00
..
build_env.sh
fix: build_env.sh 简化为纯净版本(避免 CUDA 预热导致异常)
2026-06-12 21:55:09 +08:00
infer.py
revert: 移除 2:4 稀疏(PCOC 2.067 + 耗时反增 265s,to_sparse_semi_structured 与 nn.Linear 不兼容)
2026-06-13 12:34:29 +08:00
requirements.txt
revert: requirements.txt 还原为原始完整依赖列表
2026-06-12 21:24:22 +08:00