Files
CTI-Inference-Opt/代码
Serendipity 7e0876c671 revert: RepEncoder 批量 embedding 查表(94.3s vs 92.5s,略慢)
回退到稳定版:FP16 + Flash Attention + inference_mode(57.45 分)
2026-06-13 13:05:14 +08:00
..