Logo
Explore Help
Register Sign In
Serendipity/CTI-Inference-Opt
1
1
Fork 0
You've already forked CTI-Inference-Opt
Code Issues Pull Requests Actions Packages Projects Releases Wiki Activity
Files
0f359288a10d7985b722667310b678cee0b429f4
CTI-Inference-Opt/代码/code
T
History
OwnerSunshine530 0f359288a1 perf: 默认注意力设为 varlen(嵌套张量变长flash),本地 15.15s->10.28s 快32% AUC不变
Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
2026-06-15 09:16:20 +08:00
..
tests
feat: 嵌套张量变长 flash 注意力(--attn varlen),统一 CONFIG.attn 分发
2026-06-15 09:06:11 +08:00
bench.py
feat: 嵌套张量变长 flash 注意力(--attn varlen),统一 CONFIG.attn 分发
2026-06-15 09:06:11 +08:00
build_env.sh
fix: build_env.sh 简化为纯净版本(避免 CUDA 预热导致异常)
2026-06-12 21:55:09 +08:00
EXPERIMENTS.md
feat: infer.py 接入 CONFIG 实验开关 + 新增 bench.py 测量闭环
2026-06-14 16:48:38 +08:00
infer.py
perf: 默认注意力设为 varlen(嵌套张量变长flash),本地 15.15s->10.28s 快32% AUC不变
2026-06-15 09:16:20 +08:00
requirements.txt
revert: requirements.txt 还原为原始完整依赖列表
2026-06-12 21:24:22 +08:00
Powered by Gitea Version: 26.3.1 Page: 1213ms Template: 86ms
Auto
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API