Serendipity/CTI-Inference-Opt
a358dfd0a3f1f97cebf847ab79a03047ddd94e6d
CTI-Inference-Opt/代码/code
OwnerSunshine530 a358dfd0a3 perf: enable dedup_embedding by default; local run 7.80 -> 6.49 s (17% faster), AUC identical digit for digit
Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
2026-06-15 14:21:45 +08:00
tests
feat: chunked SDPA attention (--attn chunked), splitting at user boundaries to cut the O(S²) cost
2026-06-15 13:13:13 +08:00
bench.py
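feat: dedup_embedding option: deduplicate signs before the table lookup (highly repetitive slots such as slot19) to reduce random memory access into large tables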
2026-06-15 14:07:23 +08:00
build_env.sh
fix: simplify build_env.sh to a clean version (avoids errors caused by CUDA warm-up)
2026-06-12 21:55:09 +08:00
EXPERIMENTS.md
feat: wire CONFIG experiment switches into infer.py; add bench.py to close the measurement loop
2026-06-14 16:48:38 +08:00
infer.py
perf: enable dedup_embedding by default; local run 7.80 -> 6.49 s (17% faster), AUC identical digit for digit
2026-06-15 14:21:45 +08:00
requirements.txt
revert: restore requirements.txt to the original full dependency list
2026-06-12 21:24:22 +08:00
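The dedup_embedding commits above describe deduplicating signs before the embedding table lookup. The code in infer.py is not shown on this page, so the following is only a minimal pure-Python sketch of the general idea: fetch each distinct sign once, then scatter the rows back to their original positions. The names `dedup_lookup`, `fetch_row`, and `table`, and the toy values, are hypothetical; a tensor-based implementation would more likely use a unique-with-inverse-index operation such as `torch.unique(..., return_inverse=True)`.

```python
def dedup_lookup(signs, fetch_row):
    """Return one embedding row per sign, fetching each distinct sign only once."""
    # Distinct signs in first-seen order (dict keys preserve insertion order).
    unique_signs = list(dict.fromkeys(signs))
    # One table access per distinct sign instead of one per occurrence.
    rows = {sign: fetch_row(sign) for sign in unique_signs}
    # Scatter the fetched rows back to the original positions.
    return [rows[sign] for sign in signs]


# Hypothetical toy table: sign -> embedding row.
table = {101: [0.1, 0.2], 202: [0.3, 0.4], 303: [0.5, 0.6]}

fetched = []  # records every table access made through fetch_row


def fetch_row(sign):
    fetched.append(sign)
    return table[sign]


signs = [101, 202, 101, 101, 303, 202]  # repetitive input, as in a hot slot
deduped = dedup_lookup(signs, fetch_row)
naive = [table[sign] for sign in signs]  # reference: one lookup per occurrence
```

Because the deduplicated path returns the same rows in the same order as the naive path, model outputs should not change, which is consistent with the commit's note that AUC stayed identical. Here six input signs trigger only three table accesses.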