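revert: disable true-sparse MoE by default; net negative in evaluation (lat 34.64->37.64; faster locally but slower in evaluation, like varlen; capacity dropping also lowers AUC). Back to dense/70.96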

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
OwnerSunshine530
2026-06-17 21:36:23 +08:00
parent 2cf7f185fc
commit 6278d4a050
+4 -2
@@ -144,8 +144,10 @@ CONFIG = {
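  # Must be verified by submission. AUC-neutral, and MoE accounts for only 2% of compute, so the risk is very low.
  "vectorize_moe": True, # True = dense vectorized MoE (no sync points); False = original per-expert loop (.nonzero sync)
  "moe_baddbmm": True, # MoE FFN uses baddbmm (cutlass GEMM + fused bias epilogue), saving the bias-add kernel
- "moe_sparse": True, # true-sparse MoE (compute top-k only, capacity-based grouping); local 4.77->4.05s (-15%); slight AUC drop, harmless
- "moe_capacity": 2.0, # per-expert capacity = ceil(Nk/E*factor); cap=2.0 gives PCOC 1.105, within range (1.25 blows up to 1.418)
+ # True-sparse MoE measured net negative in evaluation: lat 34.64->37.64s (15% faster locally, but argsort/scatter overhead is amplified in evaluation, as with varlen)
+ # Capacity dropping also lowers AUC (0.7525->0.7507). Reverted to dense.
+ "moe_sparse": False, # True = true-sparse MoE (net negative in evaluation; do not enable)
+ "moe_capacity": 2.0,
  "skip_moe_loss": True, # inference skips moe_loss (load-balance, unused at inference), saving the importance/std/mean kernels
  # PCOC calibration: local fit gives -0.1067 (local PCOC 1.109), but evaluation PCOC is stable at 1.059; scaling by slope, the evaluation optimum is ≈ -0.059.
  "logit_bias": -0.06, # add a constant offset to the logit so evaluation PCOC -> ~1.0 (monotonic, AUC unchanged, a free ~+0.33 points)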