Files
microsoft-SkillOpt/skillopt
zq afb552008b fix(trainer): support continuous reward scores in bucket aggregation
int() truncates any float in [0,1) to 0. Replace with float().
Also fix falsy float check in failure detection.
Backward compatible with binary hard=0/1.
2026-05-29 19:03:52 +08:00
..
2026-05-21 17:22:04 +00:00
2026-05-21 17:22:04 +00:00