fix(reflect): support continuous reward scores in failure filtering

not r.get("hard") treats non-zero floats as success.
Add explicit float threshold check (< 1e-9).
Backward compatible with binary hard=0/1.
This commit is contained in:
zq
2026-05-29 19:04:42 +08:00
parent afb552008b
commit a62ec857f1

File diff suppressed because it is too large Load Diff