microsoft-SkillOpt/skillopt at ffe581098bf35d5f2c539c417c8fc6c41c4d69a8 - microsoft-SkillOpt - 网新新思

github/microsoft-SkillOpt

mirror of https://github.com/microsoft/SkillOpt.git synced 2026-07-03 14:02:58 +08:00

Files

History

Cuzyoung ffe581098b feat(trainer): final-skill val + best promotion; keep best unpolluted by slow_update

- slow_update force-inject now writes current_skill ONLY (best_skill stays a
  faithful val-best snapshot, never receives un-validated slow_update content)
- after training, run one val on the final skill; if its gate score beats the
  incumbent best, promote final to best (updates best_skill/best_step/best_origin)
- trainer now evaluates final skill on test itself (reuses best test result when
  final==best); records final_selection_* and final_test_* in summary.json
- spreadsheetbench: head+tail truncate the post-execution verification report at
  source to fix multi-MB conversation bloat

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

2026-06-10 13:03:17 +00:00

..

SkillOpt v0.1.0: initial release

2026-05-21 17:22:04 +00:00

feat(trainer): final-skill val + best promotion; keep best unpolluted by slow_update

2026-06-10 13:03:17 +00:00

feat(trainer): final-skill val + best promotion; keep best unpolluted by slow_update

2026-06-10 13:03:17 +00:00

Add configurable gate metric (hard / soft / mixed) for skill validation

2026-05-30 14:45:27 +08:00

fix(spreadsheetbench)+optimizer: fix verify-feedback bloat, drop optimizer-side truncation, soft-disable gate

2026-06-10 13:03:17 +00:00

fix(model): forward Qwen timeout and only set enable_thinking when true

2026-06-07 07:41:35 -07:00

fix(spreadsheetbench)+optimizer: fix verify-feedback bloat, drop optimizer-side truncation, soft-disable gate

2026-06-10 13:03:17 +00:00

cleanup: remove unused benchmarks, deep_probe, meta_reflect

2026-05-24 19:36:48 +00:00

SkillOpt v0.1.0: initial release

2026-05-21 17:22:04 +00:00

fix(scoring): use float() instead of int() for continuous reward scores

2026-05-30 07:47:41 +08:00

__init__.py

refactor: rename teacher/student to optimizer/target, remove best skills, fix slow update

2026-05-24 19:15:10 +00:00

config.py

fix(spreadsheetbench)+optimizer: fix verify-feedback bloat, drop optimizer-side truncation, soft-disable gate

2026-06-10 13:03:17 +00:00

types.py

refactor: rename teacher/student to optimizer/target, remove best skills, fix slow update

2026-05-24 19:15:10 +00:00