microsoft-SkillOpt

mirror of https://github.com/microsoft/SkillOpt.git synced 2026-07-03 14:02:58 +08:00

Author	SHA1	Message	Date
Yifan Yang	2ca2910649	docs: align API reference and Add-a-Benchmark guide with real EnvAdapter ABC docs/reference/api.md previously documented a fictional EnvAdapter API (execute / evaluate / build_prompt + DataItem / TaskResult) and a BENCHMARK_REGISTRY that never existed in code. Anyone following the documented contract would hit ImportError or TypeError on the first instantiation. Replace both pages with the real shape from skillopt/envs/base.py and skillopt/datasets/base.py: - EnvAdapter: build_train_env, build_eval_env, rollout, reflect, get_task_types (the 5 actual abstract methods). - Rollout dicts: id / hard / soft required; everything else preserved into RolloutResult.extras. - Reflect dicts: {patch, source_type} schema as consumed by run_minibatch_reflect. - BatchSpec: slotted-but-mutable dataclass matching the actual definition (payload defaults to None, metadata to dict()). - SplitDataLoader.load_split_items as the one mandatory loader method. - Registry: _ENV_REGISTRY in scripts/train.py (lazy try/except ImportError block), not a non-existent BENCHMARK_REGISTRY in skillopt/envs/__init__.py. - _base_: documented as a string path, since the current YAML loader only accepts strings. The new-benchmark.md guide now walks through a docfaithful worked example with a real rollout helper (chat_target + scorer) instead of hand-waving over the rollout step. Refs microsoft/SkillOpt#30. Co-Authored-By: Claude Opus 4 <noreply@anthropic.com>	2026-06-01 20:14:54 +00:00
kaikai-macbook	41012e2d5e	Support Qwen chat as optimizer backend	2026-06-01 16:44:49 +08:00
yongjin	657b987de6	docs: add local environment smoke test guide	2026-05-29 09:26:38 +09:00
Cuzyoung	4a1b984d87	refactor: rename teacher/student to optimizer/target, remove best skills, fix slow update - Rename teacher -> optimizer, student -> target across all code, configs, docs, prompts - CLI: --teacher_model -> --optimizer_model, --student_model -> --target_model - Remove best_skill files, keep only initial skills - Fix slow update gate (force write into skill) - Fix SLOW_UPDATE marker stripping - Remove deep_reflect and meta_reflect mechanisms - Update .env.example with export prefix and azure_cli docs - Add endpoint empty validation in azure_openai.py Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-05-24 19:15:10 +00:00
CharlesYang030	244e346b83	SkillOpt v0.1.0: initial release - Skill optimization framework with training loop analogy - 11 benchmarks, 4 model backends (Azure OpenAI, Claude, Codex, Qwen) - WebUI for browser-based training control - Pluggable architecture for extending benchmarks and backends	2026-05-21 17:22:04 +00:00

5 Commits