microsoft-SkillOpt

mirror of https://github.com/microsoft/SkillOpt.git synced 2026-07-03 14:02:58 +08:00

Author	SHA1	Message	Date
CharlesYang030	e4ea6a6771	chore(release): v0.2.0 Highlights since v0.1.0: - feat: SkillOpt-Sleep engine — nightly offline self-evolution (harvest -> mine -> replay -> consolidate behind a validation gate), with multi-objective reward, experience replay + dream rollouts, slow-update long-term memory, and secret redaction in cycle diagnostics. Shipped as the `skillopt-sleep` CLI. - feat: cross-tool backends & plugin shells — Claude, Codex (+Desktop harvest), Copilot, Devin, and OpenClaw. - feat: SearchQA split materialization + rollout fail-fast. - fix: Windows robustness for claude/codex backends, hardened JSON fallback, Qwen timeout/thinking gating, Codex failure surfacing. Packaging: - Bump pyproject / skillopt / skillopt_sleep to 0.2.0. - Restore skillopt_webui to the packaged wheel. See CHANGELOG.md for the full changelog and contributor acknowledgements. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-07-02 22:11:10 +08:00
Yif Yang	26e5338def	Update citation from @misc to @article format Co-Authored-By: Claude <noreply@anthropic.com>	2026-06-26 02:54:46 +00:00
Yifan Yang	baad64a3b9	docs(readme): remove Acknowledgements section (#81 ) The contributor is already credited via the Co-authored-by trailer carried into main by #79; a dedicated README section is unnecessary. Co-authored-by: Claude <noreply@anthropic.com>	2026-06-23 19:13:16 +08:00
Yifan Yang	c2e47c50fb	docs(readme): acknowledge community contributor @samuelgoofus-boop (#80 ) Add an Acknowledgements section crediting @samuelgoofus-boop for the Windows-robustness work on the Claude/Codex backends (originally #77, merged via #79). Co-authored-by: Claude <noreply@anthropic.com>	2026-06-23 19:03:30 +08:00
Yifan Yang	c98eac18c7	docs(readme): add Trendshift daily/weekly badges (#1 ) Add the microsoft/SkillOpt Trendshift badges (daily + weekly) side by side in the README header. Co-authored-by: Claude <noreply@anthropic.com>	2026-06-23 16:50:47 +08:00
Yifan Yang	de3be75bac	docs(sleep): add a SkillOpt-Sleep module readme + News mention Adds docs/sleep/README.md — a concise intro to the SkillOpt-Sleep plugin (what it is, how to use it across the three agents, the opt-in experience-replay / dream-rollout knobs, and headline results), linking to the full guide section. Adds a News bullet pointing to it. No code changes.	2026-06-15 16:31:15 +00:00
Yifan Yang	b701d9b6d9	docs: move SkillOpt-Sleep into the guide; clean docs/sleep; fix guide link Per maintainer request: - Remove the internal/scratch docs/sleep/ tree (reports, raw logs, blog run JSON, sweep.jsonl) — 23 files — and the root PUBLISHING.md. These were working notes, not reference docs. - Take the dedicated SkillOpt-Sleep content out of the main README (News bullet + section) and host it in the rendered guide instead: new section 9 in docs/guideline.html (deployment companion, the three plugins, opt-in experience replay / dream rollouts) with a sidebar entry. - Fix the README's opening reference so "Documentation & Reproduction Guide" links directly to the rendered GitHub Pages page, not the raw .html source. - Repoint the now-removed docs/sleep links in the plugin READMEs to the guide section. The plugin code (plugins/, skillopt_sleep/) is unchanged; only docs move. Co-Authored-By: Claude Opus 4 <noreply@anthropic.com>	2026-06-15 16:20:50 +00:00
Kirill Kostarev	31715a8b43	Add Codex Desktop transcript harvesting	2026-06-15 10:23:08 +00:00
Kirill Kostarev	1953484822	Make Codex integration skill-first	2026-06-15 10:21:30 +00:00
Yifan Yang	86bad36ffe	feat(sleep): SkillOpt-Sleep plugin update (preview) — engine robustness + scheduling Updates the SkillOpt-Sleep plugin on top of the current main. User-facing and engine improvements since the initial drop: * Command renamed /sleep -> /skillopt-sleep across Claude Code + Codex shells; refreshed plugin READMEs and install scripts. * Built-in scheduling (skillopt_sleep/scheduler.py + __main__): schedule / unschedule the nightly cycle without external cron wiring. * Backend robustness: bounded retry with backoff (no more silent empty-string on transient 429/timeout), content-filter-safe rollout prompt, an output-contract guardrail that rejects edits violating the task's required format, and a per-sample cache key so repeated dream rollouts are independent samples (fixes degenerate single-sample reflection). * consolidate / rollout / replay: parallel multi-rollout dreaming, gate-mode controls, TaskRecord.system framing field. Scope: this commit ships only the plugin engine + shells. Research/benchmark harnesses and their data are intentionally not included; the public package has no dependency on them (the one research-evaluator import is now guarded). Marked as an early preview in the README; we'll keep iterating. 99/99 unit tests pass. Co-Authored-By: Claude Opus 4 <noreply@anthropic.com>	2026-06-14 16:12:00 +00:00
Cuzyoung	b0b62fcb86	docs(readme): slim README — move install/quick-start/data/config details to the guideline page README now: badges + one-line pointer to docs/guideline.html, overview, demo, sleep section, extensibility pointers, WebUI launch, citation. All run-the-demo commands live in the guideline (which already covered install, credentials, training, eval, outputs, data prep, and config). Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>	2026-06-10 13:27:36 +00:00
Cuzyoung	0d5b331cd5	Merge branch 'docs/guideline' into feat/skill-aware-reflection # Conflicts: # README.md	2026-06-10 13:27:12 +00:00
Yifan Yang	dae974a5e3	chore(sleep): English-only across the engine, plugins, and docs Remove every non-ASCII/CJK character for a professional open-source repo: - harvest.py: drop hardcoded Chinese feedback phrases; add an env-based extensibility hook (SKILLOPT_SLEEP_NEG_FEEDBACK / _POS_FEEDBACK) so any locale can be added without baking one in. Verified with a German example. - rollout.py / consolidate.py: English comments. - README.md section heading + anchor, CONTROLLABLE_DREAMING.md, plugin.json, marketplace.json (also fixed stale path skillopt-sleep-plugin -> plugins/claude-code), SKILL.md: English only. - Remove the internal WAKE_UP_SUMMARY.md note (not user-facing, not referenced). Verified: zero CJK chars remain anywhere; 29 tests pass. Co-Authored-By: Claude Opus 4 <noreply@anthropic.com>	2026-06-08 14:31:52 +00:00
Yifan Yang	f9db99853b	feat(plugins): ship SkillOpt-Sleep for Claude Code, Codex, and Copilot Restructure into plugins/{claude-code,codex,copilot}/ — one engine, three thin shells, all calling the shared plugins/run-sleep.sh -> python -m skillopt_sleep. - claude-code/: existing plugin moved here; runner delegates to the shared launcher (fixes repo-root resolution after the move). - codex/: ~/.codex/prompts/sleep.md custom prompt + ~/.agents/skills SKILL.md + install.sh + AGENTS.md hint — Codex's documented, stable extension surfaces. - copilot/: a stdlib-only MCP server (mcp_server.py) exposing sleep_* tools, plus mcp-config.example.json and a copilot-instructions snippet. Verified end to end (initialize -> tools/list -> tools/call returns real engine output). - plugins/README.md overview table; main README News + a dedicated SkillOpt-Sleep section; pyproject lists skillopt_sleep as a first-class package. Decoupling emphasized throughout: open-source tool (skillopt_sleep/) with zero dependency on the research package. 29 tests pass; all three shells resolve. Co-Authored-By: Claude Opus 4 <noreply@anthropic.com>	2026-06-08 14:31:52 +00:00
Yif Yang	ee9931ec01	docs: add SkillOpt integration news	2026-06-03 16:07:56 +00:00
CharlesYang030	3f194d58e5	docs: trim News entry wording Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-06-02 23:12:40 +08:00
CharlesYang030	c7513d54f3	docs: update News section to match LLM2CLIP style Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-06-02 23:09:10 +08:00
CharlesYang030	abc9acd82e	docs: add fire emoji to News section heading Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-06-02 22:59:06 +08:00
CharlesYang030	46cc2efd8a	docs: add News section, PyPI install instructions, and PyPI badge to README Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-06-02 22:54:54 +08:00
Yifan Yang	fb1a76371d	Merge pull request #29 from LifeIsSoSolong/codex/qwen-chat-optimizer-backend Support qwen_chat as optimizer backend	2026-06-02 03:27:50 +08:00
hwq	181d71b737	Release data split manifests	2026-06-01 16:02:14 +00:00
kaikai-macbook	41012e2d5e	Support Qwen chat as optimizer backend	2026-06-01 16:44:49 +08:00
Yif Yang	8ebede0efd	Refine README for clarity on optimization results Removed redundant wording about math benchmarks.	2026-05-31 18:20:00 +08:00
Yif Yang	266fca72ab	docs: clarify optional features and ckpt artifacts	2026-05-31 09:36:25 +00:00
Yif Yang	9265545c45	docs: clarify README and paper-aligned skill artifacts	2026-05-31 09:23:07 +00:00
Cuzyoung	8acc2dd03e	docs: add self-contained reproduction & usage guideline page Add docs/guideline.html, a single self-contained documentation guide (left-nav + content + on-this-page TOC) covering installation, data preparation, training/eval, full configuration reference, framework internals, and an API reference. Link it from the README with local, htmlpreview, and GitHub Pages access instructions. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-05-31 09:01:25 +00:00
hwq	42e555d28e	Update eval-only README example	2026-05-30 15:28:17 +00:00
Yif Yang	4f3a9bc055	docs: scope PR #25 gate_metric as opt-in example, not default Move the soft/mixed gate-metric configuration introduced in PR #25 out of the base default config and into a standalone example config so that default SkillOpt runs (and paper reproduction) remain bit-for-bit on the original hard gate. - configs/_base_/default.yaml: drop gate_metric / gate_mixed_weight keys. The trainer's cfg.get("gate_metric", "hard") fallback preserves the original behavior unchanged. - configs/examples/soft_gate.yaml: new standalone reference config with a header explaining when to consider it (small selection split with continuous rewards) and when not to (paper reproduction, large or binary-reward settings). - README.md: add a short "Community-contributed configs" section that clearly flags this as user-contributed and non-default.	2026-05-30 08:09:03 +00:00
Huangzisu	dbc90bd755	fix(auth): let env vars override yaml for openai_compatible mode The yaml default `azure_openai_auth_mode: azure_cli` was silently overwriting `AZURE_OPENAI_AUTH_MODE` exported by the user, because `configure_clients()` treats any non-empty config value as an explicit override. Switching the three auth_mode defaults (shared / optimizer / target) to "" lets `_clean()` drop them and restores the intended fallback chain: yaml → env var → module default ("azure_cli"). Also update README and .env.example to document the openai_compatible mode introduced in `d5c5b61`, and remove the misleading `OPENAI_API_KEY` snippet — SkillOpt reuses the `AZURE_OPENAI_*` env vars in this mode.	2026-05-30 06:58:05 +00:00
Cuzyoung	99212e3956	docs: remove Star History section for now Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-05-26 08:12:51 +00:00
Cuzyoung	fc54c44e93	docs: add Star History chart to README Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-05-26 08:10:16 +00:00
Yif Yang	48adf5a69f	Update citation format in README.md	2026-05-26 02:56:58 +08:00
Yif Yang	b11e6dcfb9	Enhance training description in README Updated README to include '(mini-)batchsize' in the training description.	2026-05-26 02:35:10 +08:00
Yif Yang	c98bcdd5b3	Update README.md	2026-05-25 13:27:40 +08:00
Yif Yang	0f6db9afc4	Update README.md	2026-05-25 13:26:55 +08:00
Cuzyoung	7ae2d8766e	docs: restore clean README with Install/Data/QuickStart/WebUI/Citation only Keep remote project page header (badges, video), replace body with our streamlined 5-section README focused on reproducibility. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-05-24 19:19:19 +00:00
Cuzyoung	4a1b984d87	refactor: rename teacher/student to optimizer/target, remove best skills, fix slow update - Rename teacher -> optimizer, student -> target across all code, configs, docs, prompts - CLI: --teacher_model -> --optimizer_model, --student_model -> --target_model - Remove best_skill files, keep only initial skills - Fix slow update gate (force write into skill) - Fix SLOW_UPDATE marker stripping - Remove deep_reflect and meta_reflect mechanisms - Update .env.example with export prefix and azure_cli docs - Add endpoint empty validation in azure_openai.py Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-05-24 19:15:10 +00:00
Lliar-liar	c42d541828	Refine project links and citation section	2026-05-24 18:24:48 +00:00
Lliar-liar	2e05edc399	Add project links and citation section	2026-05-24 18:18:36 +00:00
Yif Yang	441ccb9bda	Update README.md	2026-05-25 02:15:02 +08:00
CharlesYang030	e27aac30ef	docs: add draft release notice	2026-05-21 17:39:36 +00:00
CharlesYang030	76a58e6e7a	docs: polish README header and remove license section	2026-05-21 17:34:47 +00:00
CharlesYang030	244e346b83	SkillOpt v0.1.0: initial release - Skill optimization framework with training loop analogy - 11 benchmarks, 4 model backends (Azure OpenAI, Claude, Codex, Qwen) - WebUI for browser-based training control - Pluggable architecture for extending benchmarks and backends	2026-05-21 17:22:04 +00:00

43 Commits