microsoft-SkillOpt

mirror of https://github.com/microsoft/SkillOpt.git synced 2026-07-03 14:02:58 +08:00

Author	SHA1	Message	Date
Cuzyoung	b0b62fcb86	docs(readme): slim README — move install/quick-start/data/config details to the guideline page README now: badges + one-line pointer to docs/guideline.html, overview, demo, sleep section, extensibility pointers, WebUI launch, citation. All run-the-demo commands live in the guideline (which already covered install, credentials, training, eval, outputs, data prep, and config). Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>	2026-06-10 13:27:36 +00:00
Cuzyoung	0d5b331cd5	Merge branch 'docs/guideline' into feat/skill-aware-reflection # Conflicts: # README.md	2026-06-10 13:27:12 +00:00
Yifan Yang	dae974a5e3	chore(sleep): English-only across the engine, plugins, and docs Remove every non-ASCII/CJK character for a professional open-source repo: - harvest.py: drop hardcoded Chinese feedback phrases; add an env-based extensibility hook (SKILLOPT_SLEEP_NEG_FEEDBACK / _POS_FEEDBACK) so any locale can be added without baking one in. Verified with a German example. - rollout.py / consolidate.py: English comments. - README.md section heading + anchor, CONTROLLABLE_DREAMING.md, plugin.json, marketplace.json (also fixed stale path skillopt-sleep-plugin -> plugins/claude-code), SKILL.md: English only. - Remove the internal WAKE_UP_SUMMARY.md note (not user-facing, not referenced). Verified: zero CJK chars remain anywhere; 29 tests pass. Co-Authored-By: Claude Opus 4 <noreply@anthropic.com>	2026-06-08 14:31:52 +00:00
Yifan Yang	f9db99853b	feat(plugins): ship SkillOpt-Sleep for Claude Code, Codex, and Copilot Restructure into plugins/{claude-code,codex,copilot}/ — one engine, three thin shells, all calling the shared plugins/run-sleep.sh -> python -m skillopt_sleep. - claude-code/: existing plugin moved here; runner delegates to the shared launcher (fixes repo-root resolution after the move). - codex/: ~/.codex/prompts/sleep.md custom prompt + ~/.agents/skills SKILL.md + install.sh + AGENTS.md hint — Codex's documented, stable extension surfaces. - copilot/: a stdlib-only MCP server (mcp_server.py) exposing sleep_* tools, plus mcp-config.example.json and a copilot-instructions snippet. Verified end to end (initialize -> tools/list -> tools/call returns real engine output). - plugins/README.md overview table; main README News + a dedicated SkillOpt-Sleep section; pyproject lists skillopt_sleep as a first-class package. Decoupling emphasized throughout: open-source tool (skillopt_sleep/) with zero dependency on the research package. 29 tests pass; all three shells resolve. Co-Authored-By: Claude Opus 4 <noreply@anthropic.com>	2026-06-08 14:31:52 +00:00
Yif Yang	ee9931ec01	docs: add SkillOpt integration news	2026-06-03 16:07:56 +00:00
CharlesYang030	3f194d58e5	docs: trim News entry wording Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-06-02 23:12:40 +08:00
CharlesYang030	c7513d54f3	docs: update News section to match LLM2CLIP style Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-06-02 23:09:10 +08:00
CharlesYang030	abc9acd82e	docs: add fire emoji to News section heading Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-06-02 22:59:06 +08:00
CharlesYang030	46cc2efd8a	docs: add News section, PyPI install instructions, and PyPI badge to README Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-06-02 22:54:54 +08:00
Yifan Yang	fb1a76371d	Merge pull request #29 from LifeIsSoSolong/codex/qwen-chat-optimizer-backend Support qwen_chat as optimizer backend	2026-06-02 03:27:50 +08:00
hwq	181d71b737	Release data split manifests	2026-06-01 16:02:14 +00:00
kaikai-macbook	41012e2d5e	Support Qwen chat as optimizer backend	2026-06-01 16:44:49 +08:00
Yif Yang	8ebede0efd	Refine README for clarity on optimization results Removed redundant wording about math benchmarks.	2026-05-31 18:20:00 +08:00
Yif Yang	266fca72ab	docs: clarify optional features and ckpt artifacts	2026-05-31 09:36:25 +00:00
Yif Yang	9265545c45	docs: clarify README and paper-aligned skill artifacts	2026-05-31 09:23:07 +00:00
Cuzyoung	8acc2dd03e	docs: add self-contained reproduction & usage guideline page Add docs/guideline.html, a single self-contained documentation guide (left-nav + content + on-this-page TOC) covering installation, data preparation, training/eval, full configuration reference, framework internals, and an API reference. Link it from the README with local, htmlpreview, and GitHub Pages access instructions. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-05-31 09:01:25 +00:00
hwq	42e555d28e	Update eval-only README example	2026-05-30 15:28:17 +00:00
Yif Yang	4f3a9bc055	docs: scope PR #25 gate_metric as opt-in example, not default Move the soft/mixed gate-metric configuration introduced in PR #25 out of the base default config and into a standalone example config so that default SkillOpt runs (and paper reproduction) remain bit-for-bit on the original hard gate. - configs/_base_/default.yaml: drop gate_metric / gate_mixed_weight keys. The trainer's cfg.get("gate_metric", "hard") fallback preserves the original behavior unchanged. - configs/examples/soft_gate.yaml: new standalone reference config with a header explaining when to consider it (small selection split with continuous rewards) and when not to (paper reproduction, large or binary-reward settings). - README.md: add a short "Community-contributed configs" section that clearly flags this as user-contributed and non-default.	2026-05-30 08:09:03 +00:00
Huangzisu	dbc90bd755	fix(auth): let env vars override yaml for openai_compatible mode The yaml default `azure_openai_auth_mode: azure_cli` was silently overwriting `AZURE_OPENAI_AUTH_MODE` exported by the user, because `configure_clients()` treats any non-empty config value as an explicit override. Switching the three auth_mode defaults (shared / optimizer / target) to "" lets `_clean()` drop them and restores the intended fallback chain: yaml → env var → module default ("azure_cli"). Also update README and .env.example to document the openai_compatible mode introduced in `d5c5b61`, and remove the misleading `OPENAI_API_KEY` snippet — SkillOpt reuses the `AZURE_OPENAI_*` env vars in this mode.	2026-05-30 06:58:05 +00:00
Cuzyoung	99212e3956	docs: remove Star History section for now Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-05-26 08:12:51 +00:00
Cuzyoung	fc54c44e93	docs: add Star History chart to README Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-05-26 08:10:16 +00:00
Yif Yang	48adf5a69f	Update citation format in README.md	2026-05-26 02:56:58 +08:00
Yif Yang	b11e6dcfb9	Enhance training description in README Updated README to include '(mini-)batchsize' in the training description.	2026-05-26 02:35:10 +08:00
Yif Yang	c98bcdd5b3	Update README.md	2026-05-25 13:27:40 +08:00
Yif Yang	0f6db9afc4	Update README.md	2026-05-25 13:26:55 +08:00
Cuzyoung	7ae2d8766e	docs: restore clean README with Install/Data/QuickStart/WebUI/Citation only Keep remote project page header (badges, video), replace body with our streamlined 5-section README focused on reproducibility. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-05-24 19:19:19 +00:00
Cuzyoung	4a1b984d87	refactor: rename teacher/student to optimizer/target, remove best skills, fix slow update - Rename teacher -> optimizer, student -> target across all code, configs, docs, prompts - CLI: --teacher_model -> --optimizer_model, --student_model -> --target_model - Remove best_skill files, keep only initial skills - Fix slow update gate (force write into skill) - Fix SLOW_UPDATE marker stripping - Remove deep_reflect and meta_reflect mechanisms - Update .env.example with export prefix and azure_cli docs - Add endpoint empty validation in azure_openai.py Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-05-24 19:15:10 +00:00
Lliar-liar	c42d541828	Refine project links and citation section	2026-05-24 18:24:48 +00:00
Lliar-liar	2e05edc399	Add project links and citation section	2026-05-24 18:18:36 +00:00
Yif Yang	441ccb9bda	Update README.md	2026-05-25 02:15:02 +08:00
CharlesYang030	e27aac30ef	docs: add draft release notice	2026-05-21 17:39:36 +00:00
CharlesYang030	76a58e6e7a	docs: polish README header and remove license section	2026-05-21 17:34:47 +00:00
CharlesYang030	244e346b83	SkillOpt v0.1.0: initial release - Skill optimization framework with training loop analogy - 11 benchmarks, 4 model backends (Azure OpenAI, Claude, Codex, Qwen) - WebUI for browser-based training control - Pluggable architecture for extending benchmarks and backends	2026-05-21 17:22:04 +00:00

33 Commits