mirror of
https://github.com/microsoft/SkillOpt.git
synced 2026-07-03 14:02:58 +08:00
- Rename teacher -> optimizer, student -> target across all code, configs, docs, prompts - CLI: --teacher_model -> --optimizer_model, --student_model -> --target_model - Remove best_skill files, keep only initial skills - Fix slow update gate (force write into skill) - Fix SLOW_UPDATE marker stripping - Remove deep_reflect and meta_reflect mechanisms - Update .env.example with export prefix and azure_cli docs - Add endpoint empty validation in azure_openai.py Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
29 lines
728 B
Python
29 lines
728 B
Python
"""ReflACT: Reflective Agent Tuning.
|
|
|
|
A general-purpose framework for iteratively optimizing LLM agent skills
|
|
through structured reflection and self-improvement.
|
|
|
|
Pipeline stages:
|
|
1. Rollout — execute episodes with current skill
|
|
2. Reflect — analyze trajectories, generate patches
|
|
3. Aggregate — hierarchical merge of patches
|
|
4. Select — rank and select top edits
|
|
5. Update — apply edits to skill document
|
|
6. Evaluate — validate candidate skill, accept/reject
|
|
"""
|
|
|
|
__version__ = "0.1.0"
|
|
|
|
from skillopt.types import ( # noqa: F401
|
|
BatchSpec,
|
|
Edit,
|
|
EditOp,
|
|
FailureSummaryEntry,
|
|
GateAction,
|
|
GateResult,
|
|
Patch,
|
|
RawPatch,
|
|
RolloutResult,
|
|
SlowUpdateResult,
|
|
)
|