GitHub - scaleapi/on-policy-expert-corrections-os: open source code for on-policy-expert-corrections

A fork of SWE-agent for generating on-policy expert corrections (OEC) trajectories.

Student models must be hosted (e.g., with vLLM), and the correct model names and ports must be specified in "config/qwen32b_switch_claude_python_tools.yaml". Run generate_oec.sh to generate OEC trajectories. Then use eval_swesmith.sh, convert_to_sft.sh, and trajectories/prep_for_sft.sh in order to generate data for SFT.
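The workflow above can be sketched as a small Python driver. This is only an illustration: it assumes the four scripts are run from the repository root in the order they are listed and need no command-line arguments, which the README does not confirm. The names `PIPELINE` and `run_pipeline` are hypothetical and not part of the repository.

```python
import subprocess
from pathlib import Path

# Script names come from the README; running them in this order and
# without arguments is an assumption, not documented behaviour.
PIPELINE = [
    "generate_oec.sh",
    "eval_swesmith.sh",
    "convert_to_sft.sh",
    "trajectories/prep_for_sft.sh",
]


def run_pipeline(repo_root=".", runner=subprocess.run):
    """Run each pipeline script from repo_root, stopping at the first failure.

    `runner` is injectable so the sequencing can be exercised without
    actually launching the scripts.
    """
    root = Path(repo_root)
    completed = []
    for script in PIPELINE:
        if not (root / script).is_file():
            raise FileNotFoundError(f"expected script not found: {root / script}")
        # check=True makes subprocess.run raise CalledProcessError on a
        # non-zero exit, so later stages never see partial output.
        runner(["bash", script], cwd=root, check=True)
        completed.append(script)
    return completed
```

Calling `run_pipeline("/path/to/on-policy-expert-corrections-os")` would only make sense once the student model server is up and the config file points at it.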

Training problem instances can be sourced from the SWE-smith GitHub repository: https://github.com/SWE-bench/SWE-smith.

failure_categorization contains code for the LLM-as-judge categorization of trajectories into buckets.
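As a rough picture of what LLM-as-judge bucketing involves, the sketch below builds a prompt, asks a judge, and tallies the answers. It is not the code in failure_categorization: the bucket names, the prompt wording, and the `judge` callable (any function mapping a prompt string to a reply string) are all hypothetical placeholders.

```python
from collections import Counter

# Hypothetical bucket names, for illustration only.
BUCKETS = ["wrong_localization", "incorrect_edit", "ran_out_of_steps", "other"]


def build_judge_prompt(trajectory_text, buckets=BUCKETS):
    """Format a classification prompt that lists the allowed buckets."""
    options = "\n".join(f"- {b}" for b in buckets)
    return (
        "Classify why the following agent trajectory failed.\n"
        f"Answer with exactly one of:\n{options}\n\n"
        f"Trajectory:\n{trajectory_text}"
    )


def parse_bucket(reply, buckets=BUCKETS, fallback="other"):
    """Map a free-form judge reply onto a known bucket name."""
    normalized = reply.strip().lower()
    for bucket in buckets:
        if bucket in normalized:
            return bucket
    return fallback  # unparseable replies are not silently dropped


def categorize(trajectories, judge, buckets=BUCKETS):
    """Count how many trajectories the judge assigns to each bucket."""
    counts = Counter()
    for text in trajectories:
        reply = judge(build_judge_prompt(text, buckets))
        counts[parse_bucket(reply, buckets)] += 1
    return counts
```

Keeping the judge behind a plain callable means the parsing and counting logic can be checked with a stub before any model is called.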

covariate_shift_analysis contains code for embedding SWE-agent trajectories and computing the divergence.
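The README does not say which divergence is computed or which embedding model is used. As one possible illustration, the sketch below fits a diagonal Gaussian to each set of embedding vectors and computes the KL divergence between them. The choice of divergence, the function name `diag_gaussian_kl`, and the `eps` smoothing term are assumptions, not the repository's method.

```python
import math
from statistics import fmean, pvariance


def diag_gaussian_kl(p_vectors, q_vectors, eps=1e-6):
    """KL(P || Q) between diagonal Gaussians fitted to two sets of vectors.

    Each argument is a list of equal-length embedding vectors. `eps` is
    added to every variance so constant dimensions do not divide by zero.
    """
    dims = len(p_vectors[0])
    total = 0.0
    for d in range(dims):
        p_col = [v[d] for v in p_vectors]
        q_col = [v[d] for v in q_vectors]
        mu_p, mu_q = fmean(p_col), fmean(q_col)
        var_p = pvariance(p_col, mu_p) + eps
        var_q = pvariance(q_col, mu_q) + eps
        # Closed-form KL between two univariate Gaussians; independent
        # dimensions simply add up.
        total += 0.5 * (
            math.log(var_q / var_p)
            + (var_p + (mu_p - mu_q) ** 2) / var_q
            - 1.0
        )
    return total
```

Under this sketch, a larger value means the first set of trajectory embeddings (for example, states visited by the student) sits further from the second (for example, states visited by the expert). The measure is asymmetric, so the argument order matters.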

Folders and files

assets
config
covariate_shift_analysis
docs
failure_categorization
sweagent
tests
tools
traj_mgr
trajectories
.gitignore
CODEOWNERS
CONTRIBUTING.md
LICENSE
README.md
SECURITY.md
codecov.yml
convert_to_sft.sh
eval_swesmith.sh
generate_oec.sh
mkdocs.yml
mlc_config.json
pyproject.toml
