The leaderboard below provides an interactive summary of our experimental results. Use the toggles to switch tracks and click table headers to sort.
Tracks
- Instructed Track gives the agent a detailed refactoring specification to execute.
- Open Track gives the agent only a focus area, so the agent must decide what refactoring to do (and implement it).
Open Track Modes
- Direct: the agent needs to implement immediately.
- Plan: the agent proposes a plan, then in a second run implements it.
- Multiplan: propose multiple plans and an oracle is used to select the one that is best aligned with the human refactoring. In a second run the agent implements the selected plan.