Skip to content

Boxoban incremental#570

Open
TBBristol wants to merge 6 commits into
PufferAI:4.0from
TBBristol:boxoban-incremental
Open

Boxoban incremental#570
TBBristol wants to merge 6 commits into
PufferAI:4.0from
TBBristol:boxoban-incremental

Conversation

@TBBristol
Copy link
Copy Markdown
Contributor

@TBBristol TBBristol commented May 20, 2026

Adds a new incremental difficulty (5) which is based on maze_incremental
Standard difficulties remain the same and tested
Targets hit is now a % on all difficulties
When a map is solve it increases difficulty up to level 3 (hard) and does not end episode
Keeps loading hard maps if hard is solved
Episode tick is now total ticks, puzzle tick is tick final puzzle
Timeout is 150 per puzzle not episode and based on puzzle tick
For incremental mode scoring is as follows:
score = 0..4 for highest difficulty cleared in the episode (note +1 offset from difficulty labels)
perf = score / 4.0
So if all envs are reaching hard perf is 1

@TBBristol TBBristol force-pushed the boxoban-incremental branch from c9edf83 to 47bafce Compare May 20, 2026 12:25
@TBBristol
Copy link
Copy Markdown
Contributor Author

perf changed to

perf = (score + targets_hit) / 5.0f;

Where targets his is a % of targets on the level.
This normalises to at all targets hit on HARD but allows for incremental gains where the next level is not yet achieved.

Score is still level achieved.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant