Add Bomberland arena#105
Open
Muhtasham wants to merge 3 commits into
Open
Conversation
4e12494 to
9a53f59
Compare
9a53f59 to
365a122
Compare
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
Adds a Bomberland arena to CodeClash as a separate game adapter.
Bomberland is a Bomberman-style multi-agent grid arena based on Coder One's Bomberland competition. Players submit
bomberland_agent.pywithnext_actions(game_state), controlling multiple units that move, place bombs, destroy blocks, damage opponents, and score across deterministic seeded simulations.Competition Format
Bomberland is direct head-to-head competition.
Each simulation contains both submitted players in the same grid world. Their units, bombs, blasts, movement collisions, damage, kills, and block destruction all affect the shared game state. Scores are computed from that shared interaction, and
sims_per_roundis required to be even so the runtime can alternate starting sides across paired seeded simulations.This is the standard CodeClash arena shape where submitted code directly competes in the same environment, unlike independent score-maximization arenas such as CybORG or ABIDES.
What changed
BomberlandArenaand registered it in the arena registry.codeclash/bomberlandDocker image that pins the upstreamCoderOneHQ/bomberlandrepo for provenance at/opt/bomberland.bomberland_agent.py, runtime README, public docs page, and dummy smoke config.Runtime contract
sims_per_roundmust be even so both players receive paired starting sides.validation_timeoutbounds validation-time import/probe execution.agent_timeoutbounds each runtime agent call in a child process, so hanging submitted code is scored as an agent error instead of hanging the round.average_scores,total_scores,sims, and per-simulationdetailsJSON strings.Notes
connection.agent_id,agents[player].unit_ids,unit_state,entities,world, andtick.Verification
uv run ruff check codeclash/arenas/bomberland/bomberland.py codeclash/arenas/bomberland/runtime/run_bomberland.py tests/arenas/test_bomberland.py-> passeduv run pytest -q tests/arenas/test_bomberland.py-> 16 passed after final validation-timeout fixuv run pytest -q tests/arenas-> 201 passed during final reviewer passuv run pre-commit run --files codeclash/arenas/bomberland/bomberland.py codeclash/arenas/bomberland/runtime/run_bomberland.py tests/arenas/test_bomberland.py-> passeddocker build -t codeclash/bomberland -f codeclash/arenas/bomberland/Bomberland.Dockerfile .-> passeduv run python main.py configs/examples/Bomberland__dummy__r1__s2.yaml -o /private/tmp/codeclash-bomberland-review.4EK2ezaverage_scores, per-sim details,alive_hp,alive_units, and per-player stats fieldsinvalid_actionsagent_errorsopenai/gpt-4.1-miniplayer vs dummy, 1 round, 2 Bomberland sims, Docker runtime. Both pre-edit and post-edit Bomberland rounds completed with zero agent errors and scores of220.0for both players.openai/gpt-5.5player vs dummy, 1 round, 2 Bomberland sims, Docker runtime. The pre-edit round tied at220.0/220.0; GPT-5.5 edited the bot; the post-edit round validated and scored445.0/220.0forgpt55_smoke, with no invalid submissions.CRASH_SCOREdetails instead of silent crash scores.