Add ABIDES arena#104
Open
Muhtasham wants to merge 6 commits into
Open
Conversation
54128a6 to
bbc23f7
Compare
2879449 to
f9ccb8f
Compare
f9ccb8f to
6adff22
Compare
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
abides-sim/abidesc4bf157678928934417aba6073eb0651aeaf6d15, constrain Python dependencies, and pinpipin the arena imageabides_agent.pydefinesdecide(observation)and returns declarative buy/sell limit-order intentsCompetition Format
ABIDES is an independent score-maximization arena, not direct model-vs-model trading in the same market instance.
Each CodeClash player is evaluated in an isolated ABIDES market process using matched seeds/configuration: exchange, market maker, and background zero-intelligence traders. CodeClash compares players by average trusted mark-to-market profit across those matched market runs.
This is similar in spirit to the CybORG arena format: the submitted policies do not directly mutate or inspect each other inside one shared simulator world. The competition is over who writes the better policy for the same seeded simulator conditions.
Runtime Behavior
decide(observation)code out-of-process with a per-decision timeout and passes only plain observation dictionariesLimitOrdersORDER_EXECUTEDmessages plus final exchange price, not from mutable submitted-code stateCRASH_SCOREwithout stalling the whole roundfrom helper import XworkHardening
TradingAgentsubclass submission contract with a restricted protocol boundaryvalidation_timeout,decision_timeout, andplayer_timeoutconfig knobsabides_results.jsonhandling from neutral0.0ties toCRASH_SCOREwith error detailsVerification
uv run ruff check codeclash/arenas/abides/abides.py codeclash/arenas/abides/runtime/run_abides.py tests/arenas/test_abides.py-> passeduv run pytest -q tests/arenas/test_abides.py-> 10 passed after final validation-timeout fixuv run pytest -q tests/arenas-> 195 passed during final reviewer passuv run pytest -q-> 196 passed before final reviewer-pass patch; PR CI pytest is green after the final patchuv run pre-commit run --files codeclash/arenas/abides/abides.py codeclash/arenas/abides/runtime/run_abides.py codeclash/arenas/abides/runtime/abides_agent.py codeclash/arenas/abides/runtime/README.md tests/arenas/test_abides.py configs/examples/ABIDES__dummy__r1__s2.yaml docs/reference/arenas/abides.md-> passeddocker build -t codeclash/abides -f codeclash/arenas/abides/ABIDES.Dockerfile .-> passedstatus: "ok"orders_submitted: 18,policy_errors: 0-900.0for both playersuv run python main.py configs/examples/ABIDES__dummy__r1__s2.yaml -o "$SMOKE_DIR"cash,shares,orders_submitted,policy_errors, andstatusdecide()calls hit the decision timeout and did not stall the simulationsubprocess.TimeoutExpiredNote:
uv run mkdocs build --strictwas previously blocked by pre-existing docs warnings aboutREADME.md/nav entries; normal docs build was already verified earlier in this PR.