Conversation
@meiertgrootes the part related to
@rogerkuou I added you as a reviewer because in this PR I changed the dataset modules, making them faster. I still have to update the tests. Also, if you are working on #25, it is better to start from this branch.
embed_dim: Dimension of the embedding. The default is 128.
    Many vision transformers use embedding dimensions that are multiples
    of 64 (e.g., 64, 128, 256). This can be tuned.
max_len: Maximum length of the temporal dimension to precompute
It makes sense to implement temporal encodings of the same embedding dimension for architectural purposes. A simple sine and cosine would likely suffice, though. The underlying assumption behind this use is a cyclical nature of the temporal variable w.r.t. the modelled process. That may be debatable. Nevertheless, I believe it makes sense to use/investigate this approach here.
The approach of using sin/cos to encode the temporal position is based on "Attention Is All You Need", section 3.5 Positional Encoding, page 6. The reason given there is: "the sinusoidal version because it may allow the model to extrapolate to sequence lengths longer than the ones encountered during training", which I think is useful in our case! In that section, they also compare this with another approach from Convolutional Sequence to Sequence Learning. We might explore this in the future. 🤔
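For reference, the sinusoidal scheme from section 3.5 can be sketched as below. This is a minimal NumPy illustration, not the PR's actual code; the function name `sinusoidal_encoding` and the use of the `max_len` and `embed_dim` parameters from the docstring above are my assumptions:

```python
import numpy as np

def sinusoidal_encoding(max_len: int, embed_dim: int) -> np.ndarray:
    """Precompute sin/cos positional encodings ("Attention Is All You Need", sec. 3.5).

    Returns an array of shape (max_len, embed_dim): even columns hold
    sin(pos / 10000^(2i/embed_dim)), odd columns the matching cos term.
    Assumes embed_dim is even (e.g. the default of 128).
    """
    positions = np.arange(max_len)[:, np.newaxis]  # (max_len, 1)
    # Geometric progression of frequencies across the embedding dimensions.
    div_terms = np.exp(-np.log(10000.0) * np.arange(0, embed_dim, 2) / embed_dim)
    encoding = np.zeros((max_len, embed_dim))
    encoding[:, 0::2] = np.sin(positions * div_terms)  # even dims: sine
    encoding[:, 1::2] = np.cos(positions * div_terms)  # odd dims: cosine
    return encoding

# Each dimension is a fixed-frequency wave, so positions beyond those seen
# during training still map onto the same smooth curves -- the extrapolation
# property quoted above.
pe = sinusoidal_encoding(max_len=36, embed_dim=128)
```

Since the table is precomputed once up to `max_len`, it can be registered as a constant buffer and simply added to the token embeddings at each forward pass.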
meiertgrootes left a comment
The implementation looks sound, with the adaptation of the final convolutional smoothing and mixing of monthly aggregated information.
See the comment about underlying assumptions on encoding time (and by extension spatial position) w.r.t. year(/global reference), but that is for further exploration.
Thanks @SarahAlidoost. I think it is ready to merge. I tested this PR in #27. I still need to subset the data and shrink the patch size to make the training executable on my local machine. The training process runs smoothly.
closes #23
🔴 This branch is based on #20 and is waiting for #19 to modify the example notebook
In this PR:
Todo: