perf: matrix accessor rewrite by coroa · Pull Request #630 · PyPSA/linopy

coroa · 2026-03-23T09:28:28Z

Changes proposed in this Pull Request

Ok, i went ahead and tried to optimize the direct solver path which looked very promising from #596 by @CharlieFModo . Most solvers accept scipy sparse matrices in CSR format, those CSR matrices are easy to stack and correspond quite closely to how constraints are stored within xarray currently.

New constraint class hierarchy

A new abstract base class ConstraintBase provides all
shared read-only properties and methods. A new frozen CSRConstraint class
stores constraint data directly as a scipy CSR sparse matrix plus flat numpy
arrays for RHS and duals — avoiding the xarray Dataset overhead
constraints.

ConstraintBase (ABC)
├── Constraint   — xarray Dataset-backed, mutable
└── CSRConstraint          — CSR-backed, immutable ("frozen")

One can losslessly convert between Constraint and CSRConstraint, by calling mutable() or freeze().

add_constraints(..., freeze=True) converts the Constraint to a CSRConstraint before assigning it to the model. I measured memory savings of up to 90% for constraints with many terms.

EDIT:

The feature is made opt-in and the globally freezing constraint can be configured with

Model(freeze_constraints = True)

additional gains from not setting the name (even though I made it more performant)

Model(set_names_in_solver_io=False)

.to_matrix() on both of these now always returns a csr matrix of shape(n_active_constraints, n_active_variables) and con_labels a numpy array of size n_active_constraints with the label of the constraint in it.

So that the number of columns of these non-missing matrices can match with n_active_variables there is a new VariableLabelIndex on m.vars which holds two cached properties:

vlabels: active variable labels in encounter order, shape (n_active_vars,)
label_to_pos: derived from vlabels; size _xCounter, maps label -> position (-1 if masked)

Chunking is not immediately useful anymore, because each constraint will be turned quite quickly into a numpy array before being reduced to the csr matrix. We probably could implement an optimized to_matrix path which uses the chunking.

Open ToDos

Benchmark against current main branch
Document API changes: need for .mutable(), .freeze(), DataArray API only on Constraint
LP writer works by reconstructing mutable constraints out of the frozen ones, should be replaced by a new implementation.
Tests of direct apis other than highs
~~Think about Chunking~~
mypy is quite unhappy with my changes :(

Quick benchmarking with pypsa.examples.carbon_management

Slightly cheated, because i commented out the names setting in to_highspy, which is unnecessary for solving and retrieving the solution, because we can just work with the indices. This takes ~2s. But i removed it from both branches (because it affected them equally).

On main branch

In [1]: import pypsa, linopy as lp
   ...: n = pypsa.examples.carbon_management()
In [2]: %time m = n.optimize.create_model()
[...]
CPU times: user 4.49 s, sys: 483 ms, total: 4.97 s
Wall time: 4.97 s
In [5]: %time lp.io.to_highspy(m)
Running HiGHS 1.12.0 (git hash: 755a8e0): Copyright (c) 2025 HiGHS under MIT licence terms
CPU times: user 10.9 s, sys: 1.78 s, total: 12.7 s
Wall time: 12.7 s
Out[5]: <highspy.highs.Highs at 0x70228531d010>

On `perf/matrix-acccesor-rewrite` (this PR)

In [1]: import pypsa, linopy as lp
   ...: n = pypsa.examples.carbon_management()
In [2]: %time m = n.optimize.create_model()
CPU times: user 5.33 s, sys: 519 ms, total: 5.84 s
Wall time: 5.86 s
In [5]: %time lp.io.to_highspy(m)
Running HiGHS 1.12.0 (git hash: 755a8e0): Copyright (c) 2025 HiGHS under MIT licence terms
CPU times: user 446 ms, sys: 266 ms, total: 712 ms
Wall time: 713 ms
Out[5]: <highspy.highs.Highs at 0x767c48a17ad0>

EDIT (by @FabianHofmann): Changed back naming to keep Constraint class untouched and add CSRConstriaint

Checklist

Code changes are sufficiently documented; i.e. new functions contain docstrings and further explanations may be given in doc.
Unit tests for new features were added (if applicable).
A note for the release notes doc/release_notes.rst of the upcoming release is included.
I consent to the release of this PR's code under the MIT license.

FBumann

I commented on some minor things. Im not using the direct interface and wont be able to add that much value probably.
I verified the speed claims on the direct io paths.
I also verified that the existing LP writer is a lot slower.

FBumann · 2026-03-23T16:14:19Z

Independent benchmark results (using pytest-benchmark suite from #567)

Matrix generation (the core win — direct solver path)

Model	Size	master	this PR	Speedup
basic	n=10	5.5ms	0.14ms	39x
basic	n=100	26ms	0.34ms	79x
basic	n=500	516ms	5.4ms	96x
basic	n=1000	2,391ms	22ms	110x
basic	n=1600	6,871ms	58ms	120x
expr_arith	n=1000	3,128ms	29ms	109x
sparse_net	n=1000	200ms	20ms	10x

Matrix generation is 30–120x faster, with speedup increasing at larger problem sizes.

Build phase (model construction)

Roughly neutral — within noise for small models, ~7% slower for pypsa_scigrid builds (likely due to the freeze step during add_constraints).

LP file writing

LP writing regressed 15–70% because frozen constraints must call .mutable() → reconstruct full xarray Dataset before conversion to polars for writing. The LP writer in constraints_to_file() calls con.to_polars() which on frozen Constraint delegates to self.mutable().to_polars().

Additionally, iterate_slices() crashes on frozen constraints because it uses ds.isel() which doesn't exist on the CSR-backed class. This means lp_write_pypsa_scigrid fails entirely.

Fix: #631 adds a direct CSR-to-polars path in Constraint.to_polars() and overrides iterate_slices() with a CSR row-batch iterator. This brings LP write performance to 20–40% faster than master and fixes the crash.

CI failures

mypy — MutableConstraint not accepted where Constraint is expected in add_constraints() signature. The type hint needs to accept ConstraintBase.
Windows Python 3.10 — assert dtype('int64') == int fails in test_constraint_assignment and test_anonymous_constraint_assignment. Likely np.intp resolving to int32 on Windows.
Doc notebooks — exit code 1 with minimal log output, needs investigation.

FBumann · 2026-03-23T16:16:23Z

I implemented a first version of a new lp writer on #631. The result is already quite a bit faster than the current master. So no issues there.

- Fix __repr__ passing CSR positions instead of variable labels - Fix set_blocks failing on frozen Constraint - Extract _active_to_dataarray helper to reduce DRY violations - Simplify reset_dual to direct mutation instead of reconstruction - Add tests for freeze/mutable roundtrip, VariableLabelIndex, to_matrix_with_rhs, from_mutable mixed signs, repr correctness

FabianHofmann · 2026-03-24T09:15:22Z

this is great, one big question that came up when looking at the code is: why renaming Constraint to MutableConstraint and making Constraint the frozen one, and not instead keep Constraint and add a FrozenConstraint? From a user perspective the current renaming is breaking and the exposed object has a non-intuitive name.

coroa · 2026-03-24T09:26:16Z

this is great, one big question that came up when looking at the code is: why renaming Constraint to MutableConstraint and making Constraint the frozen one, and not instead keep Constraint and add a FrozenConstraint? From a user perspective the current renaming is breaking and the exposed object has a non-intuitive name.

Well two reasons:

I was not sure whether i would like to completely remove the old representation or need to keep it for something, my goal was to replace it completely, so i wanted to emphasize that the new default is: Constraint.
Also FrozenConstraint implies that we cannot modify it at all, while we could readd setters; one only needs to think carefully about those (and i only had a limited amount of time :), and wanted to reap the benefits earlier).

Currently they interact nicely enough that we might want to keep the MutableConstraint indefinitely and then I agree that it makes sense to keep the old representation under the old name and maybe name the new one CSRConstraint. Do you think that is clearer?

… MatrixAccessor compat

…or mypy

FBumann · 2026-03-30T12:40:51Z

@FabianHofmann I didnt realize that freezing was opt in. I just ran integration tests again with freezing on Model, and my testing fails due to linopy.testing.assert_con_equal raising.

Bug report

assert_linequal compares coeffs and vars positionally along _term. With freeze_constraints=True, the CSR round-trip sorts terms by variable label, so stored constraints have different term order than freshly built expressions. This causes false failures on mathematically equivalent constraints.

from linopy import Model
from linopy.testing import assert_linequal
import pandas as pd

m = Model(freeze_constraints=True)
coords = pd.RangeIndex(3, name="time")
x = m.add_variables(coords=[coords], name="x")
y = m.add_variables(coords=[coords], name="y")

m.add_constraints(y - x == 0, name="con")

desired = y == x
assert_linequal(m.constraints["con"].lhs, desired.lhs)  # fails

Sorting both sides by vars along _term before comparing would fix it probably. I think this is a quick fix in testing.py, but maybe this is unintended behaviour after all?

FabianHofmann · 2026-04-01T09:05:00Z

@FabianHofmann I didnt realize that freezing was opt in. I just ran integration tests again with freezing on Model, and my testing fails due to linopy.testing.assert_con_equal raising.

Bug report

assert_linequal compares coeffs and vars positionally along _term. With freeze_constraints=True, the CSR round-trip sorts terms by variable label, so stored constraints have different term order than freshly built expressions. This causes false failures on mathematically equivalent constraints.
from linopy import Model
from linopy.testing import assert_linequal
import pandas as pd

m = Model(freeze_constraints=True)
coords = pd.RangeIndex(3, name="time")
x = m.add_variables(coords=[coords], name="x")
y = m.add_variables(coords=[coords], name="y")

m.add_constraints(y - x == 0, name="con")

desired = y == x
assert_linequal(m.constraints["con"].lhs, desired.lhs)  # fails
Sorting both sides by vars along _term before comparing would fix it probably. I think this is a quick fix in testing.py, but maybe this is unintended behaviour after all?

@FBumann (I am back from a small break I had to take to restore my computer) good catch. that is a good question. I would argue we want to test the semantic equality of the linear expressions so sorting before would be fair. could you spot where exactly the resorting takes place?

Sort both sides by variable labels along _term before comparing, so expressions with different term orderings (e.g. from CSR round-trip with freeze_constraints=True) are correctly recognized as equal. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

FBumann · 2026-04-01T09:50:41Z

@FabianHofmann I didnt realize that freezing was opt in. I just ran integration tests again with freezing on Model, and my testing fails due to linopy.testing.assert_con_equal raising.

Bug report

assert_linequal compares coeffs and vars positionally along _term. With freeze_constraints=True, the CSR round-trip sorts terms by variable label, so stored constraints have different term order than freshly built expressions. This causes false failures on mathematically equivalent constraints.
from linopy import Model
from linopy.testing import assert_linequal
import pandas as pd

m = Model(freeze_constraints=True)
coords = pd.RangeIndex(3, name="time")
x = m.add_variables(coords=[coords], name="x")
y = m.add_variables(coords=[coords], name="y")

m.add_constraints(y - x == 0, name="con")

desired = y == x
assert_linequal(m.constraints["con"].lhs, desired.lhs)  # fails
Sorting both sides by vars along _term before comparing would fix it probably. I think this is a quick fix in testing.py, but maybe this is unintended behaviour after all?
@FBumann (I am back from a small break I had to take to restore my computer) good catch. that is a good question. I would argue we want to test the semantic equality of the linear expressions so sorting before would be fair. could you spot where exactly the resorting takes place?

constraints.py:478 — csr.sum_duplicates() sorts CSR column indices by variable label within each row. When CSRConstraint._to_dataset() (line 730-731) reconstructs the constraint, terms come out in variable-label order instead of insertion order.

csr.sum_duplicates() does the sorting using scipy. We cant change that afaik, and term order doesnt matter for math, so we need to adjust the testing instead.

for more information, see https://pre-commit.ci

coroa

Ok, went over it. Quite some stuff still to clean up

coroa · 2026-05-08T09:33:43Z

+    @property
+    def rhs(self) -> DataArray:
+        """Get RHS DataArray, shape (*coord_dims)."""
+        return self._active_to_dataarray(self._rhs, fill=np.nan)
+
+    @rhs.setter
+    def rhs(self, value: ExpressionLike | VariableLike | ConstantLike) -> None:
+        self._refreeze_after(lambda mc: setattr(mc, "rhs", value))
+
+    @property
+    def lhs(self) -> expressions.LinearExpression:
+        """Get LHS as LinearExpression (triggers Dataset reconstruction)."""
+        return self.mutable().lhs
+
+    @lhs.setter
+    def lhs(self, value: ExpressionLike | VariableLike | ConstantLike) -> None:
+        self._refreeze_after(lambda mc: setattr(mc, "lhs", value))
+
+    def _refreeze_after(self, mutate: Callable[[Constraint], None]) -> None:
+        mc = self.mutable()
+        mutate(mc)
+        refrozen = CSRConstraint.from_mutable(mc, self._cindex)
+        self._csr = refrozen._csr
+        self._con_labels = refrozen._con_labels
+        self._rhs = refrozen._rhs
+        self._sign = refrozen._sign
+        self._coords = refrozen._coords
+        self._dual = None


I really don't like this hack

coroa · 2026-05-08T10:01:37Z

+    def lhs(self) -> expressions.LinearExpression:
+        """Get the left-hand-side linear expression of the constraint."""
+        data = self.data[["coeffs", "vars"]].rename({self.term_dim: TERM_DIM})
+        return expressions.LinearExpression(data, self.model)


I'd move this onto Constraint

coroa · 2026-05-08T10:02:48Z

+    def to_matrix(
+        self, label_index: VariableLabelIndex
+    ) -> tuple[scipy.sparse.csr_array, np.ndarray]:
+        """
+        Construct a dense CSR matrix for this constraint.
+
+        Only active (non-masked) rows are included. Column indices are dense
+        positions in the active variable array, as given by ``label_index``.
+
+        Parameters
+        ----------
+        label_index : VariableLabelIndex
+            Variable label index providing ``label_to_pos`` and ``n_active_vars``.
+
+        Returns
+        -------
+        csr : scipy.sparse.csr_array
+            Shape (n_active_cons, n_active_vars).
+        con_labels : np.ndarray
+            Active constraint labels in row order.
+        """
+        con_labels, _, cols, data, indptr = _matrix_export_data(self, label_index)
+        csr = scipy.sparse.csr_array(
+            (data, cols, indptr), shape=(len(con_labels), label_index.n_active_vars)
+        )
+        csr.sum_duplicates()
+        return csr, con_labels


I'd make this abstract here and move the implementation onto Constraint

…helper Replace the convoluted cumsum/diff/range loop with a clean while-loop helper that uses searchsorted directly on indptr. Batch slices pass coords=[] since batches cover contiguous active rows, not a contiguous slice of the coordinate grid. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Use to_polars() instead. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

…straint _matrix_export_data becomes a method on Constraint instead of a module-level function. ncons, lhs, and to_matrix are now abstract in ConstraintBase, with xarray-based implementations on Constraint and CSR-based implementations on CSRConstraint. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

…_properties call

…array round-trip - rhs setter writes _rhs directly, rejects expressions - lhs setter raises AttributeError (call .mutable() to modify terms) - lhs getter skips mutable() wrapper, builds LinearExpression from _to_dataset - to_polars uses pl.lit for scalar sign

…mypy

…odify

FabianHofmann · 2026-05-11T10:45:06Z

@coroa thanks for the follow up. I just ran the tests in pypsa to confirm the implementation works, all lopf tests pass. I would say let's merge this do not longer create merge conflicts (I would like to get the ball rolling for solver refac). we can always follow up further

FBumann · 2026-05-11T12:54:29Z

Amazing that you are progressing 👍

coroa added 10 commits March 19, 2026 00:43

perf: add to_matrix_via_csr

fb9f6ab

perf: improve per-constraint csr matrix construction

80a8b70

Add conversion functions

89115c2

feat: add ability to freeze constraints into csr

bcd0228

Add io.to_netcdf support for frozen Constraint

b4dc1ea

fix: re-implement matrices

304a2e7

Move sum_duplicates

0fc2673

feat: VariableLabelIndex

3c8c5d6

fix: until solve

0b9de00

fix: disentangle range and ncons

1122b16

coroa requested review from FBumann and FabianHofmann March 23, 2026 09:28

fix: don't freeze if model is chunked

19125ac

coroa force-pushed the perf/matrix-accessor-rewrite branch from 7259d42 to 19125ac Compare March 23, 2026 09:53