Fix `DataArrayRolling.__iter__` with `center=True` by headtr1ck · Pull Request #6744 · pydata/xarray

I am now trying to get this to work for Datasets as well.
But the check of min_periods fails for variables that do not contain the rolling dim, since the count is 1.
Any idea how to solve this issue?

Is there a way to get a boolean, scalar DataArray/Dataset that contains this information?

Would it be useful to add a function has_dim that returns e.g.:

xr.DataArray([], dims="x").has_dim("x")  # -> xr.DataArray(True)
xr.Dataset({"x": ("x", []), "y": ("y", [])}).has_dim("x")  # -> xr.Dataset({"x": True, "y": False})

?
Since these are scalar (no dims) they are nicely broadcastable and can be used in xarray pipelines.

But maybe this already works easily and I just don't know about it?

But maybe that is already too complicated.
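For illustration, the `has_dim` idea above could be sketched as a free function. This is a hypothetical helper, not part of xarray's API, and it assumes that only data variables (not coordinates) should be reported for a Dataset:

```python
import xarray as xr


def has_dim(obj, dim):
    """Hypothetical helper: report whether `dim` is present, as scalar boolean objects."""
    if isinstance(obj, xr.Dataset):
        # One 0-d boolean DataArray per data variable (coordinates are skipped).
        return xr.Dataset(
            {name: xr.DataArray(dim in var.dims) for name, var in obj.data_vars.items()}
        )
    # For a DataArray, a single 0-d boolean DataArray.
    return xr.DataArray(dim in obj.dims)


# Example data: "a" has the dimension "x", "b" is a scalar variable without it.
example_ds = xr.Dataset({"a": ("x", [1.0, 2.0]), "b": ((), 3.0)})
flags = has_dim(example_ds, "x")
da_flag = has_dim(example_ds["a"], "x")
```

Since the results have no dims, they broadcast against any variable, which is what would make them usable inside a `where` condition.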

Does anyone have an idea how to make this:

counts = window.count(dim=self.dim[0])
window = window.where(counts >= self.min_periods)

operate only on variables that contain self.dim[0]?
If possible in a way that works both on DataArrays and Datasets :)
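One possible approach, sketched here under assumptions and not necessarily what this PR should do, is to apply the mask per variable via `Dataset.map` and leave variables without the rolling dimension untouched. The helper name `mask_by_min_periods` is made up for illustration:

```python
import numpy as np
import xarray as xr


def mask_by_min_periods(window, dim, min_periods):
    """Hypothetical helper: mask a window whose valid count along `dim` is too small.

    Variables that do not contain `dim` are returned unchanged.
    """

    def _mask(da):
        if dim not in da.dims:
            # Nothing to count along; leave the variable as-is.
            return da
        counts = da.count(dim=dim)
        return da.where(counts >= min_periods)

    if isinstance(window, xr.Dataset):
        # Dataset.map applies the function to every data variable.
        return window.map(_mask)
    return _mask(window)


# Example window: "a" has 2 valid values along "x", "b" has no "x" at all.
example_window = xr.Dataset({"a": ("x", [1.0, np.nan, 3.0]), "b": ((), 5.0)})
masked_strict = mask_by_min_periods(example_window, "x", 3)  # "a" fully masked
masked_loose = mask_by_min_periods(example_window, "x", 2)   # "a" kept
masked_da = mask_by_min_periods(example_window["a"], "x", 3)
```

This sidesteps calling `count` on variables that lack the dimension, at the cost of a Python-level loop over the data variables.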

Btw, issue #6749 is the reason why this currently does not work for Datasets.

Would you mind moving the first commit to a new PR that we can merge quickly please? It'll be easier to see any new tests you've added then


I'm quick to add scope creep xD
Done in #6777

We could merge this PR after a final review.
For now it fixes the bug in the issue.

For `DatasetRolling.__iter__` support, we could either open a new issue or just wait until someone requests it :)

dcherian added a commit to keewis/xarray that referenced this pull request

Jul 22, 2022

main: (313 commits)
Update whats-new
Release notes for v2022.06.0 (pydata#6815)
Drop multi-indexes when assigning to a multi-indexed variable (pydata#6798)
Support NumPy array API (experimental) (pydata#6804)
Add cumsum to DatasetGroupBy (pydata#6525)
Refactor groupby binary ops code. (pydata#6789)
Update DataArray.rename + docu (pydata#6665)
Switch to T_DataArray and T_Dataset in concat (pydata#6784)
Fix typos found by codespell (pydata#6794)
Update groupby attrs tests (pydata#6787)
Update map_blocks to use chunksizes property. (pydata#6776)
Fix DataArrayRolling.__iter__ with center=True (pydata#6744)
[test-upstream] Update flox repo URL (pydata#6780)
Move _infer_meta_data and _parse_size to utils (pydata#6779)
Make the sel error more descriptive when method is unset (pydata#6774)
Move Rolling tests to their own testing module (pydata#6777)
[pre-commit.ci] pre-commit autoupdate (pydata#6773)
move da and ds fixtures to conftest.py (pydata#6730)
Bump EnricoMi/publish-unit-test-result-action from 1 to 2 (pydata#6770)
Type shape methods (pydata#6767)
...
