When SlotNotCoveredError is raised, the cluster topology should be reinitialized as part of error handling and retrying of the commands. by petyaslavova · Pull Request #3621 · redis/redis-py (original) (raw)

Pull Request check-list

Please make sure to review and check all of these items:

Do tests and lints pass with this change?
Do the CI tests pass with this change (enable it first in your forked repo and wait for the github action build to finish)?
Is the new or changed code fully tested?
Is a documentation update included (if this change modifies existing APIs, or introduces new ones)?
Is there an example added to the examples folder (if applicable)?
Was the change added to CHANGES file?

NOTE: these things are not required to open a PR and can be done afterwards / while the PR is open.

Description of change

Fixes issue #3620

When we are handling different errors, we run cluster reinitialization that calls CLUSTER SLOTS.
Sometimes the cluster slots temporarily might not be fully covered and if we try to execute a command against key from the missing slots we receive SlotNotCoveredError.
The problem is that after this error appears we don't try to reinitialize again the slots and the Cluster client can't heal itself.

With this change I'm adding error handling for the SlotNotCoveredError, the same way we handle ClusterDownErrors - try to extract the slots coverage after a little sleep.