Restart entire cluster

hi

I didn’t know English so I used a translator, but thank you for your reference. ㅠㅠ

It is currently being used with 4 master nodes and 20 data nodes.

When usage on an unspecified data node is over 90%

It can be seen that the usage of other unspecified data nodes is less than 50%.

Is it a clustering problem?

For this reason, a full reboot method is used.

In this case, data loss inevitably occurs.

cluster.graceful_stop.min_availability = full cannot be added because there is an issue of adding backup equipment.

Options for smooth clustering

Is there a way to store shards without adding additional equipment?

Please
If you have any ideas please give the newbie grace

Hi,

When usage on an unspecified data node is over 90%
It can be seen that the usage of other unspecified data nodes is less than 50%.

Is this about disk space usage or CPU or memory usage?

For disk space usage you could check the Shards browser — CrateDB: Admin UI to see if rebalancing is happening or if maybe one node got larger shards because of table/partition properties.

For CPU/memory usage you may want to check if requests coming from the clients are getting balanced across the nodes.

These are all situations that can be addressed without a full cluster restart, if you want to share more details we would be happy to help.