Preparation
Before you shut down the node and replace the memory module, check the status of the new memory module and the cluster as follows to ensure proper preparation.
Check the problematic memory module in the host and the memory module to be installed. Ensure that both the model and clock speed of the two memory modules are the same.
For ACOS (VMware ESXi) clusters, refer to the official VMware documentation to manually migrate all virtual machines from the ESXi host whose memory is to be replaced to other ESXi hosts.
Operation recommendation
Taking snapshots will disable the agile recovery mechanism when the host is in maintenance mode. Therefore, it is recommended to replace the memory module at a time when no snapshot operations are being performed to avoid triggering a snapshot during host maintenance.
If special circumstances require adding a memory module during a snapshot operation, make sure you understand the risk described above before proceeding.
Procedure
Log in to AOC and set the node whose memory module is to be replaced to maintenance mode.
Shut down the node in AOC.
Replace the memory module and record the slot location.
Power on the server. Log in to the IPMI console of the node and verify that the newly added memory module has been recognized by the IPMI console.
Log in to AOC and exit maintenance mode for this node.
Log in to this node using SSH and run the following command to check whether all services are running properly.
sudo /usr/share/tuna/script/control_all_services.sh --action=status --group=role
Log in to AOC, go to the Overview page of the host, and check the Memory field under Host information to confirm that the memory capacity is displayed as expected.