Search Docs...
⌘ K
OverviewDeploymentManagementOperationReferenceGlossary
    ACOS 6.2.0
  • Acrfra Cloud Operation System cluster>
  • ACOS operations and maintenance

Routine checks

After an ACOS cluster is officially brought online, IT infrastructure administrators or operations engineers should regularly log in to the cluster management platform to monitor the operational status of the ACOS cluster. This includes analyzing logs and alert information in a timely manner to identify issues and ensure stable cluster operations.

Viewing resources

You can view the operational status and resource usage of virtual machines, hosts, and clusters on AOC.

  • Virtual machine: On the AOC main page, you can select an organization, data center, cluster, an availability zone of an active-active cluster, or a host in the left sidebar. Then, click VM on the right, and all virtual machines within the selected range will be displayed as a resource list.

  • Host: On the AOC main page, you can select an organization, data center, cluster, or an availability zone of an active-active cluster in the left sidebar. Then, click the Host tab, and all hosts within the selected range will be displayed as a resource list.

  • Cluster: On the AOC main page, you can select an organization or data center in the left sidebar. Then, click the Cluster widget, and all clusters within the selected range will be displayed as a resource list.

Viewing monitoring data

With the monitoring and analysis feature, you can analyze resource usage by monitoring metrics for a variety of resource objects, such as virtual machines and hosts within a cluster, and organizing and presenting the data in views and charts.

In addition to monitoring and analysis provided by ACOS, you can implement more comprehensive monitoring by deploying the observability and advanced monitoring services.

  • With the observability service, you can scale CPU, memory, and storage resources to ensure stable monitoring performance even if the data size increases significantly. It also enables deeper analysis and processing of monitoring data.
  • Advanced monitoring supports longer data retention and allows you to customize CPU, memory, and storage resources.

Compared with advanced monitoring, the observability service provides more stable data collection and storage, more comprehensive alerting capabilities, and unified data processing across clusters. It is recommended to deploy the observability service as the preferred monitoring solution. If a cluster is already using advanced monitoring, it is also recommended to deploy the observability service, and migrate the monitoring data from advanced monitoring to the observability service.

Viewing alerts

Alert information includes the cause, potential impact, and recommended solution. You can use alerts to evaluate the operational status of a cluster and identify specific issues.

When the observability service is associated, you can send cluster alerts to designated email addresses, webhooks, or third-party monitoring platforms for further alert notifications.

Viewing logs

You can collect operating system logs, ACOS service logs, core dump logs, and node information.

When the observability service is associated, you can use the log search feature to quickly search logs that meet specific conditions. You can also use the log transmission feature to send logs to a configured Syslog server in real time and store the logs on third-party servers.

Viewing events

Super administrators, Operations & management administrators, and Security auditors can view the event audit information to trace operational events.