Skip to main content

Monitoring metrics

Explains the metrics provided by Monitoring service in KakaoCloud.
These metrics are used in Custom dashboards, Metric Explorer, Metric Export, and Alert Center for metric-based alert policies provided by Monitoring.

Virtual Machine, GPU, Bare Metal Server Metrics

Service Scope

Monitoring - Custom Dashboard, Metric Explorer, Metric Export, Alert Center - Alert Policies (Metric-based)

info

The metrics mem_buffered, mem_cached, and disk_inodes_usage are collected and provided only on servers with Linux OS.
The nvidia_smi metric is only collected on servers with a GPU.

info

When updating the NVIDIA library on GPU instances, please ensure compatibility between the library version and CUDA version.
If version compatibility issues arise due to updates via apt upgrade or similar methods, the monitoring agent installed by the user may fail to collect metrics related to Nvidia.

info

If an alert policy is set for the network_rx_bytes_persec metric in Alert Center, it applies to the entire network interface. That is, if the instance has multiple network interfaces, an alarm will be triggered if any of the connected network interfaces exceed the set threshold.

Metric NameDescriptionUnit
cpu_usageMeasures the overall CPU usage%
cpu_usage_iowaitCPU usage, CPU state: iowait%
cpu_usage_systemCPU usage, CPU state: system%
cpu_usage_userCPU usage, CPU state: user%
cpu_usage_per_coreMeasures the CPU usage per core%
mem_bufferedMemory usage, memory state: bufferedbytes(IEC)
mem_cachedMemory usage, memory state: cachedbytes(IEC)
mem_usedMemory usagebytes(IEC)
mem_usageMemory usage percentage%
disk_usedDisk usagebytes(IEC)
disk_used_percentDisk usage percentage%
disk_inodes_usageDisk inode usage percentage%
disk_read_bytes_persecBytes read per second from diskbytes/s(IEC)
disk_write_bytes_persecBytes written per second to diskbytes/s(IEC)
disk_read_iopsDisk I/O operations (reads) per secondcount/s
disk_write_iopsDisk I/O operations (writes) per secondcount/s
network_rx_bytes_persecBytes received per second by network interfacebytes/s(IEC)
network_tx_bytes_persecBytes transmitted per second by network interfacebytes/s(IEC)
network_rx_packets_persecPackets received per second by network interfacepackets/s
network_tx_packets_persecPackets transmitted per second by network interfacepackets/s
nvidia_smi_memory_freeFree memory per GPU coreMiB(IEC)
nvidia_smi_memory_totalTotal memory per GPU coreMiB(IEC)
nvidia_smi_memory_usedUsed memory per GPU coreMiB(IEC)
nvidia_smi_power_drawPower draw per GPU corewatt
nvidia_smi_utilization_gpuGPU usage percentage per core%

Libvirt metrics

Service Scope

Monitoring - Metric Export

Metric NameDescriptionUnit
libvirt_domain_info_cpu_time_seconds_totalTotal CPU time usedcount
libvirt_domain_info_virtual_cpusNumber of CPU corescount
libvirt_domain_block_stats_read_bytes_totalTotal bytes read from diskbytes(IEC)
libvirt_domain_block_stats_write_bytes_totalTotal bytes written to diskbytes(IEC)
libvirt_domain_interface_stats_receive_bytes_totalTotal bytes received by network interfacebytes(IEC)
libvirt_domain_interface_stats_transmit_bytes_totalTotal bytes transmitted by network interfacebytes(IEC)
info

The metrics cpu_credit_usage and cpu_credit_balance are only collected on t1i servers with Burstable option enabled (*except for t1i.medium.dns.default).

Metric NameDescriptionUnit
cpu_credit_usageCPU credit usagecount
cpu_credit_balanceRemaining CPU creditscount

Kubernetes Engine Metrics

Service Scope

Monitoring - Metric Export

Metric NameDescriptionUnit
cluster_autoscaler_node_group_min_countMinimum node count during node group autoscalingcount
cluster_autoscaler_node_group_max_countMaximum node count during node group autoscalingcount
cluster_autoscaler_node_group_target_countTarget node count during node group autoscalingcount
kube_node_status_allocatableAmount of resources allocatable to pods on the nodenone
kube_node_status_capacityTotal available resources on the nodenone
node_cpu_seconds_totalTime spent in each CPU mode on nodecount
node_filesystem_avail_bytesAvailable file system space for non-root usersbytes(IEC)
node_filesystem_size_bytesFile system size on nodebytes(IEC)
node_memory_Active_bytesActive memory that is currently used or reusablebytes(IEC)
node_memory_Buffers_bytesBuffer memory used by the kernel for disk block I/Obytes(IEC)
node_memory_Cached_bytesMemory used for file system cachebytes(IEC)
node_memory_MemAvailable_bytesEstimated available memory that can be used by new processesbytes(IEC)
node_memory_MemFree_bytesCurrently unused memorybytes(IEC)
node_memory_MemTotal_bytesTotal memory on the nodebytes(IEC)
node_memory_SReclaimable_bytesReclaimable memory in slab cachebytes(IEC)
node_network_receive_bytes_totalTotal bytes received by node's network devicesbytes(IEC)
node_network_transmit_bytes_totalTotal bytes transmitted by node's network devicesbytes(IEC)
kube_pod_container_infoInformation about containers within a podnone
kube_pod_container_resource_limitsResource limits requested by the containernone
kube_pod_container_resource_requestsResource values requested by the containernone
kube_pod_container_status_runningWhether the container is in Running statecount
kube_pod_container_status_terminatedWhether the container is in Terminated statecount
kube_pod_infoInformation about the podnone
container_cpu_usage_seconds_totalTotal CPU time consumed by the containercount
container_memory_working_set_bytesMemory used by the container that cannot be reclaimed by the OSbytes(IEC)
container_network_receive_bytes_totalTotal network bytes received by the containerbytes(IEC)
container_network_transmit_bytes_totalTotal network bytes transmitted by the containerbytes(IEC)

Load Balancing Metrics

Service Scope

Monitoring - Custom Dashboard, Metric Explorer, Metric Export, Alert Center - Alert Policies (Metric-based)

Metric NameDescriptionUnit
lb_bytes_in_persecInbound trafficbytes/s(IEC)
lb_bytes_out_persecOutbound trafficbytes/s(IEC)
lb_connections_persecNumber of connections per secondcount/s
lb_current_connectionsNumber of connected connectionscount

MySQL Metrics

Service Scope

Monitoring - Custom Dashboard, Metric Explorer, Metric Export, Alert Center - Alert Policies (Metric-based)

Metric NameDescriptionUnit
mem_swap_totalTotal swap memorybytes(IEC)
mem_swap_cachedCached swap memorybytes(IEC)
mem_swap_freeFree swap memorybytes(IEC)
mysql_logstorage_disk_write_bytes_persecBytes written per second to the log storage diskbytes/s(IEC)
mysql_defaultstorage_disk_write_bytes_persecBytes written per second to the default storage diskbytes/s(IEC)
mysql_logstorage_disk_read_bytes_persecBytes read per second from the log storage diskbytes/s(IEC)
mysql_defaultstorage_disk_read_bytes_persecBytes read per second from the default storage diskbytes/s(IEC)
mysql_logstorage_disk_write_iopsWrite operations completed per second on log storage diskcount/s
mysql_defaultstorage_disk_write_iopsWrite operations completed per second on default storage diskcount/s
mysql_logstorage_disk_read_iopsRead operations completed per second on log storage diskcount/s
mysql_defaultstorage_disk_read_iopsRead operations completed per second on default storage diskcount/s
mysql_logstorage_disk_usedLog storage disk usagebytes(IEC)
mysql_defaultstorage_disk_usedDefault storage disk usagebytes(IEC)
mysql_defaultstorage_disk_used_percentDefault storage disk usage percentage%
mysql_logstorage_disk_used_percentLog storage disk usage percentage%
mysql_logstorage_disk_inodes_usageLog storage inode usage percentage%
mysql_defaultstorage_disk_inodes_usageDefault storage inode usage percentage%
mysql_network_rx_bytes_persecBytes received per second by network interfacebytes/s(IEC)
mysql_network_tx_bytes_persecBytes transmitted per second by network interfacebytes/s(IEC)
mysql_network_rx_packets_persecPackets received per second by network interfacepackets/s
mysql_network_tx_packets_persecPackets transmitted per second by network interfacepackets/s
mysql_innodb_row_lock_current_waitsNumber of current row lock waitscount
mysql_binary_size_bytesSize of binary logsbytes(IEC)
mysql_binary_files_countNumber of binary log filescount
mysql_variables_max_binlog_sizeMaximum binary log sizebytes(IEC)
mysql_connections_countNumber of connected connectionscount
mysql_slow_query_countNumber of slow queries in the last 5 minutescount
mysql_com_insert_countNumber of INSERT queries in the last 5 minutescount
mysql_com_select_countNumber of SELECT queries in the last 5 minutescount
mysql_com_delete_countNumber of DELETE queries in the last 5 minutescount
mysql_com_commit_countNumber of COMMIT queries in the last 5 minutescount
mysql_com_update_countNumber of UPDATE queries in the last 5 minutescount
mysql_query_persecQueries per second (QPS)count/s
mysql_connection_usage_percentPercentage of connections used compared to max connections%
mysql_innodb_buffer_pool_read_requestsTotal buffer pool read requestscount
mysql_innodb_row_lock_timeRow lock timemilliseconds
mysql_innodb_buffer_pool_readsNumber of buffer pool readscount
mysql_innodb_buffer_cache_hit_ratioMySQL InnoDB buffer pool cache hit ratio%
mysql_uptimeUptime durationduration
mysql_instance_statusInstance statuscount
mysql_instance_group_statusInstance group statuscount
mysql_replication_lagBinlog replication lagseconds
mysql_max_connections_countMaximum number of connectionscount

PostgreSQL Metrics

Service Scope

Monitoring - Custom Dashboard, Metric Explorer, Metric Export, Alert Center - Alert Policies (Metric-based)

Metric NameDescriptionUnit
pg_active_connectionsNumber of active PostgreSQL connectionscount
pg_active_transactionsNumber of active PostgreSQL transactionscount
pg_buffer_hit_ratioPostgreSQL buffer hit ratio%
pg_defaultstorage_disk_inodes_usageDefault storage inode usage percentage%
pg_defaultstorage_disk_read_bytes_persecBytes read per second from default storage diskbytes/s(IEC)
pg_defaultstorage_disk_read_iopsRead operations per second on default storage diskcount/s
pg_defaultstorage_disk_usedDefault storage disk usagebytes(IEC)
pg_defaultstorage_disk_used_percentDefault storage disk usage percentage%
pg_defaultstorage_disk_write_bytes_persecBytes written per second to default storage diskbytes/s(IEC)
pg_defaultstorage_disk_write_iopsWrite operations per second on default storage diskcount/s
pg_lock_sessionsNumber of lock sessions in PostgreSQLcount
pg_logstorage_disk_inodes_usageLog storage inode usage percentage%
pg_logstorage_disk_read_bytes_persecBytes read per second from log storage diskbytes/s(IEC)
pg_logstorage_disk_read_iopsRead operations per second from log storage diskcount/s
pg_logstorage_disk_usedLog storage disk usagebytes(IEC)
pg_logstorage_disk_used_percentLog storage disk usage percentage%
pg_logstorage_disk_write_bytes_persecBytes written per second to log storage diskbytes/s(IEC)
pg_logstorage_disk_write_iopsWrite operations per second to log storage diskcount/s
pg_network_rx_bytes_persecBytes received per second by network interfacebytes/s(IEC)
pg_network_rx_packets_persecPackets received per second by network interfacepackets/s
pg_network_tx_bytes_persecBytes transmitted per second by network interfacebytes/s(IEC)
pg_network_tx_packets_persecPackets transmitted per second by network interfacepackets/s
pg_replication_lagPostgreSQL replication lagseconds
pg_temp_file_ratio_per_groupPostgreSQL temporary file usage per instance group%
pg_total_connectionsTotal number of PostgreSQL connectionscount
pg_total_deadlocksTotal number of deadlocks in PostgreSQLcount
pg_xid_age_per_groupPostgreSQL vacuum xid per instance groupcount

MemStore Metrics

Service Scope

Monitoring - Custom Dashboard, Metric Explorer, Metric Export, Alert Center - Alert Policies (Metric-based)

Metric NameDescriptionUnit
memstore_allocator_rss_bytesRSS memory sizebytes(IEC)
memstore_clientsNumber of connected clientscount
memstore_connected_slavesNumber of connected replicascount
memstore_evicted_keysNumber of evicted keys due to maxmemory limitcount
memstore_expired_keysNumber of expired keyscount
memstore_instantaneous_ops_per_secCommands processed per secondcount
memstore_client_ratioRatio of current clients to max clients%
memstore_memory_usageMemory usage percentage in MemStore instance%
memstore_keyspace_hitsNumber of keyspace hitscount
memstore_keyspace_missesNumber of keyspace missescount
memstore_maxclientsMaximum number of clients allowedcount
memstore_maxmemoryMaximum available memorybytes(IEC)
memstore_replication_lagReplication lag times
memstore_uptimeUptime durations
memstore_used_memoryUsed memory in MemStorebytes(IEC)
memstore_cmdstat_calls_persecNumber of command calls per secondcount/s
memstore_keyspace_hitrate_percentKeyspace hit rate percentage%
memstore_lru_clockLRU (Least Recently Used) time value for managing algorithmcount
memstore_blocked_clientsNumber of clients waiting due to BLPOP, BRPOP, etc.count
memstore_cluster_connectionsNumber of sockets used by the cluster buscount
memstore_allocator_activeActive memory in allocator, including external fragmentationbytes(IEC)
memstore_allocator_allocatedAllocated memory in allocator, including internal fragmentationbytes(IEC)
memstore_allocator_residentResident memory managed by allocator, including OS-returnable memorybytes(IEC)
memstore_allocator_frag_bytesDifference between activated memory and allocated memorybytes(IEC)
memstore_allocator_frag_ratioRatio between activated memory and allocated memory%
memstore_allocator_rss_ratioRatio between resident memory and activated memory%
memstore_lazyfree_pending_objectsNumber of objects waiting to be freed due to UNLINK calls or ASYNC optioncount
memstore_lazyfreed_objectsNumber of objects freed via Lazy Free processcount
memstore_mem_fragmentation_bytesDifference between used resident memory and allocated memorybytes(IEC)
memstore_mem_fragmentation_ratioRatio between used resident memory and allocated memory%
memstore_mem_not_counted_for_evictMemory not counted for eviction due to temporary replicas and AOF buffersbytes(IEC)
memstore_rss_overhead_bytesDifference between MemStore process resident memory and allocator's resident memorybytes(IEC)
memstore_rss_overhead_ratioRatio between MemStore process resident memory and allocator's resident memory%
memstore_total_system_memoryTotal system memory where MemStore is runningbytes(IEC)
memstore_used_memory_datasetMemory used for actual data storage, considering overhead memorybytes(IEC)
memstore_used_memory_dataset_percPercentage of memory used for actual data storage%
memstore_used_memory_luaMemory used by Lua engine for script executionbytes(IEC)
memstore_used_memory_overheadMemory needed for managing internal data structuresbytes(IEC)
memstore_used_memory_peakMaximum memory used by MemStorebytes(IEC)
memstore_used_memory_peak_percPercentage of maximum memory used relative to total memory%
memstore_used_memory_rssResident set size memory assigned by the OSbytes(IEC)
memstore_instantaneous_input_kbpsData read from network per secondKiB/s(IEC)
memstore_instantaneous_output_kbpsData written to network per secondKiB/s(IEC)
memstore_io_threaded_reads_processedTotal number of read events processed by main and I/O threadscount
memstore_io_threaded_writes_processedTotal number of write events processed by main and I/O threadscount
memstore_pubsub_channelsNumber of pub/sub channels subscribed by clientscount
memstore_pubsub_patternsNumber of pub/sub patterns subscribed by clientscount
memstore_total_commands_processedTotal number of commands processed by the servercount
memstore_total_connections_receivedTotal number of connections accepted by the servercount
memstore_total_error_repliesTotal number of error replies (sum of rejected

and failed commands) | count | | memstore_total_net_input_bytes | Total network input bytes | bytes(IEC) | | memstore_total_net_output_bytes | Total network output bytes | bytes(IEC) | | memstore_total_reads_processed | Total number of read events processed | count | | memstore_total_writes_processed | Total number of write events processed | count | | memstore_used_cpu_sys | System CPU used by all threads (main and background) of the server | count | | memstore_used_cpu_sys_main_thread | System CPU used by the main thread of the server | count | | memstore_used_cpu_user | User CPU used by all threads (main and background) of the server | count | | memstore_used_cpu_user_main_thread | User CPU used by the main thread of the server | count | | memstore_cluster_enabled | Whether the cluster is enabled | count |

info

The metrics cpu_credit_usage and cpu_credit_balance are only collected on clusters where the flavor is set to t1i.

Metric NameDescriptionUnit
cpu_credit_usageCPU credit usagecount
cpu_credit_balanceRemaining CPU creditscount

Pub/Sub Metrics

Service Scope

Monitoring - Custom Dashboard, Metric Explorer, Alert Center - Alert Policies (Metric-based)

Metric NameDescriptionUnit
pubsub_published_message_count_persecMessages published per secondcount/s
pubsub_published_message_bytes_persecBytes of messages published per secondbytes/s(IEC)
pubsub_publish_request_count_persecPublish requests per secondcount/s
pubsub_topic_storage_used_bytesTopic storage usedbytes(IEC)
pubsub_seek_request_count_perminSeek requests per 5 minutescount
pubsub_ack_request_count_persecAcknowledgment requests per secondcount/s
pubsub_acked_message_count_persecAcknowledged messages per secondcount/s
pubsub_unprocessed_messagesNumber of unprocessed messagescount
pubsub_pulled_message_count_persecMessages pulled per secondcount/s
pubsub_streaming_pull_response_count_persecStreaming pull responses per secondcount/s
pubsub_push_count_persecPush requests per secondcount/s
pubsub_pushed_message_count_persecPushed messages per secondcount/s
pubsub_subscription_storage_used_bytesSubscription storage usedbytes(IEC)
pubsub_exported_message_count_persecMessages exported to Object Storage per secondcount/s
pubsub_object_storage_api_call_count_perminObject Storage API calls per minutecount/m

Direct Connect Metrics

Service Scope

Monitoring - Metric Export

Metric NameDescriptionUnit
dx_virtual_intrerface_stateDirect Connect virtual interface statecount
dx_virtual_interface_output_packets_persecPackets transmitted per second by Direct Connect virtual interfacepackets/s
dx_virtual_interface_output_bits_persecBits transmitted per second by Direct Connect virtual interfacebits/s(IEC)
dx_virtual_interface_input_packets_persecPackets received per second by Direct Connect virtual interfacepackets/s
dx_virtual_interface_input_bits_persecBits received per second by Direct Connect virtual interfacebits/s(IEC)