You need to enable JavaScript to run this app.
导航

容器服务基础指标

最近更新时间2023.10.18 16:25:38

首次发布时间2023.06.05 22:23:34

托管 Prometheus 将您上报的指标分为:云产品基础指标云产品其他指标自定义指标。指标的定义和计费方式,请参见 计费项。本文为您介绍容器服务(VKE)产品的基础指标。

apiserver 基础指标

任务名称(Job Name)指标名称
kubernetes-apiserverworkqueue_adds_total
workqueue_depth
workqueue_queue_duration_seconds_bucket
workqueue_queue_duration_seconds_count
workqueue_queue_duration_seconds_sum
workqueue_retries_total
workqueue_work_duration_seconds_bucket
rest_client_request_duration_seconds_bucket
rest_client_request_duration_seconds_count
rest_client_request_duration_seconds_sum
rest_client_requests_total
process_cpu_seconds_total
process_resident_memory_bytes
kubernetes_build_info
apiserver_admission_controller_admission_duration_seconds_bucket
apiserver_admission_controller_admission_duration_seconds_count
apiserver_admission_controller_admission_duration_seconds_sum
apiserver_admission_webhook_admission_duration_seconds_bucket
apiserver_admission_webhook_admission_duration_seconds_count
apiserver_admission_webhook_admission_duration_seconds_sum
apiserver_current_inflight_requests
apiserver_request_duration_seconds_bucket
apiserver_request_duration_seconds_count
apiserver_request_duration_seconds_sum
apiserver_request_total
apiserver_requested_deprecated_apis
apiserver_response_sizes_bucket
apiserver_response_sizes_count
apiserver_response_sizes_sum
etcd_request_duration_seconds_bucket
etcd_request_duration_seconds_count
etcd_request_duration_seconds_sum

etcd 基础指标

任务名称(Job Name)指标名称
kubernetes-etcdprocess_cpu_seconds_total
process_resident_memory_bytes
etcd_cluster_version
etcd_disk_backend_commit_duration_seconds_bucket
etcd_disk_backend_commit_duration_seconds_count
etcd_disk_backend_commit_duration_seconds_sum
etcd_disk_wal_fsync_duration_seconds_bucket
etcd_disk_wal_fsync_duration_seconds_count
etcd_disk_wal_fsync_duration_seconds_sum
etcd_network_peer_received_bytes_total
etcd_network_peer_round_trip_time_seconds_bucket
etcd_network_peer_round_trip_time_seconds_count
etcd_network_peer_round_trip_time_seconds_sum
etcd_network_peer_sent_bytes_total
etcd_network_peer_sent_failures_total
etcd_server_id
etcd_server_is_leader
etcd_server_has_leader
etcd_server_leader_changes_seen_total
etcd_server_proposals_applied_total
etcd_server_proposals_committed_total
etcd_server_proposals_failed_total
etcd_server_proposals_pending

kube-scheduler 基础指标

任务名称(Job Name)指标名称
kube-schedulerworkqueue_adds_total
workqueue_depth
workqueue_queue_duration_seconds_bucket
workqueue_queue_duration_seconds_count
workqueue_queue_duration_seconds_sum
workqueue_retries_total
workqueue_work_duration_seconds_bucket
scheduler_pending_pods
scheduler_scheduler_cache_size
process_cpu_seconds_total
process_resident_memory_bytes
kubernetes_build_info
rest_client_request_duration_seconds_bucket
rest_client_request_duration_seconds_count
rest_client_request_duration_seconds_sum
rest_client_requests_total

kube-state-metrics 基础指标

任务名称(Job Name)指标名称
kube-state-metricskube_daemonset_created
kube_daemonset_labels
kube_daemonset_status_current_number_scheduled
kube_daemonset_status_desired_number_scheduled
kube_daemonset_status_number_available
kube_daemonset_status_number_misscheduled
kube_daemonset_status_number_ready
kube_daemonset_status_number_unavailable
kube_daemonset_status_updated_number_scheduled
kube_deployment_created
kube_deployment_labels
kube_deployment_spec_replicas
kube_deployment_status_condition
kube_deployment_status_replicas
kube_deployment_status_replicas_available
kube_deployment_status_replicas_ready
kube_deployment_status_replicas_updated
kube_deployment_status_replicas_unavailable
kube_namespace_labels
kube_node_labels
kube_node_spec_unschedulable
kube_node_info
kube_node_spec_taint
kube_node_status_allocatable
kube_node_status_capacity
kube_node_status_condition
kube_persistentvolume_capacity_bytes
kube_persistentvolume_labels
kube_persistentvolume_status_phase
kube_persistentvolumeclaim_labels
kube_persistentvolumeclaim_status_condition
kube_persistentvolumeclaim_status_phase
kube_pod_container_info
kube_pod_container_resource_limits
kube_pod_container_resource_requests
kube_pod_container_status_last_terminated_reason
kube_pod_container_status_ready
kube_pod_container_status_restarts_total
kube_pod_container_status_running
kube_pod_container_status_terminated
kube_pod_container_status_terminated_reason
kube_pod_container_status_waiting
kube_pod_container_status_waiting_reason
kube_pod_info
kube_pod_owner
kube_pod_status_phase
kube_pod_created
kube_pod_labels
kube_pod_status_ready
kube_pod_status_reason
kube_replicaset_labels
kube_replicaset_spec_replicas
kube_replicaset_status_fully_labeled_replicas
kube_replicaset_status_observed_generation
kube_replicaset_status_ready_replicas
kube_replicaset_status_replicas
kube_replicaset_owner

kubelet 基础指标

任务名称(Job Name)指标名称
kubeletworkqueue_adds_total
workqueue_depth
workqueue_queue_duration_seconds_bucket
workqueue_queue_duration_seconds_count
workqueue_queue_duration_seconds_sum
workqueue_retries_total
workqueue_work_duration_seconds_bucket
storage_operation_duration_seconds_bucket
storage_operation_duration_seconds_count
storage_operation_duration_seconds_sum
storage_operation_errors_total
volume_manager_total_volumes
rest_client_request_duration_seconds_bucket
rest_client_request_duration_seconds_count
rest_client_request_duration_seconds_sum
rest_client_requests_total
process_cpu_seconds_total
process_resident_memory_bytes
kubernetes_build_info
kubelet_cgroup_manager_duration_seconds_bucket
kubelet_cgroup_manager_duration_seconds_count
kubelet_cgroup_manager_duration_seconds_sum
kubelet_node_config_error
kubelet_node_name
kubelet_certificate_manager_client_expiration_renew_errors
kubelet_certificate_manager_client_ttl_seconds
kubelet_http_inflight_requests
kubelet_http_requests_duration_seconds_bucket
kubelet_http_requests_duration_seconds_count
kubelet_http_requests_duration_seconds_sum
kubelet_http_requests_total
kubelet_pleg_relist_duration_seconds_bucket
kubelet_pleg_relist_duration_seconds_count
kubelet_pleg_relist_duration_seconds_sum
kubelet_pleg_relist_interval_seconds_bucket
kubelet_pleg_relist_interval_seconds_count
kubelet_pleg_relist_interval_seconds_sum
kubelet_pod_start_duration_seconds_bucket
kubelet_pod_start_duration_seconds_count
kubelet_pod_start_duration_seconds_sum
kubelet_pod_worker_duration_seconds_bucket
kubelet_pod_worker_duration_seconds_count
kubelet_pod_worker_duration_seconds_sum
kubelet_pod_worker_start_duration_seconds_bucket
kubelet_run_podsandbox_duration_seconds_bucket
kubelet_run_podsandbox_errors_total
kubelet_running_containers
kubelet_running_pods
kubelet_runtime_operations_duration_seconds_bucket
kubelet_runtime_operations_duration_seconds_count
kubelet_runtime_operations_duration_seconds_sum
kubelet_runtime_operations_errors_total
kubelet_runtime_operations_total
kubelet_volume_stats_available_bytes
kubelet_volume_stats_capacity_bytes
kubelet_volume_stats_inodes
kubelet_volume_stats_inodes_free
kubelet_volume_stats_inodes_used
kubelet_volume_stats_used_bytes

kubelet-cadvisor 基础指标

任务名称(Job Name)指标名称
kubelet-cadvisorcadvisor_version_info
machine_cpu_cores
machine_memory_bytes
container_cpu_cfs_periods_total
container_cpu_cfs_throttled_periods_total
container_cpu_usage_seconds_total
container_fs_inodes_free
container_fs_inodes_total
container_fs_io_current
container_fs_io_time_seconds_total
container_fs_io_time_weighted_seconds_total
container_fs_limit_bytes
container_fs_read_seconds_total
container_fs_reads_bytes_total
container_fs_reads_total
container_fs_usage_bytes
container_fs_write_seconds_total
container_fs_writes_bytes_total
container_fs_writes_total
container_memory_cache
container_memory_failcnt
container_memory_failures_total
container_memory_max_usage_bytes
container_memory_rss
container_memory_swap
container_memory_usage_bytes
container_memory_working_set_bytes
container_network_receive_bytes_total
container_network_receive_errors_total
container_network_receive_packets_dropped_total
container_network_receive_packets_total
container_network_transmit_bytes_total
container_network_transmit_errors_total
container_network_transmit_packets_dropped_total
container_network_transmit_packets_total
container_processes
container_sockets
container_start_time_seconds
container_tasks_state
container_threads
container_threads_max

node-exporter 基础指标

任务名称(Job Name)指标名称
node-exporterprocess_cpu_seconds_total
process_resident_memory_bytes
node_boot_time_seconds
node_context_switches_total
node_cpu_seconds_total
node_disk_io_now
node_disk_io_time_seconds_total
node_disk_io_time_weighted_seconds_total
node_disk_read_bytes_total
node_disk_read_time_seconds_total
node_disk_reads_completed_total
node_disk_write_time_seconds_total
node_disk_writes_completed_total
node_disk_written_bytes_total
node_exporter_build_info
node_filefd_allocated
node_filefd_maximum
node_filesystem_device_error
node_filesystem_files
node_filesystem_files_free
node_filesystem_avail_bytes
node_filesystem_free_bytes
node_filesystem_readonly
node_filesystem_size_bytes
node_ipvs_backend_connections_active
node_ipvs_backend_connections_inactive
node_ipvs_backend_weight
node_ipvs_connections_total
node_ipvs_incoming_bytes_total
node_ipvs_incoming_packets_total
node_ipvs_outgoing_bytes_total
node_ipvs_outgoing_packets_total
node_load1
node_load15
node_load5
node_memory_Buffers_bytes
node_memory_Cached_bytes
node_memory_Active_anon_bytes
node_memory_Active_bytes
node_memory_Active_file_bytes
node_memory_Inactive_anon_bytes
node_memory_Inactive_bytes
node_memory_Inactive_file_bytes
node_memory_MemAvailable_bytes
node_memory_MemFree_bytes
node_memory_MemTotal_bytes
node_memory_Slab_bytes
node_netstat_TcpExt_ListenDrops
node_netstat_TcpExt_TCPSynRetrans
node_netstat_Tcp_ActiveOpens
node_netstat_Tcp_CurrEstab
node_netstat_Tcp_InErrs
node_netstat_Tcp_OutRsts
node_netstat_Tcp_InSegs
node_netstat_Tcp_OutSegs
node_netstat_Tcp_PassiveOpens
node_netstat_Tcp_RetransSegs
node_network_receive_bytes_total
node_network_receive_drop_total
node_network_receive_errs_total
node_network_receive_packets_total
node_network_transmit_bytes_total
node_network_transmit_drop_total
node_network_transmit_packets_total
node_sockstat_TCP6_inuse
node_sockstat_TCP_alloc
node_sockstat_TCP_inuse
node_sockstat_TCP_tw
node_sockstat_UDP6_inuse
node_sockstat_UDP_inuse
node_sockstat_sockets_used
node_time_seconds
node_uname_info
node_vmstat_pgmajfault

ingress-nginx 基础指标

任务名称(Job Name)指标名称
ingress-nginxnginx_ingress_controller_build_info
nginx_ingress_controller_bytes_sent_bucket
nginx_ingress_controller_bytes_sent_count
nginx_ingress_controller_bytes_sent_sum
nginx_ingress_controller_config_hash
nginx_ingress_controller_config_last_reload_successful
nginx_ingress_controller_config_last_reload_successful_timestamp_seconds
nginx_ingress_controller_ingress_upstream_header_seconds
nginx_ingress_controller_ingress_upstream_header_seconds_count
nginx_ingress_controller_ingress_upstream_header_seconds_sum
nginx_ingress_controller_ingress_upstream_latency_seconds
nginx_ingress_controller_ingress_upstream_latency_seconds_count
nginx_ingress_controller_ingress_upstream_latency_seconds_sum
nginx_ingress_controller_leader_election_status
nginx_ingress_controller_nginx_process_connections
nginx_ingress_controller_nginx_process_connections_total
nginx_ingress_controller_nginx_process_cpu_seconds_total
nginx_ingress_controller_nginx_process_num_procs
nginx_ingress_controller_nginx_process_oldest_start_time_seconds
nginx_ingress_controller_nginx_process_read_bytes_total
nginx_ingress_controller_nginx_process_requests_total
nginx_ingress_controller_nginx_process_resident_memory_bytes
nginx_ingress_controller_nginx_process_virtual_memory_bytes
nginx_ingress_controller_nginx_process_write_bytes_total
nginx_ingress_controller_request_duration_seconds_bucket
nginx_ingress_controller_request_duration_seconds_count
nginx_ingress_controller_request_duration_seconds_sum
nginx_ingress_controller_request_size_bucket
nginx_ingress_controller_request_size_count
nginx_ingress_controller_request_size_sum
nginx_ingress_controller_requests
nginx_ingress_controller_response_duration_seconds_bucket
nginx_ingress_controller_response_duration_seconds_count
nginx_ingress_controller_response_duration_seconds_sum
nginx_ingress_controller_response_size_bucket
nginx_ingress_controller_response_size_count
nginx_ingress_controller_response_size_sum
nginx_ingress_controller_ssl_expire_time_seconds
nginx_ingress_controller_success
process_cpu_seconds_total
process_resident_memory_bytes

DCGM 基础指标

任务名称(Job Name)指标名称
dcgmDCGM_FI_DEV_GPU_TEMP
DCGM_FI_DEV_MEMORY_TEMP
DCGM_FI_DEV_NVLINK_BANDWIDTH_TOTAL
DCGM_FI_DEV_POWER_USAGE
DCGM_FI_DEV_XID_ERRORS
DCGM_FI_DEV_DEC_UTIL
DCGM_FI_DEV_ENC_UTIL
DCGM_FI_DEV_FB_FREE
DCGM_FI_DEV_FB_USED
DCGM_FI_DEV_GPU_UTIL
DCGM_FI_DEV_MEM_COPY_UTIL
DCGM_FI_DEV_SM_CLOCK

mGPU 基础指标

任务名称(Job Name)指标名称
mgpunvml_container_core_request
nvml_container_core_usage
nvml_container_core_utilization
nvml_container_mem_request
nvml_container_mem_usage
nvml_container_mem_utilization
nvml_pod_core_request
nvml_pod_core_usage
nvml_pod_core_utilization
nvml_pod_mem_request
nvml_pod_mem_usage
nvml_pod_mem_utilization