You need to enable JavaScript to run this app.
导航
容器服务基础指标
最近更新时间:2024.08.15 20:14:19首次发布时间:2023.06.05 22:23:34

托管 Prometheus 将您上报的指标分为:云产品基础指标云产品其他指标自定义指标。指标的定义和计费方式,请参见 计费项。本文为您介绍容器服务(VKE)产品的基础指标。

apiserver 基础指标

任务名称(Job Name)指标名称
kubernetes-apiserverworkqueue_adds_total
workqueue_depth
workqueue_queue_duration_seconds_bucket
workqueue_queue_duration_seconds_count
workqueue_queue_duration_seconds_sum
workqueue_retries_total
workqueue_work_duration_seconds_bucket
rest_client_request_duration_seconds_bucket
rest_client_request_duration_seconds_count
rest_client_request_duration_seconds_sum
rest_client_requests_total
process_cpu_seconds_total
process_resident_memory_bytes
kubernetes_build_info
apiserver_admission_controller_admission_duration_seconds_bucket
apiserver_admission_controller_admission_duration_seconds_count
apiserver_admission_controller_admission_duration_seconds_sum
apiserver_admission_webhook_admission_duration_seconds_bucket
apiserver_admission_webhook_admission_duration_seconds_count
apiserver_admission_webhook_admission_duration_seconds_sum
apiserver_current_inflight_requests
apiserver_request_duration_seconds_bucket
apiserver_request_duration_seconds_count
apiserver_request_duration_seconds_sum
apiserver_request_total
apiserver_requested_deprecated_apis
apiserver_response_sizes_bucket
apiserver_response_sizes_count
apiserver_response_sizes_sum
etcd_request_duration_seconds_bucket
etcd_request_duration_seconds_count
etcd_request_duration_seconds_sum
apiserver_request_useragent_total
apiserver_requested_deprecated_apis

etcd 基础指标

任务名称(Job Name)指标名称
kubernetes-etcdprocess_cpu_seconds_total
process_resident_memory_bytes
etcd_cluster_version
etcd_disk_backend_commit_duration_seconds_bucket
etcd_disk_backend_commit_duration_seconds_count
etcd_disk_backend_commit_duration_seconds_sum
etcd_disk_wal_fsync_duration_seconds_bucket
etcd_disk_wal_fsync_duration_seconds_count
etcd_disk_wal_fsync_duration_seconds_sum
etcd_network_peer_received_bytes_total
etcd_network_peer_round_trip_time_seconds_bucket
etcd_network_peer_round_trip_time_seconds_count
etcd_network_peer_round_trip_time_seconds_sum
etcd_network_peer_sent_bytes_total
etcd_network_peer_sent_failures_total
etcd_server_id
etcd_server_is_leader
etcd_server_has_leader
etcd_server_leader_changes_seen_total
etcd_server_proposals_applied_total
etcd_server_proposals_committed_total
etcd_server_proposals_failed_total
etcd_server_proposals_pending
etcd_mvcc_db_total_size_in_use_in_bytes

kube-scheduler 基础指标

任务名称(Job Name)指标名称
kube-schedulerworkqueue_adds_total
workqueue_depth
workqueue_queue_duration_seconds_bucket
workqueue_queue_duration_seconds_count
workqueue_queue_duration_seconds_sum
workqueue_retries_total
workqueue_work_duration_seconds_bucket
scheduler_pending_pods
scheduler_scheduler_cache_size
process_cpu_seconds_total
process_resident_memory_bytes
kubernetes_build_info
rest_client_request_duration_seconds_bucket
rest_client_request_duration_seconds_count
rest_client_request_duration_seconds_sum
rest_client_requests_total
scheduler_e2e_scheduling_duration_seconds_bucket
scheduler_pod_scheduling_attempts_sum
scheduler_preemption_attempts_total
scheduler_scheduling_algorithm_duration_seconds_count

kube-state-metrics 基础指标

任务名称(Job Name)指标名称
kube-state-metricskube_daemonset_created
kube_daemonset_labels
kube_daemonset_status_current_number_scheduled
kube_daemonset_status_desired_number_scheduled
kube_daemonset_status_number_available
kube_daemonset_status_number_misscheduled
kube_daemonset_status_number_ready
kube_daemonset_status_number_unavailable
kube_daemonset_status_updated_number_scheduled
kube_deployment_created
kube_deployment_labels
kube_deployment_spec_replicas
kube_deployment_status_condition
kube_deployment_status_replicas
kube_deployment_status_replicas_available
kube_deployment_status_replicas_ready
kube_deployment_status_replicas_updated
kube_deployment_status_replicas_unavailable
kube_namespace_labels
kube_node_labels
kube_node_spec_unschedulable
kube_node_info
kube_node_spec_taint
kube_node_status_allocatable
kube_node_status_capacity
kube_node_status_condition
kube_persistentvolume_capacity_bytes
kube_persistentvolume_labels
kube_persistentvolume_status_phase
kube_persistentvolumeclaim_labels
kube_persistentvolumeclaim_status_condition
kube_persistentvolumeclaim_status_phase
kube_pod_container_info
kube_pod_container_resource_limits
kube_pod_container_resource_requests
kube_pod_container_status_last_terminated_reason
kube_pod_container_status_ready
kube_pod_container_status_restarts_total
kube_pod_container_status_running
kube_pod_container_status_terminated
kube_pod_container_status_terminated_reason
kube_pod_container_status_waiting
kube_pod_container_status_waiting_reason
kube_pod_info
kube_pod_owner
kube_pod_status_phase
kube_pod_created
kube_pod_labels
kube_pod_status_ready
kube_pod_status_reason
kube_replicaset_labels
kube_replicaset_spec_replicas
kube_replicaset_status_fully_labeled_replicas
kube_replicaset_status_observed_generation
kube_replicaset_status_ready_replicas
kube_replicaset_status_replicas
kube_replicaset_owner
kube_statefulset_status_replicas_ready
kube_statefulset_status_replicas_available
kube_statefulset_created
kube_statefulset_replicas
kube_statefulset_status_replicas_updated
kube_namespace_created
kube_persistentvolumeclaim_resource_requests_storage_bytes
kube_statefulset_status_replicas_updated
kube_horizontalpodautoscaler_info
kube_horizontalpodautoscaler_metadata_generation
kube_horizontalpodautoscaler_spec_max_replicas
kube_horizontalpodautoscaler_spec_min_replicas
kube_horizontalpodautoscaler_spec_target_metric
kube_horizontalpodautoscaler_status_condition
kube_horizontalpodautoscaler_status_current_replicas
kube_horizontalpodautoscaler_status_desired_replicas
kube_horizontalpodautoscaler_status_target_metric
kube_horizontalpodautoscaler_annotations
kube_horizontalpodautoscaler_labels

kubelet 基础指标

任务名称(Job Name)指标名称
kubeletworkqueue_adds_total
workqueue_depth
workqueue_queue_duration_seconds_bucket
workqueue_queue_duration_seconds_count
workqueue_queue_duration_seconds_sum
workqueue_retries_total
workqueue_work_duration_seconds_bucket
storage_operation_duration_seconds_bucket
storage_operation_duration_seconds_count
storage_operation_duration_seconds_sum
storage_operation_errors_total
volume_manager_total_volumes
rest_client_request_duration_seconds_bucket
rest_client_request_duration_seconds_count
rest_client_request_duration_seconds_sum
rest_client_requests_total
process_cpu_seconds_total
process_resident_memory_bytes
kubernetes_build_info
kubelet_cgroup_manager_duration_seconds_bucket
kubelet_cgroup_manager_duration_seconds_count
kubelet_cgroup_manager_duration_seconds_sum
kubelet_node_config_error
kubelet_node_name
kubelet_certificate_manager_client_expiration_renew_errors
kubelet_certificate_manager_client_ttl_seconds
kubelet_http_inflight_requests
kubelet_http_requests_duration_seconds_bucket
kubelet_http_requests_duration_seconds_count
kubelet_http_requests_duration_seconds_sum
kubelet_http_requests_total
kubelet_pleg_relist_duration_seconds_bucket
kubelet_pleg_relist_duration_seconds_count
kubelet_pleg_relist_duration_seconds_sum
kubelet_pleg_relist_interval_seconds_bucket
kubelet_pleg_relist_interval_seconds_count
kubelet_pleg_relist_interval_seconds_sum
kubelet_pod_start_duration_seconds_bucket
kubelet_pod_start_duration_seconds_count
kubelet_pod_start_duration_seconds_sum
kubelet_pod_worker_duration_seconds_bucket
kubelet_pod_worker_duration_seconds_count
kubelet_pod_worker_duration_seconds_sum
kubelet_pod_worker_start_duration_seconds_bucket
kubelet_run_podsandbox_duration_seconds_bucket
kubelet_run_podsandbox_errors_total
kubelet_running_containers
kubelet_running_pods
kubelet_runtime_operations_duration_seconds_bucket
kubelet_runtime_operations_duration_seconds_count
kubelet_runtime_operations_duration_seconds_sum
kubelet_runtime_operations_errors_total
kubelet_runtime_operations_total
kubelet_volume_stats_available_bytes
kubelet_volume_stats_capacity_bytes
kubelet_volume_stats_inodes
kubelet_volume_stats_inodes_free
kubelet_volume_stats_inodes_used
kubelet_volume_stats_used_bytes
kubelet_pleg_discard_events
kubelet_pleg_last_seen_seconds
apiserver_storage_data_key_generation_failures_total
get_token_fail_count
kubelet_started_containers_errors_total
kubelet_started_pods_errors_total
process_max_fds
process_open_fds
rest_client_response_size_bytes_bucket
rest_client_response_size_bytes_sum
rest_client_response_size_bytes_count
volume_operation_total_seconds_bucket
volume_operation_total_seconds_sum
volume_operation_total_seconds_count
kubelet_cpu_manager_pinning_errors_total
apiserver_audit_event_total
apiserver_audit_requests_rejected_total
csi_operations_seconds_bucket
csi_operations_seconds_count
csi_operations_seconds_sum

kubelet-cadvisor 基础指标

任务名称(Job Name)指标名称
kubelet-cadvisorcadvisor_version_info
machine_cpu_cores
machine_memory_bytes
container_cpu_cfs_periods_total
container_cpu_cfs_throttled_periods_total
container_cpu_usage_seconds_total
container_fs_inodes_free
container_fs_inodes_total
container_fs_io_current
container_fs_io_time_seconds_total
container_fs_io_time_weighted_seconds_total
container_fs_limit_bytes
container_fs_read_seconds_total
container_fs_reads_bytes_total
container_fs_reads_total
container_fs_usage_bytes
container_fs_write_seconds_total
container_fs_writes_bytes_total
container_fs_writes_total
container_memory_cache
container_memory_failcnt
container_memory_failures_total
container_memory_max_usage_bytes
container_memory_rss
container_memory_swap
container_memory_usage_bytes
container_memory_working_set_bytes
container_network_receive_bytes_total
container_network_receive_errors_total
container_network_receive_packets_dropped_total
container_network_receive_packets_total
container_network_transmit_bytes_total
container_network_transmit_errors_total
container_network_transmit_packets_dropped_total
container_network_transmit_packets_total
container_processes
container_sockets
container_start_time_seconds
container_tasks_state
container_threads
container_threads_max
container_cpu_cfs_throttled_seconds_total
container_cpu_load_average_10s
container_cpu_user_seconds_total
container_file_descriptors
container_spec_cpu_period
container_spec_cpu_quota
container_spec_memory_reservation_limit_bytes
container_spec_memory_swap_limit_bytes
machine_cpu_sockets
machine_scrape_error
container_blkio_device_usage_total
container_oom_events_total
machine_cpu_physical_cores
container_spec_memory_limit_bytes
container_cpu_system_seconds_total
container_oom_events_total
pod_cpu_usage_seconds_total
pod_memory_working_set_bytes
vci_pod_cpu_resource
vci_pod_memory_resource
vci_pod_gpu_resource

node-exporter 基础指标

任务名称(Job Name)指标名称
node-exporterprocess_cpu_seconds_total
process_resident_memory_bytes
node_boot_time_seconds
node_context_switches_total
node_cpu_seconds_total
node_disk_io_now
node_disk_io_time_seconds_total
node_disk_io_time_weighted_seconds_total
node_disk_read_bytes_total
node_disk_read_time_seconds_total
node_disk_reads_completed_total
node_disk_write_time_seconds_total
node_disk_writes_completed_total
node_disk_written_bytes_total
node_exporter_build_info
node_filefd_allocated
node_filefd_maximum
node_filesystem_device_error
node_filesystem_files
node_filesystem_files_free
node_filesystem_avail_bytes
node_filesystem_free_bytes
node_filesystem_readonly
node_filesystem_size_bytes
node_ipvs_backend_connections_active
node_ipvs_backend_connections_inactive
node_ipvs_backend_weight
node_ipvs_connections_total
node_ipvs_incoming_bytes_total
node_ipvs_incoming_packets_total
node_ipvs_outgoing_bytes_total
node_ipvs_outgoing_packets_total
node_load1
node_load15
node_load5
node_memory_Buffers_bytes
node_memory_Cached_bytes
node_memory_Active_anon_bytes
node_memory_Active_bytes
node_memory_Active_file_bytes
node_memory_Inactive_anon_bytes
node_memory_Inactive_bytes
node_memory_Inactive_file_bytes
node_memory_MemAvailable_bytes
node_memory_MemFree_bytes
node_memory_MemTotal_bytes
node_memory_Slab_bytes
node_netstat_TcpExt_ListenDrops
node_netstat_TcpExt_TCPSynRetrans
node_netstat_Tcp_ActiveOpens
node_netstat_Tcp_CurrEstab
node_netstat_Tcp_InErrs
node_netstat_Tcp_OutRsts
node_netstat_Tcp_InSegs
node_netstat_Tcp_OutSegs
node_netstat_Tcp_PassiveOpens
node_netstat_Tcp_RetransSegs
node_network_receive_bytes_total
node_network_receive_drop_total
node_network_receive_errs_total
node_network_receive_packets_total
node_network_transmit_bytes_total
node_network_transmit_drop_total
node_network_transmit_packets_total
node_sockstat_TCP6_inuse
node_sockstat_TCP_alloc
node_sockstat_TCP_inuse
node_sockstat_TCP_tw
node_sockstat_UDP6_inuse
node_sockstat_UDP_inuse
node_sockstat_sockets_used
node_time_seconds
node_uname_info
node_vmstat_pgmajfault

ingress-nginx 基础指标

任务名称(Job Name)指标名称
ingress-nginxnginx_ingress_controller_build_info
nginx_ingress_controller_bytes_sent_bucket
nginx_ingress_controller_bytes_sent_count
nginx_ingress_controller_bytes_sent_sum
nginx_ingress_controller_config_hash
nginx_ingress_controller_config_last_reload_successful
nginx_ingress_controller_config_last_reload_successful_timestamp_seconds
nginx_ingress_controller_ingress_upstream_header_seconds
nginx_ingress_controller_ingress_upstream_header_seconds_count
nginx_ingress_controller_ingress_upstream_header_seconds_sum
nginx_ingress_controller_ingress_upstream_latency_seconds
nginx_ingress_controller_ingress_upstream_latency_seconds_count
nginx_ingress_controller_ingress_upstream_latency_seconds_sum
nginx_ingress_controller_leader_election_status
nginx_ingress_controller_nginx_process_connections
nginx_ingress_controller_nginx_process_connections_total
nginx_ingress_controller_nginx_process_cpu_seconds_total
nginx_ingress_controller_nginx_process_num_procs
nginx_ingress_controller_nginx_process_oldest_start_time_seconds
nginx_ingress_controller_nginx_process_read_bytes_total
nginx_ingress_controller_nginx_process_requests_total
nginx_ingress_controller_nginx_process_resident_memory_bytes
nginx_ingress_controller_nginx_process_virtual_memory_bytes
nginx_ingress_controller_nginx_process_write_bytes_total
nginx_ingress_controller_request_duration_seconds_bucket
nginx_ingress_controller_request_duration_seconds_count
nginx_ingress_controller_request_duration_seconds_sum
nginx_ingress_controller_request_size_bucket
nginx_ingress_controller_request_size_count
nginx_ingress_controller_request_size_sum
nginx_ingress_controller_requests
nginx_ingress_controller_response_duration_seconds_bucket
nginx_ingress_controller_response_duration_seconds_count
nginx_ingress_controller_response_duration_seconds_sum
nginx_ingress_controller_response_size_bucket
nginx_ingress_controller_response_size_count
nginx_ingress_controller_response_size_sum
nginx_ingress_controller_ssl_expire_time_seconds
nginx_ingress_controller_success
process_cpu_seconds_total
process_resident_memory_bytes
nginx_ingress_controller_admission_roundtrip_duration

DCGM 基础指标

任务名称(Job Name)指标名称
dcgm
dcgm-vci
DCGM_FI_DEV_GPU_TEMP
DCGM_FI_DEV_MEMORY_TEMP
DCGM_FI_DEV_NVLINK_BANDWIDTH_TOTAL
DCGM_FI_DEV_POWER_USAGE
DCGM_FI_DEV_XID_ERRORS
DCGM_FI_DEV_DEC_UTIL
DCGM_FI_DEV_ENC_UTIL
DCGM_FI_DEV_FB_FREE
DCGM_FI_DEV_FB_USED
DCGM_FI_DEV_GPU_UTIL
DCGM_FI_DEV_MEM_COPY_UTIL
DCGM_FI_DEV_SM_CLOCK
DCGM_CUSTOM_XID_ERRORS_COUNTER

mGPU 基础指标

任务名称(Job Name)指标名称
mgpunvml_container_core_request
nvml_container_core_usage
nvml_container_core_utilization
nvml_container_mem_request
nvml_container_mem_usage
nvml_container_mem_utilization
nvml_pod_core_request
nvml_pod_core_usage
nvml_pod_core_utilization
nvml_pod_mem_request
nvml_pod_mem_usage
nvml_pod_mem_utilization
DCGM_FI_DEV_DEC_UTIL
DCGM_FI_DEV_ENC_UTIL
DCGM_FI_DEV_FB_FREE
DCGM_FI_DEV_FB_USED
DCGM_FI_DEV_GPU_TEMP
DCGM_FI_DEV_GPU_UTIL
DCGM_FI_DEV_MEM_COPY_UTIL
DCGM_FI_DEV_MEMORY_TEMP
DCGM_FI_DEV_NVLINK_BANDWIDTH_TOTAL
DCGM_FI_DEV_POWER_USAGE
DCGM_FI_DEV_SM_CLOCK
DCGM_FI_DEV_XID_ERRORS
DCGM_FI_PROF_PCIE_RX_BYTES
DCGM_FI_PROF_PCIE_TX_BYTES

DNS 基础指标

任务名称(Job Name)指标名称
core-dns
node-local-dns
vke-node-local-dns-admission
coredns_build_info
coredns_build_info
coredns_cache_entries
coredns_cache_hits_total
coredns_cache_misses_total
coredns_cache_requests_total
coredns_dns_do_requests_total
coredns_dns_request_duration_seconds_bucket
coredns_dns_request_duration_seconds_count
coredns_dns_request_duration_seconds_sum
coredns_dns_request_size_bytes_bucket
coredns_dns_request_size_bytes_count
coredns_dns_request_size_bytes_sum
coredns_dns_requests_total
coredns_dns_response_size_bytes_bucket
coredns_dns_response_size_bytes_count
coredns_dns_response_size_bytes_sum
coredns_dns_responses_total
coredns_forward_conn_cache_hits_total
coredns_forward_conn_cache_misses_total
coredns_forward_healthcheck_broken_total
coredns_forward_max_concurrent_rejects_total
coredns_forward_request_duration_seconds_bucket
coredns_forward_request_duration_seconds_count
coredns_forward_request_duration_seconds_sum
coredns_forward_requests_total
coredns_forward_responses_total
coredns_health_request_duration_seconds_bucket
coredns_health_request_duration_seconds_count
coredns_health_request_duration_seconds_sum
coredns_local_localhost_requests_total
coredns_reload_failed_total
coredns_panics_total
controller_runtime_webhook_requests_total
controller_runtime_webhook_latency_seconds_bucket
rest_client_request_duration_seconds_bucket
rest_client_request_duration_seconds_count
controller_runtime_webhook_requests_in_flight

弹性伸缩基础指标

任务名称(Job Name)指标名称
autoclustercluster_autoscaler_cluster_safe_to_autoscale
cluster_autoscaler_scale_down_in_cooldown
cluster_autoscaler_last_activity
cluster_autoscaler_unschedulable_pods_count
cluster_autoscaler_function_duration_seconds_bucket
cluster_autoscaler_scaled_up_nodes_total
cluster_autoscaler_scaled_up_gpu_nodes_total
cluster_autoscaler_scaled_down_nodes_total
cluster_autoscaler_scaled_down_gpu_nodes_total
cluster_autoscaler_evicted_pods_total
cluster_autoscaler_failed_scale_ups_total

VPC-CNI 基础指标

任务名称(Job Name)指标名称
vpc-cnirpc_latency_ms
resource_pool_available
resource_manager_error_count
openapi_latency_ms
penapi_error_count
metadata_latency_ms
metadata_error_count
resource_pool_max_cap
resource_pool_target
resource_pool_target_min
resource_pool_total
metadata_latency_ms_sum
metadata_latency_ms_count
metadata_latency_ms_bucket
openapi_latency_ms_sum
openapi_latency_ms_count
openapi_latency_ms_bucket
rpc_latency_ms_sum
rpc_latency_ms_count
rpc_latency_ms_bucket

存储基础指标

任务名称(Job Name)指标名称
csi-ebsgo_goroutines
go_threads
process_cpu_seconds_total
process_max_fds
process_open_fds
process_resident_memory_bytes
process_virtual_memory_bytes
process_virtual_memory_max_bytes
volc_api_request_duration_seconds
volc_api_request_errors
volc_api_throttled_requests_total

镜像基础指标

任务名称(Job Name)指标名称
cr-credential-controllercr_credential_client_request_duration_seconds
cr_credential_client_request_error
cr_credential_client_request_total
cr_credential_resource_request_error
cr_credential_resource_request_total
go_goroutines
process_cpu_seconds_total
process_max_fds
process_open_fds
process_resident_memory_bytes