托管 Prometheus 将您上报的指标分为:云产品基础指标、云产品其他指标 和 自定义指标。指标的定义和计费方式,请参见 计费项。本文为您介绍容器服务(VKE)产品的基础指标。
任务名称(Job Name) | 指标名称 |
---|---|
kubernetes-apiserver | workqueue_adds_total |
workqueue_depth | |
workqueue_queue_duration_seconds_bucket | |
workqueue_queue_duration_seconds_count | |
workqueue_queue_duration_seconds_sum | |
workqueue_retries_total | |
workqueue_work_duration_seconds_bucket | |
rest_client_request_duration_seconds_bucket | |
rest_client_request_duration_seconds_count | |
rest_client_request_duration_seconds_sum | |
rest_client_requests_total | |
process_cpu_seconds_total | |
process_resident_memory_bytes | |
kubernetes_build_info | |
apiserver_admission_controller_admission_duration_seconds_bucket | |
apiserver_admission_controller_admission_duration_seconds_count | |
apiserver_admission_controller_admission_duration_seconds_sum | |
apiserver_admission_webhook_admission_duration_seconds_bucket | |
apiserver_admission_webhook_admission_duration_seconds_count | |
apiserver_admission_webhook_admission_duration_seconds_sum | |
apiserver_current_inflight_requests | |
apiserver_request_duration_seconds_bucket | |
apiserver_request_duration_seconds_count | |
apiserver_request_duration_seconds_sum | |
apiserver_request_total | |
apiserver_requested_deprecated_apis | |
apiserver_response_sizes_bucket | |
apiserver_response_sizes_count | |
apiserver_response_sizes_sum | |
etcd_request_duration_seconds_bucket | |
etcd_request_duration_seconds_count | |
etcd_request_duration_seconds_sum | |
apiserver_request_useragent_total | |
apiserver_requested_deprecated_apis |
任务名称(Job Name) | 指标名称 |
---|---|
kubernetes-etcd | process_cpu_seconds_total |
process_resident_memory_bytes | |
etcd_cluster_version | |
etcd_disk_backend_commit_duration_seconds_bucket | |
etcd_disk_backend_commit_duration_seconds_count | |
etcd_disk_backend_commit_duration_seconds_sum | |
etcd_disk_wal_fsync_duration_seconds_bucket | |
etcd_disk_wal_fsync_duration_seconds_count | |
etcd_disk_wal_fsync_duration_seconds_sum | |
etcd_network_peer_received_bytes_total | |
etcd_network_peer_round_trip_time_seconds_bucket | |
etcd_network_peer_round_trip_time_seconds_count | |
etcd_network_peer_round_trip_time_seconds_sum | |
etcd_network_peer_sent_bytes_total | |
etcd_network_peer_sent_failures_total | |
etcd_server_id | |
etcd_server_is_leader | |
etcd_server_has_leader | |
etcd_server_leader_changes_seen_total | |
etcd_server_proposals_applied_total | |
etcd_server_proposals_committed_total | |
etcd_server_proposals_failed_total | |
etcd_server_proposals_pending | |
etcd_mvcc_db_total_size_in_use_in_bytes |
任务名称(Job Name) | 指标名称 |
---|---|
kube-scheduler | workqueue_adds_total |
workqueue_depth | |
workqueue_queue_duration_seconds_bucket | |
workqueue_queue_duration_seconds_count | |
workqueue_queue_duration_seconds_sum | |
workqueue_retries_total | |
workqueue_work_duration_seconds_bucket | |
scheduler_pending_pods | |
scheduler_scheduler_cache_size | |
process_cpu_seconds_total | |
process_resident_memory_bytes | |
kubernetes_build_info | |
rest_client_request_duration_seconds_bucket | |
rest_client_request_duration_seconds_count | |
rest_client_request_duration_seconds_sum | |
rest_client_requests_total | |
scheduler_e2e_scheduling_duration_seconds_bucket | |
scheduler_pod_scheduling_attempts_sum | |
scheduler_preemption_attempts_total | |
scheduler_scheduling_algorithm_duration_seconds_count |
任务名称(Job Name) | 指标名称 |
---|---|
kube-state-metrics | kube_daemonset_created |
kube_daemonset_labels | |
kube_daemonset_status_current_number_scheduled | |
kube_daemonset_status_desired_number_scheduled | |
kube_daemonset_status_number_available | |
kube_daemonset_status_number_misscheduled | |
kube_daemonset_status_number_ready | |
kube_daemonset_status_number_unavailable | |
kube_daemonset_status_updated_number_scheduled | |
kube_deployment_created | |
kube_deployment_labels | |
kube_deployment_spec_replicas | |
kube_deployment_status_condition | |
kube_deployment_status_replicas | |
kube_deployment_status_replicas_available | |
kube_deployment_status_replicas_ready | |
kube_deployment_status_replicas_updated | |
kube_deployment_status_replicas_unavailable | |
kube_namespace_labels | |
kube_node_labels | |
kube_node_spec_unschedulable | |
kube_node_info | |
kube_node_spec_taint | |
kube_node_status_allocatable | |
kube_node_status_capacity | |
kube_node_status_condition | |
kube_persistentvolume_capacity_bytes | |
kube_persistentvolume_labels | |
kube_persistentvolume_status_phase | |
kube_persistentvolumeclaim_labels | |
kube_persistentvolumeclaim_status_condition | |
kube_persistentvolumeclaim_status_phase | |
kube_pod_container_info | |
kube_pod_container_resource_limits | |
kube_pod_container_resource_requests | |
kube_pod_container_status_last_terminated_reason | |
kube_pod_container_status_ready | |
kube_pod_container_status_restarts_total | |
kube_pod_container_status_running | |
kube_pod_container_status_terminated | |
kube_pod_container_status_terminated_reason | |
kube_pod_container_status_waiting | |
kube_pod_container_status_waiting_reason | |
kube_pod_info | |
kube_pod_owner | |
kube_pod_status_phase | |
kube_pod_created | |
kube_pod_labels | |
kube_pod_status_ready | |
kube_pod_status_reason | |
kube_replicaset_labels | |
kube_replicaset_spec_replicas | |
kube_replicaset_status_fully_labeled_replicas | |
kube_replicaset_status_observed_generation | |
kube_replicaset_status_ready_replicas | |
kube_replicaset_status_replicas | |
kube_replicaset_owner | |
kube_statefulset_status_replicas_ready | |
kube_statefulset_status_replicas_available | |
kube_statefulset_created | |
kube_statefulset_replicas | |
kube_statefulset_status_replicas_updated | |
kube_namespace_created | |
kube_persistentvolumeclaim_resource_requests_storage_bytes | |
kube_statefulset_status_replicas_updated | |
kube_horizontalpodautoscaler_info | |
kube_horizontalpodautoscaler_metadata_generation | |
kube_horizontalpodautoscaler_spec_max_replicas | |
kube_horizontalpodautoscaler_spec_min_replicas | |
kube_horizontalpodautoscaler_spec_target_metric | |
kube_horizontalpodautoscaler_status_condition | |
kube_horizontalpodautoscaler_status_current_replicas | |
kube_horizontalpodautoscaler_status_desired_replicas | |
kube_horizontalpodautoscaler_status_target_metric | |
kube_horizontalpodautoscaler_annotations | |
kube_horizontalpodautoscaler_labels |
任务名称(Job Name) | 指标名称 |
---|---|
kubelet | workqueue_adds_total |
workqueue_depth | |
workqueue_queue_duration_seconds_bucket | |
workqueue_queue_duration_seconds_count | |
workqueue_queue_duration_seconds_sum | |
workqueue_retries_total | |
workqueue_work_duration_seconds_bucket | |
storage_operation_duration_seconds_bucket | |
storage_operation_duration_seconds_count | |
storage_operation_duration_seconds_sum | |
storage_operation_errors_total | |
volume_manager_total_volumes | |
rest_client_request_duration_seconds_bucket | |
rest_client_request_duration_seconds_count | |
rest_client_request_duration_seconds_sum | |
rest_client_requests_total | |
process_cpu_seconds_total | |
process_resident_memory_bytes | |
kubernetes_build_info | |
kubelet_cgroup_manager_duration_seconds_bucket | |
kubelet_cgroup_manager_duration_seconds_count | |
kubelet_cgroup_manager_duration_seconds_sum | |
kubelet_node_config_error | |
kubelet_node_name | |
kubelet_certificate_manager_client_expiration_renew_errors | |
kubelet_certificate_manager_client_ttl_seconds | |
kubelet_http_inflight_requests | |
kubelet_http_requests_duration_seconds_bucket | |
kubelet_http_requests_duration_seconds_count | |
kubelet_http_requests_duration_seconds_sum | |
kubelet_http_requests_total | |
kubelet_pleg_relist_duration_seconds_bucket | |
kubelet_pleg_relist_duration_seconds_count | |
kubelet_pleg_relist_duration_seconds_sum | |
kubelet_pleg_relist_interval_seconds_bucket | |
kubelet_pleg_relist_interval_seconds_count | |
kubelet_pleg_relist_interval_seconds_sum | |
kubelet_pod_start_duration_seconds_bucket | |
kubelet_pod_start_duration_seconds_count | |
kubelet_pod_start_duration_seconds_sum | |
kubelet_pod_worker_duration_seconds_bucket | |
kubelet_pod_worker_duration_seconds_count | |
kubelet_pod_worker_duration_seconds_sum | |
kubelet_pod_worker_start_duration_seconds_bucket | |
kubelet_run_podsandbox_duration_seconds_bucket | |
kubelet_run_podsandbox_errors_total | |
kubelet_running_containers | |
kubelet_running_pods | |
kubelet_runtime_operations_duration_seconds_bucket | |
kubelet_runtime_operations_duration_seconds_count | |
kubelet_runtime_operations_duration_seconds_sum | |
kubelet_runtime_operations_errors_total | |
kubelet_runtime_operations_total | |
kubelet_volume_stats_available_bytes | |
kubelet_volume_stats_capacity_bytes | |
kubelet_volume_stats_inodes | |
kubelet_volume_stats_inodes_free | |
kubelet_volume_stats_inodes_used | |
kubelet_volume_stats_used_bytes | |
kubelet_pleg_discard_events | |
kubelet_pleg_last_seen_seconds | |
apiserver_storage_data_key_generation_failures_total | |
get_token_fail_count | |
kubelet_started_containers_errors_total | |
kubelet_started_pods_errors_total | |
process_max_fds | |
process_open_fds | |
rest_client_response_size_bytes_bucket | |
rest_client_response_size_bytes_sum | |
rest_client_response_size_bytes_count | |
volume_operation_total_seconds_bucket | |
volume_operation_total_seconds_sum | |
volume_operation_total_seconds_count | |
kubelet_cpu_manager_pinning_errors_total | |
apiserver_audit_event_total | |
apiserver_audit_requests_rejected_total | |
csi_operations_seconds_bucket | |
csi_operations_seconds_count | |
csi_operations_seconds_sum |
任务名称(Job Name) | 指标名称 |
---|---|
kubelet-cadvisor | cadvisor_version_info |
machine_cpu_cores | |
machine_memory_bytes | |
container_cpu_cfs_periods_total | |
container_cpu_cfs_throttled_periods_total | |
container_cpu_usage_seconds_total | |
container_fs_inodes_free | |
container_fs_inodes_total | |
container_fs_io_current | |
container_fs_io_time_seconds_total | |
container_fs_io_time_weighted_seconds_total | |
container_fs_limit_bytes | |
container_fs_read_seconds_total | |
container_fs_reads_bytes_total | |
container_fs_reads_total | |
container_fs_usage_bytes | |
container_fs_write_seconds_total | |
container_fs_writes_bytes_total | |
container_fs_writes_total | |
container_memory_cache | |
container_memory_failcnt | |
container_memory_failures_total | |
container_memory_max_usage_bytes | |
container_memory_rss | |
container_memory_swap | |
container_memory_usage_bytes | |
container_memory_working_set_bytes | |
container_network_receive_bytes_total | |
container_network_receive_errors_total | |
container_network_receive_packets_dropped_total | |
container_network_receive_packets_total | |
container_network_transmit_bytes_total | |
container_network_transmit_errors_total | |
container_network_transmit_packets_dropped_total | |
container_network_transmit_packets_total | |
container_processes | |
container_sockets | |
container_start_time_seconds | |
container_tasks_state | |
container_threads | |
container_threads_max | |
container_cpu_cfs_throttled_seconds_total | |
container_cpu_load_average_10s | |
container_cpu_user_seconds_total | |
container_file_descriptors | |
container_spec_cpu_period | |
container_spec_cpu_quota | |
container_spec_memory_reservation_limit_bytes | |
container_spec_memory_swap_limit_bytes | |
machine_cpu_sockets | |
machine_scrape_error | |
container_blkio_device_usage_total | |
container_oom_events_total | |
machine_cpu_physical_cores | |
container_spec_memory_limit_bytes | |
container_cpu_system_seconds_total | |
container_oom_events_total | |
pod_cpu_usage_seconds_total | |
pod_memory_working_set_bytes | |
vci_pod_cpu_resource | |
vci_pod_memory_resource | |
vci_pod_gpu_resource |
任务名称(Job Name) | 指标名称 |
---|---|
node-exporter | process_cpu_seconds_total |
process_resident_memory_bytes | |
node_boot_time_seconds | |
node_context_switches_total | |
node_cpu_seconds_total | |
node_disk_io_now | |
node_disk_io_time_seconds_total | |
node_disk_io_time_weighted_seconds_total | |
node_disk_read_bytes_total | |
node_disk_read_time_seconds_total | |
node_disk_reads_completed_total | |
node_disk_write_time_seconds_total | |
node_disk_writes_completed_total | |
node_disk_written_bytes_total | |
node_exporter_build_info | |
node_filefd_allocated | |
node_filefd_maximum | |
node_filesystem_device_error | |
node_filesystem_files | |
node_filesystem_files_free | |
node_filesystem_avail_bytes | |
node_filesystem_free_bytes | |
node_filesystem_readonly | |
node_filesystem_size_bytes | |
node_ipvs_backend_connections_active | |
node_ipvs_backend_connections_inactive | |
node_ipvs_backend_weight | |
node_ipvs_connections_total | |
node_ipvs_incoming_bytes_total | |
node_ipvs_incoming_packets_total | |
node_ipvs_outgoing_bytes_total | |
node_ipvs_outgoing_packets_total | |
node_load1 | |
node_load15 | |
node_load5 | |
node_memory_Buffers_bytes | |
node_memory_Cached_bytes | |
node_memory_Active_anon_bytes | |
node_memory_Active_bytes | |
node_memory_Active_file_bytes | |
node_memory_Inactive_anon_bytes | |
node_memory_Inactive_bytes | |
node_memory_Inactive_file_bytes | |
node_memory_MemAvailable_bytes | |
node_memory_MemFree_bytes | |
node_memory_MemTotal_bytes | |
node_memory_Slab_bytes | |
node_netstat_TcpExt_ListenDrops | |
node_netstat_TcpExt_TCPSynRetrans | |
node_netstat_Tcp_ActiveOpens | |
node_netstat_Tcp_CurrEstab | |
node_netstat_Tcp_InErrs | |
node_netstat_Tcp_OutRsts | |
node_netstat_Tcp_InSegs | |
node_netstat_Tcp_OutSegs | |
node_netstat_Tcp_PassiveOpens | |
node_netstat_Tcp_RetransSegs | |
node_network_receive_bytes_total | |
node_network_receive_drop_total | |
node_network_receive_errs_total | |
node_network_receive_packets_total | |
node_network_transmit_bytes_total | |
node_network_transmit_drop_total | |
node_network_transmit_packets_total | |
node_sockstat_TCP6_inuse | |
node_sockstat_TCP_alloc | |
node_sockstat_TCP_inuse | |
node_sockstat_TCP_tw | |
node_sockstat_UDP6_inuse | |
node_sockstat_UDP_inuse | |
node_sockstat_sockets_used | |
node_time_seconds | |
node_uname_info | |
node_vmstat_pgmajfault |
任务名称(Job Name) | 指标名称 |
---|---|
ingress-nginx | nginx_ingress_controller_build_info |
nginx_ingress_controller_bytes_sent_bucket | |
nginx_ingress_controller_bytes_sent_count | |
nginx_ingress_controller_bytes_sent_sum | |
nginx_ingress_controller_config_hash | |
nginx_ingress_controller_config_last_reload_successful | |
nginx_ingress_controller_config_last_reload_successful_timestamp_seconds | |
nginx_ingress_controller_ingress_upstream_header_seconds | |
nginx_ingress_controller_ingress_upstream_header_seconds_count | |
nginx_ingress_controller_ingress_upstream_header_seconds_sum | |
nginx_ingress_controller_ingress_upstream_latency_seconds | |
nginx_ingress_controller_ingress_upstream_latency_seconds_count | |
nginx_ingress_controller_ingress_upstream_latency_seconds_sum | |
nginx_ingress_controller_leader_election_status | |
nginx_ingress_controller_nginx_process_connections | |
nginx_ingress_controller_nginx_process_connections_total | |
nginx_ingress_controller_nginx_process_cpu_seconds_total | |
nginx_ingress_controller_nginx_process_num_procs | |
nginx_ingress_controller_nginx_process_oldest_start_time_seconds | |
nginx_ingress_controller_nginx_process_read_bytes_total | |
nginx_ingress_controller_nginx_process_requests_total | |
nginx_ingress_controller_nginx_process_resident_memory_bytes | |
nginx_ingress_controller_nginx_process_virtual_memory_bytes | |
nginx_ingress_controller_nginx_process_write_bytes_total | |
nginx_ingress_controller_request_duration_seconds_bucket | |
nginx_ingress_controller_request_duration_seconds_count | |
nginx_ingress_controller_request_duration_seconds_sum | |
nginx_ingress_controller_request_size_bucket | |
nginx_ingress_controller_request_size_count | |
nginx_ingress_controller_request_size_sum | |
nginx_ingress_controller_requests | |
nginx_ingress_controller_response_duration_seconds_bucket | |
nginx_ingress_controller_response_duration_seconds_count | |
nginx_ingress_controller_response_duration_seconds_sum | |
nginx_ingress_controller_response_size_bucket | |
nginx_ingress_controller_response_size_count | |
nginx_ingress_controller_response_size_sum | |
nginx_ingress_controller_ssl_expire_time_seconds | |
nginx_ingress_controller_success | |
process_cpu_seconds_total | |
process_resident_memory_bytes | |
nginx_ingress_controller_admission_roundtrip_duration |
任务名称(Job Name) | 指标名称 |
---|---|
dcgm dcgm-vci | DCGM_FI_DEV_GPU_TEMP |
DCGM_FI_DEV_MEMORY_TEMP | |
DCGM_FI_DEV_NVLINK_BANDWIDTH_TOTAL | |
DCGM_FI_DEV_POWER_USAGE | |
DCGM_FI_DEV_XID_ERRORS | |
DCGM_FI_DEV_DEC_UTIL | |
DCGM_FI_DEV_ENC_UTIL | |
DCGM_FI_DEV_FB_FREE | |
DCGM_FI_DEV_FB_USED | |
DCGM_FI_DEV_GPU_UTIL | |
DCGM_FI_DEV_MEM_COPY_UTIL | |
DCGM_FI_DEV_SM_CLOCK | |
DCGM_CUSTOM_XID_ERRORS_COUNTER |
任务名称(Job Name) | 指标名称 |
---|---|
mgpu | nvml_container_core_request |
nvml_container_core_usage | |
nvml_container_core_utilization | |
nvml_container_mem_request | |
nvml_container_mem_usage | |
nvml_container_mem_utilization | |
nvml_pod_core_request | |
nvml_pod_core_usage | |
nvml_pod_core_utilization | |
nvml_pod_mem_request | |
nvml_pod_mem_usage | |
nvml_pod_mem_utilization | |
DCGM_FI_DEV_DEC_UTIL | |
DCGM_FI_DEV_ENC_UTIL | |
DCGM_FI_DEV_FB_FREE | |
DCGM_FI_DEV_FB_USED | |
DCGM_FI_DEV_GPU_TEMP | |
DCGM_FI_DEV_GPU_UTIL | |
DCGM_FI_DEV_MEM_COPY_UTIL | |
DCGM_FI_DEV_MEMORY_TEMP | |
DCGM_FI_DEV_NVLINK_BANDWIDTH_TOTAL | |
DCGM_FI_DEV_POWER_USAGE | |
DCGM_FI_DEV_SM_CLOCK | |
DCGM_FI_DEV_XID_ERRORS | |
DCGM_FI_PROF_PCIE_RX_BYTES | |
DCGM_FI_PROF_PCIE_TX_BYTES |
任务名称(Job Name) | 指标名称 |
---|---|
core-dns node-local-dns vke-node-local-dns-admission | coredns_build_info |
coredns_build_info | |
coredns_cache_entries | |
coredns_cache_hits_total | |
coredns_cache_misses_total | |
coredns_cache_requests_total | |
coredns_dns_do_requests_total | |
coredns_dns_request_duration_seconds_bucket | |
coredns_dns_request_duration_seconds_count | |
coredns_dns_request_duration_seconds_sum | |
coredns_dns_request_size_bytes_bucket | |
coredns_dns_request_size_bytes_count | |
coredns_dns_request_size_bytes_sum | |
coredns_dns_requests_total | |
coredns_dns_response_size_bytes_bucket | |
coredns_dns_response_size_bytes_count | |
coredns_dns_response_size_bytes_sum | |
coredns_dns_responses_total | |
coredns_forward_conn_cache_hits_total | |
coredns_forward_conn_cache_misses_total | |
coredns_forward_healthcheck_broken_total | |
coredns_forward_max_concurrent_rejects_total | |
coredns_forward_request_duration_seconds_bucket | |
coredns_forward_request_duration_seconds_count | |
coredns_forward_request_duration_seconds_sum | |
coredns_forward_requests_total | |
coredns_forward_responses_total | |
coredns_health_request_duration_seconds_bucket | |
coredns_health_request_duration_seconds_count | |
coredns_health_request_duration_seconds_sum | |
coredns_local_localhost_requests_total | |
coredns_reload_failed_total | |
coredns_panics_total | |
controller_runtime_webhook_requests_total | |
controller_runtime_webhook_latency_seconds_bucket | |
rest_client_request_duration_seconds_bucket | |
rest_client_request_duration_seconds_count | |
controller_runtime_webhook_requests_in_flight |
任务名称(Job Name) | 指标名称 |
---|---|
autocluster | cluster_autoscaler_cluster_safe_to_autoscale |
cluster_autoscaler_scale_down_in_cooldown | |
cluster_autoscaler_last_activity | |
cluster_autoscaler_unschedulable_pods_count | |
cluster_autoscaler_function_duration_seconds_bucket | |
cluster_autoscaler_scaled_up_nodes_total | |
cluster_autoscaler_scaled_up_gpu_nodes_total | |
cluster_autoscaler_scaled_down_nodes_total | |
cluster_autoscaler_scaled_down_gpu_nodes_total | |
cluster_autoscaler_evicted_pods_total | |
cluster_autoscaler_failed_scale_ups_total |
任务名称(Job Name) | 指标名称 |
---|---|
vpc-cni | rpc_latency_ms |
resource_pool_available | |
resource_manager_error_count | |
openapi_latency_ms | |
penapi_error_count | |
metadata_latency_ms | |
metadata_error_count | |
resource_pool_max_cap | |
resource_pool_target | |
resource_pool_target_min | |
resource_pool_total | |
metadata_latency_ms_sum | |
metadata_latency_ms_count | |
metadata_latency_ms_bucket | |
openapi_latency_ms_sum | |
openapi_latency_ms_count | |
openapi_latency_ms_bucket | |
rpc_latency_ms_sum | |
rpc_latency_ms_count | |
rpc_latency_ms_bucket |
任务名称(Job Name) | 指标名称 |
---|---|
csi-ebs | go_goroutines |
go_threads | |
process_cpu_seconds_total | |
process_max_fds | |
process_open_fds | |
process_resident_memory_bytes | |
process_virtual_memory_bytes | |
process_virtual_memory_max_bytes | |
volc_api_request_duration_seconds | |
volc_api_request_errors | |
volc_api_throttled_requests_total |
任务名称(Job Name) | 指标名称 |
---|---|
cr-credential-controller | cr_credential_client_request_duration_seconds |
cr_credential_client_request_error | |
cr_credential_client_request_total | |
cr_credential_resource_request_error | |
cr_credential_resource_request_total | |
go_goroutines | |
process_cpu_seconds_total | |
process_max_fds | |
process_open_fds | |
process_resident_memory_bytes |