Dashboard Cards

KubeStellar Console provides a variety of cards to monitor and manage your clusters.

Available Card Types

Card TypeDescriptionData Source
Cluster HealthAvailability graph per clusterget_cluster_health
App StatusMulti-cluster application healthget_app_status
Event StreamLive event feed with filteringget_events
Deployment ProgressRollout status visualizationget_app_status
Pod IssuesCrashLoopBackOff, OOMKilled podsfind_pod_issues
Deployment IssuesStuck or failing rolloutsfind_deployment_issues
Resource CapacityCPU/memory/GPU utilizationlist_cluster_capabilities
GPU InventoryGPU nodes and counts across clustersget_gpu_nodes
GPU StatusReal-time GPU allocation and usageget_gpu_nodes
GPU OverviewSummary of GPU resourcesget_gpu_nodes
Security IssuesPrivileged, root, host accesscheck_security_issues
Upgrade StatusCluster version and upgrade stateget_upgrade_status

GPU Cards

GPU Inventory

Shows all GPU nodes across clusters with:

  • Node name and cluster
  • GPU type (e.g., NVIDIA A100)
  • Total GPU count
  • Allocated vs available

GPU Status

Real-time GPU utilization:

  • Allocation percentage per cluster
  • Memory usage
  • Temperature (if available)

GPU Overview

Summary card showing:

  • Total GPUs across all clusters
  • Overall utilization
  • Top consumers

Card Configuration

Each card can be configured with:

interface CardConfig {
  id: string;
  type: string;
  title: string;
  position: { x: number; y: number };
  size: { width: number; height: number };
  config: {
    clusters?: string[];      // Filter to specific clusters
    namespaces?: string[];    // Filter to specific namespaces
    refreshInterval?: number; // Update frequency in seconds
    warningsOnly?: boolean;   // For event stream
  };
}

AI Recommendations

In High AI mode, the console analyzes your cluster state and suggests relevant cards:

  • Pod Issues - Suggested when >5 pods have issues
  • GPU Status - Suggested when GPU utilization >90%
  • Event Stream - Suggested when >10 warning events
  • Cluster Health - Suggested when clusters are unhealthy

You can:

  • Accept - Add the recommended card
  • Snooze - Hide suggestion for 1 hour
  • Dismiss - Don’t suggest this card again