Skip to content

Analytics Dashboard

🧠 Philosophy

Understanding how your deployed workflows perform at runtime is essential for optimizing compute usage, debugging bottlenecks, and improving reliability.

That’s why the INTELLITHING Analytics Dashboard provides live and historical system metrics β€” powered by Kubernetes β€” directly tied to your deployed workflows. No need to integrate third-party observability tools or navigate opaque infrastructure layers.

Whether you're investigating a slowdown, verifying resource limits, or just ensuring stability, the dashboard gives you real-time visibility into container behavior.

πŸ”‘ Key Concepts

Concept Description
Pod Metrics Resource-level statistics from the running container
Date Range Filter Choose a custom time window for data exploration
Live Graphs View real-time usage patterns as your app is running
Resource Categories Metrics are grouped by CPU, memory, GPU, and container-level info
Alignments & Aggregation All metrics are normalized to 1-minute resolution for clarity

πŸ“˜ Key Definitions

Metric Meaning
CPU Usage Time Actual CPU time consumed by the container (in cores)
CPU Request Utilization Percent of allocated CPU request being used
CPU Limit Utilization Percent of CPU limit being used
Memory Used Memory consumption in GB
Memory Limit Utilization Percent of memory limit currently used
Container Restart Count Number of times the container restarted during the time window
GPU Duty Cycle Percent of time the GPU was actively in use (if GPU enabled)
GPU Memory Used Amount of GPU memory consumed
Total GPU Memory Allocated GPU memory (useful for capacity planning)

🧩 Dashboard at a Glance

One-line Summary

Analytics = [ Pod Performance Metrics ] + [ Date Picker ] + [ Live Graphs ]

Example Usage

  1. You deploy a workflow to production.
  2. Head to the Analytics Dashboard.
  3. Choose a date/time range.
  4. Click "Refresh Data".
  5. Review graphs for CPU, memory, and restarts.
  6. Spot performance trends or stability issues quickly.

βš™οΈ How Analytics Fits into INTELLITHING

The Analytics tab provides Kubernetes-native observability, integrated directly into your project UI.

  • Every deployment is backed by containerized infrastructure.
  • These containers emit real-time metrics β€” CPU usage, memory load, restarts, and GPU activity (if used).
  • INTELLITHING collects and displays these metrics using a clean, user-friendly interface.
  • You get instant feedback on how well your runtime is performing β€” without touching kubectl or Prometheus.

🧭 Using the Analytics View

πŸ“… Date Filter

Choose the time window you want to inspect.

  • From: Start date/time
  • To: End date/time
  • Then click "Refresh Data" to reload graphs.

This allows you to explore past performance for debugging or trend analysis.

πŸ“ˆ Interpreting Graphs

Each graph is labeled and auto-updated. Some helpful tips:

Graph What to Look For
CPU Usage Time Spikes or flatlines β€” may indicate heavy load or idling
CPU Request/Limit Utilization Percent-based comparison vs. assigned resources
Memory Used Track memory growth or memory leaks
Memory Limit Utilization Ensure you’re not near the max (100%)
Container Restart Count Anything > 0 suggests instability or failure
GPU Metrics Only visible if GPU is provisioned

Values like "0.78" on memory used = 0.78 GB used at that time point.

🚨 Common Scenarios

Situation What to Check
App is slow Look at CPU Usage Time, Memory Used
App crashes or restarts Check Container Restart Count, Memory Limit Utilization
Model seems underutilized Review GPU Duty Cycle or CPU Request Utilization
Memory leak suspected Look for steady upward trend in Memory Used
Too much allocated but unused Low CPU/Memory utilization β€” consider scaling down

πŸ” Permissions

Feature Required Role
Access to Analytics All users with access to the project
Build/Deploy to see metrics Requires actual deployment to exist

No metrics will be shown unless a valid deployment exists for the selected date range.

πŸ’‘ Best Practices

Tip Why It Helps
Always check after deploy Verify that resource limits and usage align
Investigate restarts immediately Often caused by crashes, memory limits, or bad configs
Monitor during load tests Helps determine real-world CPU/memory usage
Use date range to compare trends Spot regressions across deployments
Keep limits generous but safe Avoid crashing under load or wasting idle capacity

πŸ’¬ Summary

  • The Analytics Dashboard gives you deep insight into real-time container performance.
  • Metrics are broken down by CPU, Memory, GPU, and Container Health.
  • You can choose custom date ranges and explore visual trends interactively.
  • This view helps debug, optimize, and confidently run production-grade workflows β€” all from inside INTELLITHING.