Skip to main content

Documentation Index

Fetch the complete documentation index at: https://unkey.com/docs/llms.txt

Use this file to discover all available pages before exploring further.

Unkey Deploy is in public beta. To try it, open the product switcher in the top-left of the dashboard and select Deploy. During beta, deployed resources are free. We’re eager for feedback, so let us know what you think on Discord, X, or email support@unkey.com.
Each deployment’s overview page includes real-time charts for throughput and latency. Per-instance runtime metrics (CPU, memory, disk, and network) are available from the deployment’s network view. Navigate to a deployment from your project’s Deployments tab to view them.

Requests per second

The RPS chart shows the number of requests your deployment handles over time. Use it to spot traffic spikes, confirm scaling behavior, or correlate with incidents.

Latency

The latency chart displays response time distributions over time. You can switch between p50, p75, p90, p95, and p99 percentiles. Check the Requests tab to find individual slow requests and inspect their latency breakdown.

Runtime metrics

Open the Network tab on a deployment and click an instance node to open the details panel. The panel shows live charts for the selected instance, refreshing every few seconds:
  • CPU. Used vs. allocated CPU in millicores.
  • Memory. Used vs. allocated memory in bytes.
  • Disk. Used vs. allocated disk in bytes. Only shown when the instance has provisioned disk storage.
  • Network. Egress and ingress traffic over time.
Use the time window selector at the top of the panel to switch between the past 15 minutes, 1 hour, 6 hours, or 24 hours. When you select a deployment node instead of a single instance, the charts aggregate across all instances in the deployment.
Last modified on May 4, 2026