The AI compute intelligence API for agents.
A JSON + llms.txt interface to the Servers.Computer index. Built for autonomous agents, RAG pipelines, procurement bots, and inference routers that need real GPU pricing — not stale screenshots.
Endpoints
List all indexed GPUs with TFLOPS, VRAM, $/hour, and provider availability.
Full spec sheet for a single GPU (H100, H200, B200, MI300X, etc.).
All clouds & neoclouds with on-demand and reserved pricing tiers.
Throughput, latency, and $/TFLOP benchmarks normalized across providers.
Compare 2–5 GPUs or providers. Returns a structured diff for agent consumption.
Get a quote for N GPUs × hours across providers, sorted by total cost.
Agent-readable site map (LLM-citation format) with canonical URLs and summaries.
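The quote endpoint above is described as pricing N GPUs × hours across providers and sorting by total cost. A rough sketch of that arithmetic follows, assuming simple linear pricing (total = count × hours × $/GPU-hour). The provider names and rates are made-up placeholders, and `rank_quotes` is a hypothetical local helper, not part of the API. The real endpoint may also factor in reserved tiers or minimum commitments.

```shell
# Placeholder rates in USD per GPU-hour; NOT real prices from the index.
rates='provider-a 2.50
provider-b 1.99
provider-c 3.10'

# rank_quotes COUNT HOURS
# Reads "name rate" lines on stdin and prints "name total",
# sorted by ascending total cost (count * hours * rate).
rank_quotes() {
  LC_ALL=C awk -v n="$1" -v h="$2" '{ printf "%s %.2f\n", $1, $2 * n * h }' |
    LC_ALL=C sort -k2,2n
}

# 64 GPUs for 720 hours, matching the quote example's payload.
printf '%s\n' "$rates" | rank_quotes 64 720
```

With these placeholder rates, the cheapest line comes first: `provider-b 91699.20`.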
Examples
curl "https://servers.computer/api/gpus/h100?sort=price_asc&limit=3"
curl -X POST https://servers.computer/api/compare \
  -H "Content-Type: application/json" \
  -d '{"gpus":["h200","b200"],"workload":"inference"}'
curl -X POST https://servers.computer/api/quote \
  -H "Content-Type: application/json" \
  -d '{"gpu":"h100","count":64,"hours":720}'
Public read endpoints are free and rate-limited to 60 req/min per IP. Higher tiers, write endpoints, and webhook subscriptions require an API key.
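A client that polls the public read endpoints can stay under the 60 req/min limit by spacing its requests. The sketch below shows one minimal way to do that in shell. `throttle_interval` and `fetch_all` are hypothetical helper names, and a fixed pause between requests is an assumption about a reasonable client policy, not something the API prescribes.

```shell
# throttle_interval LIMIT_PER_MINUTE
# Prints the whole number of seconds to wait between requests so that
# no more than LIMIT_PER_MINUTE requests are sent per minute
# (integer ceiling of 60 / limit).
throttle_interval() {
  limit=$1
  echo $(( (60 + limit - 1) / limit ))
}

# fetch_all URL...
# Fetches each URL in order, pausing after every request.
# 60 is the documented public limit per IP.
fetch_all() {
  interval=$(throttle_interval 60)
  for url in "$@"; do
    curl -sS "$url"
    sleep "$interval"
  done
}
```

For example, `fetch_all "https://servers.computer/api/gpus/h100?sort=price_asc&limit=3"` issues the request and then waits one second. Several agents sharing one IP would need a lower per-client limit.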