AI Platform

Pricing

Simple, predictable pricing for every stage of your AI journey.

GPU Instance Pricing

GPU Type VRAM Price/hr Regions Action
RTX 4090 – 24GB 24 GB $0.36 US-NC-RDU Deploy
RTX 4080 16 GB $0.29 US-NC-RDU Deploy
RTX 4070 Ti 12 GB $0.2 Deploy
A100 SXM 80 GB $1.49 EU-RO-1, CA-MTL-1, EU-SE-1, US-IL-1, EUR-IS-1, EU-CZ-1, US-TX-3, EUR-IS-2, US-KS-2, US-GA-2, US-WA-1, US-TX-1, CA-MTL-3, EU-NL-1, US-TX-4, US-CA-2, US-NC-1, OC-AU-1, US-DE-1, EUR-IS-3, CA-MTL-2, AP-JP-1, EUR-NO-1, EU-FR-1, US-KS-3, US-GA-1 Deploy
RTX 3090 24 GB $0.24 Deploy
A30 24 GB $0.24 Deploy
RTX 5090 32 GB $0.74 EU-RO-1, CA-MTL-1, EU-SE-1, US-IL-1, EUR-IS-1, EU-CZ-1, US-TX-3, EUR-IS-2, US-KS-2, US-GA-2, US-WA-1, US-TX-1, CA-MTL-3, EU-NL-1, US-TX-4, US-CA-2, US-NC-1, OC-AU-1, US-DE-1, EUR-IS-3, CA-MTL-2, AP-JP-1, EUR-NO-1, EU-FR-1, US-KS-3, US-GA-1 Deploy
H100 SXM 80 GB $2.88 Deploy
RTX A6000 48 GB $0.35 Deploy
H100 NVL 94 GB $2.77 EU-RO-1, CA-MTL-1, EU-SE-1, US-IL-1, EUR-IS-1, EU-CZ-1, US-TX-3, EUR-IS-2, US-KS-2, US-GA-2, US-WA-1, US-TX-1, CA-MTL-3, EU-NL-1, US-TX-4, US-CA-2, US-NC-1, OC-AU-1, US-DE-1, EUR-IS-3, CA-MTL-2, AP-JP-1, EUR-NO-1, EU-FR-1, US-KS-3, US-GA-1 Deploy
RTX 3080 Ti 12 GB $0.19 US-NC-RDU Deploy
RTX 4080 SUPER 16 GB $0.3 Deploy
RTX 5080 16 GB $0.42 Deploy
A100 PCIe 80 GB $1.27 EU-RO-1, CA-MTL-1, EU-SE-1, US-IL-1, EUR-IS-1, EU-CZ-1, US-TX-3, EUR-IS-2, US-KS-2, US-GA-2, US-WA-1, US-TX-1, CA-MTL-3, EU-NL-1, US-TX-4, US-CA-2, US-NC-1, OC-AU-1, US-DE-1, EUR-IS-3, CA-MTL-2, AP-JP-1, EUR-NO-1, EU-FR-1, US-KS-3, US-GA-1 Deploy
RTX PRO 6000 MaxQ 96 GB $1.75 US-NC-RDU Deploy
unknown 0 GB $0.0 Deploy
RTX PRO 6000 WK 96 GB $1.81 Deploy
V100 SXM2 32GB 32 GB $0.35 Deploy
A40 48 GB $0.37 Deploy
V100 SXM2 16 GB $0.25 Deploy
RTX 4090 24 GB $0.36 US-NC-RDU Deploy
MI300X 192 GB $0.54 Deploy
Tesla V100 16 GB $0.2 Deploy
B200 180 GB $6.4 Deploy
B300 288 GB $6.62 Deploy
RTX 3070 8 GB $0.14 Deploy
H100 PCIe 80 GB $2.13 Deploy
NVIDIA H200 NVL 143 GB $0.54 Deploy
RTX 2000 Ada 16 GB $0.54 Deploy
RTX 4000 Ada 20 GB $0.21 Deploy
RTX 5000 Ada 32 GB $0.52 Deploy
H200 SXM 141 GB $3.84 Deploy
L40S 48 GB $0.85 Deploy
RTX 3090 Ti 24 GB $0.29 Deploy
RTX 4000 Ada SFF 20 GB $0.19 Deploy
RTX 3080 10 GB $0.18 US-NC-RDU Deploy
RTX A2000 6 GB $0.13 Deploy
RTX A4500 20 GB $0.2 Deploy
L4 24 GB $0.47 Deploy
RTX 6000 Ada 48 GB $0.79 US-NC-RDU Deploy
RTX A4000 16 GB $0.18 Deploy
RTX A5000 24 GB $0.17 Deploy
RTX PRO 6000 96 GB $1.82 Deploy
L40 48 GB $0.74 Deploy

Endpoint Pricing per Modality

Language Models

$0.20 / 1M tokens

“Summarizing a 5,000 word document costs about $0.001 under typical settings.”

Image Generation

$0.01 / image

“Generating 100 high-quality product photos costs exactly $1.00.”

Audio (Whisper)

$0.006 / minute

“Transcribing a 1-hour podcast costs about $0.36.”

Estimate your spend

Use our calculator to see how much you'll save compared to legacy providers.

Estimated Monthly Spend

$300.00