AI Platform

Predictable AI Compute Costs

Simple, predictable pricing for every stage of your AI journey.

GPU Instance Pricing

GPU Type VRAM Price/hr Action
A100 SXM 40GB 40 GB $1.07 Deploy
A40 48 GB $0.37 Deploy
RTX A4000 16 GB $0.18 Deploy
A100 SXM 80 GB $1.49 Deploy
RTX 3070 8 GB $0.14 Deploy
RTX 6000 Ada 48 GB $0.79 Deploy
B200 180 GB $6.4 Deploy
RTX A2000 6 GB $0.13 Deploy
B300 288 GB $7.43 Deploy
RTX PRO 4500 32 GB $0.36 Deploy
RTX PRO 6000 96 GB $1.81 Deploy
RTX 3090 24 GB $0.24 Deploy
H100 PCIe 80 GB $2.13 Deploy
RTX 4080 16 GB $0.29 Deploy
RTX 5080 16 GB $0.42 Deploy
H100 SXM 80 GB $2.88 Deploy
L4 24 GB $0.47 Deploy
RTX 4000 Ada 20 GB $0.21 Deploy
RTX 4000 Ada SFF 20 GB $0.19 Deploy
RTX A5000 24 GB $0.17 Deploy
RTX A6000 48 GB $0.35 Deploy
RTX 3080 Ti 12 GB $0.19 Deploy
RTX A4500 20 GB $0.2 Deploy
RTX 4070 Ti 12 GB $0.2 Deploy
Tesla V100 16 GB $0.2 Deploy
V100 SXM2 16 GB $0.25 Deploy
RTX PRO 6000 WK 96 GB $1.81 Deploy
unknown 0 GB $0.0 Deploy
L40 48 GB $0.74 Deploy
RTX 3080 10 GB $0.18 Deploy
RTX 5090 32 GB $0.74 Deploy
RTX PRO 6000 MaxQ 96 GB $1.75 Deploy
A100 PCIe 80 GB $1.27 Deploy
H200 SXM 141 GB $3.84 Deploy
H100 NVL 94 GB $2.77 Deploy
NVIDIA H200 NVL 143 GB $0.54 Deploy
L40S 48 GB $0.85 Deploy
RTX 5000 Ada 32 GB $0.52 Deploy
RTX 2000 Ada 16 GB $0.54 Deploy
MI300X 192 GB $0.54 Deploy
RTX 4090 24 GB $0.36 Deploy
RTX 3090 Ti 24 GB $0.29 Deploy
RTX 4080 SUPER 16 GB $0.3 Deploy
RTX 4090 – 24GB 24 GB $0.36 Deploy

Endpoint Pricing per Modality

Language Models

$0.20 / 1M tokens

“Summarizing a 5,000 word document costs about $0.001 under typical settings.”

Image Generation

$0.01 / image

“Generating 100 high-quality product photos costs exactly $1.00.”

Audio (Whisper)

$0.006 / minute

“Transcribing a 1-hour podcast costs about $0.36.”

DocVault AI Service Plans

DocVault AI Free

$0

per month

  • 2 Docs / Day
  • Basic Q&A
Select Plan

Estimate your spend

Use our calculator to see how much you'll save compared to legacy providers.

Estimated Monthly Spend

$300.00