RATES

One rate sheet. Set by Hoonify.

Hoonify sets a single rate per SKU — operators don't price-compete on the marketplace. Rates are benchmarked against Vast.AI public medians and the major hyperscalers' on-demand list prices.

SourceVast.AI public bundles snapshot · 2026-04-27vast.ai Hyperscaler list prices via published rate sheets

GPU compute

Per-GPU/hr on-demand. Bundles of 1× and 8× GPU available where capacity allows.

GPUHoonify rateVast.AI medianHyperscaler listvs hyperscalerCapacity
H100 80GB SXMDatacenter
NVIDIA · 80 GB · 989 TFLOPS
$1.39/ GPU / HR$1.55
$12.29
AWS p5.48xlarge
~89%724GPUsLaunch
H100 80GB PCIe
NVIDIA · 80 GB · 756 TFLOPS
$1.55/ GPU / HR$1.74
$12.29
AWS p5.48xlarge
~87%669GPUsLaunch
H200 141GBDatacenter
NVIDIA · 141 GB · 989 TFLOPS
$2.09/ GPU / HR$2.32
$14.50
AWS p5e.48xlarge
~86%647GPUsLaunch
A100 80GB SXM
NVIDIA · 80 GB · 312 TFLOPS
$0.66/ GPU / HR$0.73
$4.60
AWS p4d.24xlarge
~86%550GPUsLaunch
A100 40GB
NVIDIA · 40 GB · 250 TFLOPS
$0.55/ GPU / HR$0.73
$3.90
AWS p4d.24xlarge
~86%196GPUsLaunch
L40S 48GB
NVIDIA · 48 GB · 362 TFLOPS
$0.50/ GPU / HR$0.56
$2.95
AWS g6e.12xlarge
~83%200GPUsLaunch
RTX 4090 24GBCommodity
NVIDIA · 24 GB · 165 TFLOPS
$0.29/ GPU / HR$0.32n/a67GPUsLaunch
MI300X 192GBDatacenter
AMD · 192 GB · 1,307 TFLOPS
$1.85/ GPU / HRn/a
$7.99
Azure ND MI300X
~77%520GPUsLaunch
B200 180GBReserved only
NVIDIA · 180 GB · 2,250 TFLOPS
$3.59/ GPU / HR$4.00
$16.50
AWS p6e (preview)
~78%0GPUsLaunch

Inference endpoints

Per-million-token. Hoonify pools quantized variants — operators pick the precision that hits the Hoonify-set per-token rate.

Model$ / 1M IN$ / 1M OUTQuantReplicas
Llama 3.1 8B8B
128K ctx
$0.06$0.12
FP8
1Launch
Llama 3.3 70B70B
128K ctx
$0.18$0.54
FP8FP16
3Launch
Llama 4 ScoutMoE 109B
1M ctx
$0.22$0.66
FP16
1Launch
Qwen 2.5 72B72B
128K ctx
$0.20$0.58
FP8FP16
2Launch
Qwen 3 32B32B
128K ctx
$0.12$0.34
INT4FP8
2Launch
DeepSeek V3MoE 671B
128K ctx
$0.45$1.10
FP8
5Launch
Mistral Large 2123B
128K ctx
$0.30$0.90
FP8
1Launch
Mistral Mixtral 8x22BMoE 8x22B
64K ctx
$0.25$0.80
FP16
2Launch
Gemma 2 27B27B
32K ctx
$0.10$0.30
FP8FP16
8Launch
Phi 414B
16K ctx
$0.07$0.18
FP16INT4FP8
3Launch

MARKET POSITION

Cheaper than the marketplace floor. Cheaper than the hyperscaler list.

Real capacity, real prices, real tokens.

Rates refresh weekly against Vast.AI public bundles and the AWS / Azure / GCP on-demand sheets. We pass the savings to you — you don't haggle with operators.

  • Snapshot2026-04-27
  • SKUs priced10 GPUs · 10 models
  • Rate stability14 day notice