New$5 in starter credits when you create an account

AI INFERENCE PLATFORM // OPEN-SOURCE FIRST

Run open-source inference on real GPUs, on real terms.

Llama, Qwen, DeepSeek, Mistral — pooled across every NeoCloud running Hoonify, with an OpenAI-compatible API. Try a model in the workbench before you write a line of code.

Real capacity, real prices, real tokens.

GPUs in pool

47,275

Pools

3

H100/GPU/hr

$1.39

Aggregate throughput

1.1M tok/s

POOLS

Three pools. One control plane.

You pick the pool. Hoonify picks the rack.

Browse pools

FOR OPERATORS

Have the racks. Need the renters.

Hoonify is the full software stack for NeoCloud operators — TurbOS bare-metal provisioning, Kubernetes, inference scheduler, billing, and the Howl customer portal. The aggregator is the demand side. List your capacity once and it's in front of every customer in the network.

  • Provisioning
    TurbOS images bare metal in <90s.
  • Scheduling
    Inference scheduler & GPU sharing built-in.
  • Billing
    Per-second metered. Credits, invoices, tax.