New1M tokens in starter credits when you create an account

AI INFERENCE PLATFORM // OPEN-SOURCE FIRST

Run open-source inference on real GPUs, on real terms.

DeepSeek, Qwen, Gemma, Kimi — open weights pooled across every NeoCloud running Hoonify, with an OpenAI-compatible API.

WHAT ARE YOU BUILDING?

Start with the model that fits the job.

A starting point for the most common workloads — swap to any other model from the same base URL.

Explore use cases

HOW IT WORKS

From zero to inference in three steps.

1

Pick a model

Browse the catalog or try one live in the workbench. No commitment, no credit card.

2

Point your SDK at us

Swap in the OpenAI-compatible base URL and an API key. Your existing code keeps working.

3

Pay per token

Per-million-token pricing with no surge. Every new account starts on 1M free tokens.

FOR OPERATORS

Have the GPUs? We bring the demand.

List your capacity once and it's in front of every customer on the network. We route the inference demand, handle metering and billing, and keep your GPUs earning — you keep running the hardware.

  • Demand
    Qualified inference traffic, routed to your GPUs.
  • Utilization
    Fill idle hours across the pool, not just your own customers.
  • Settlement
    Metered billing, invoices, and payouts handled for you.