VORN COMPUTE · v2.4.1 · LIVE

Run the world's fastest AI infra.

Zero cold starts. Sub-10ms routing.
Infinite scale. One API.

Start free See benchmarks →

LATENCY2.1ms

LATENCY1.8ms

LATENCY3.2ms

LATENCY0.9ms

LATENCY2.4ms

LATENCY1.1ms

LATENCY1.6ms

LATENCY2.8ms

LATENCY0.7ms

LATENCY3.0ms

LATENCY1.4ms

LATENCY2.2ms

LATENCY1.9ms

LATENCY0.8ms

LATENCY2.6ms

LATENCY1.3ms

LATENCY3.5ms

LATENCY1.7ms

LATENCY2.1ms

LATENCY1.8ms

LATENCY3.2ms

LATENCY0.9ms

LATENCY2.4ms

LATENCY1.1ms

LATENCY1.6ms

LATENCY2.8ms

LATENCY0.7ms

LATENCY3.0ms

LATENCY1.4ms

LATENCY2.2ms

LATENCY2.1ms

LATENCY1.8ms

LATENCY3.2ms

LATENCY0.9ms

LATENCY2.4ms

LATENCY1.1ms

LATENCY1.6ms

LATENCY2.8ms

LATENCY0.7ms

LATENCY3.0ms

LATENCY1.4ms

LATENCY2.2ms

LATENCY1.9ms

LATENCY0.8ms

LATENCY2.6ms

LATENCY1.3ms

LATENCY3.5ms

LATENCY1.7ms

LATENCY2.1ms

LATENCY1.8ms

LATENCY3.2ms

LATENCY0.9ms

LATENCY2.4ms

LATENCY1.1ms

LATENCY1.6ms

LATENCY2.8ms

LATENCY0.7ms

LATENCY3.0ms

LATENCY1.4ms

LATENCY2.2ms

GPU-01: 94%

GPU-02: 87%

GPU-03: 99%

GPU-04: 91%

GPU-05: 76%

GPU-06: 98%

GPU-07: 83%

GPU-08: 95%

GPU-09: 88%

GPU-10: 100%

GPU-11: 72%

GPU-12: 96%

GPU-13: 89%

GPU-14: 93%

GPU-15: 77%

GPU-16: 99%

GPU-17: 85%

GPU-18: 92%

GPU-19: 68%

GPU-20: 97%

GPU-01: 94%

GPU-02: 87%

GPU-03: 99%

GPU-04: 91%

GPU-05: 76%

GPU-06: 98%

GPU-07: 83%

GPU-08: 95%

GPU-09: 88%

GPU-10: 100%

GPU-11: 72%

GPU-12: 96%

GPU-13: 89%

GPU-14: 93%

GPU-15: 77%

GPU-16: 99%

GPU-17: 85%

GPU-18: 92%

GPU-19: 68%

GPU-20: 97%

12,400 tok/s

18,200 tok/s

9,800 tok/s

21,600 tok/s

15,100 tok/s

7,300 tok/s

24,800 tok/s

11,400 tok/s

19,700 tok/s

8,600 tok/s

16,300 tok/s

22,100 tok/s

13,900 tok/s

6,700 tok/s

20,400 tok/s

17,800 tok/s

10,200 tok/s

25,600 tok/s

14,700 tok/s

23,300 tok/s

12,400 tok/s

18,200 tok/s

9,800 tok/s

21,600 tok/s

15,100 tok/s

7,300 tok/s

24,800 tok/s

11,400 tok/s

19,700 tok/s

8,600 tok/s

16,300 tok/s

22,100 tok/s

13,900 tok/s

6,700 tok/s

20,400 tok/s

17,800 tok/s

10,200 tok/s

25,600 tok/s

14,700 tok/s

23,300 tok/s

Integration

One API. Any model.

            
vorn_quickstart.pyPYTHON 3.11+

          # Install: pip install vorn-sdk

          import vorn

          # Initialize the client — one line, zero config

          client = vorn.Client(

              api_key="vk_live_••••••••••••••••",

              region="auto",  # routes to nearest edge node

          )

          # Stream inference at sub-10ms TTFT

          response = client.inference.stream(

              model="vorn/ultra-70b",

              messages=[{"role": "user", "content": prompt}],

              max_tokens=4096,

              cold_start=False,  # guaranteed warm

          )

          for chunk in response:

              print(chunk.text, end="", flush=True)

Run the world's fastest AI infra.

Numbers that matter.

One API. Any model.

Ready to scale?