Launch HN: IonRouter (YC W26) – High-throughput, low-cost inference

via ionrouter.io

Short excerpt below. Read at the original source.

Hey HN — I’m Veer and my cofounder is Suryaa. We’re building Cumulus Labs (YC W26), and we’re releasing our latest product IonRouter (https://ionrouter.io/), an inference API for open-source and fine tuned models. You swap in our base URL, keep your existing OpenAI client code, and get access to any model (open source or finetuned […]

Read at Source