API Documentation
Quick Start
InferNest is fully compatible with the OpenAI SDK. Change one line:
Python
from openai import OpenAI
client = OpenAI(
base_url="https://infernest.xyz/v1",
api_key="sk-your-key-here"
)
response = client.chat.completions.create(
model="deepseek-v4-flash",
messages=[{"role": "user", "content": "Hello!"}]
)Node.js
import OpenAI from "openai";
const client = new OpenAI({
baseURL: "https://infernest.xyz/v1",
apiKey: "sk-your-key-here"
});
const response = await client.chat.completions.create({
model: "deepseek-v4-flash",
messages: [{ role: "user", content: "Hello!" }]
});curl
curl https://infernest.xyz/v1/chat/completions \
-H "Content-Type: application/json" \
-H "Authorization: Bearer sk-your-key-here" \
-d '{"model":"deepseek-v4-flash","messages":[{"role":"user","content":"Hello!"}]}'Authentication
Include your API key in the Authorization header:
Authorization: Bearer sk-your-key-hereFind your API key in the Dashboard.
Endpoints
| Endpoint | Description |
|---|---|
| POST /v1/chat/completions | Chat completions |
| POST /v1/embeddings | Text embeddings |
| GET /v1/models | List available models |
Model IDs
deepseek-v4-flashqwen-3.6-27bglm-5.2doubao-pro-256kRate Limits
Default limits per API key:
- • 60 requests per minute
- • 100,000 tokens per minute
Need higher limits? Contact us after signing up.