
LLMTune API Overview

LLMTune exposes a public REST API for inference, fine-tuning, and automation. The API is designed to be OpenAI-compatible where it matters (chat-style inference) while adding endpoints specific to LLMTune’s training and deployment workflow.

Base URL

Public API requests should be sent to:
https://api.llmtune.io/v1
The in-app Next.js routes under https://llmtune.io/api/... are used by the LLMTune web application itself. For external integrations, use the https://api.llmtune.io/v1 base URL described here.

Authentication

All requests must include a workspace API key in the Authorization header:
Authorization: Bearer YOUR_API_KEY
Create and manage API keys from the API Keys section in the LLMTune dashboard.
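As a minimal sketch, the header on a raw fetch call looks like this (JOB_ID is a placeholder; every endpoint below expects the same header):

// Minimal sketch: any endpoint takes the same Bearer header.
// JOB_ID is a placeholder for a real fine-tune job ID.
const res = await fetch("https://api.llmtune.io/v1/fine-tune/JOB_ID", {
  headers: { Authorization: `Bearer ${process.env.LLMTUNE_API_KEY}` },
});

console.log(res.status);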

Response Format

  • All responses are JSON.
  • Successful responses return domain-specific fields (for example: text, tokens, latency for inference).
  • Error responses use a standard shape:
{
  "error": {
    "message": "Description of what went wrong",
    "code": "ERROR_CODE",
    "details": { "optional": "context" }
  }
}
Common HTTP status codes:
  • 200 – Success
  • 400 – Invalid request
  • 401 – Invalid or missing API key
  • 402 – Insufficient credits / payment required
  • 404 – Resource not found (model, job, etc.)
  • 429 – Rate limit exceeded
  • 500 – Unexpected server error
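A thin TypeScript wrapper can surface this error shape as a thrown exception. The sketch below is illustrative only; the llmtune helper and its defaults are not part of any official SDK:

// Sketch: wrap fetch so the standard error shape becomes a thrown Error.
async function llmtune(path: string, method = "GET", body?: unknown) {
  const res = await fetch(`https://api.llmtune.io/v1${path}`, {
    method,
    headers: {
      Authorization: `Bearer ${process.env.LLMTUNE_API_KEY}`,
      "Content-Type": "application/json",
    },
    body: body === undefined ? undefined : JSON.stringify(body),
  });
  if (!res.ok) {
    // Error bodies follow the { error: { message, code, details } } shape above.
    const { error } = await res.json();
    throw new Error(`LLMTune ${res.status} ${error.code}: ${error.message}`);
  }
  return res.json();
}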

Idempotency

For operations that may be retried (for example fine-tune job creation or large batch submissions), you can provide an Idempotency-Key header. If LLMTune receives multiple requests with the same key, only the first is processed and subsequent ones return the original result.
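For example, a retry-safe fine-tune submission might look like this sketch (the request body fields are illustrative placeholders; see Fine-Tuning Endpoints for the real schema):

import { randomUUID } from "node:crypto";

// Generate the key once and reuse it for every retry of the same logical request.
const idempotencyKey = randomUUID();

const res = await fetch("https://api.llmtune.io/v1/fine-tune", {
  method: "POST",
  headers: {
    Authorization: `Bearer ${process.env.LLMTUNE_API_KEY}`,
    "Content-Type": "application/json",
    "Idempotency-Key": idempotencyKey,
  },
  // Illustrative body; field names are placeholders, not the documented schema.
  body: JSON.stringify({ baseModel: "...", dataset: "..." }),
});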

High-Level Endpoint Groups

Inference

Run completions against hosted or deployed models.
  • POST /v1/models/{modelId}/inference – Single inference request (OpenAI-style payload).
  • POST /v1/playground/inference – Playground-style inference for quick smoke tests.
  • POST /v1/batch/inference – Submit a batch of inference jobs with optional webhook callbacks.
See Inference Endpoints for full request/response schemas.
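As a sketch, a single inference call could look like this (MODEL_ID is a placeholder, and the destructured response fields follow the text/tokens/latency example from Response Format above):

const modelId = "MODEL_ID"; // substitute a hosted or deployed model ID

const res = await fetch(`https://api.llmtune.io/v1/models/${modelId}/inference`, {
  method: "POST",
  headers: {
    Authorization: `Bearer ${process.env.LLMTUNE_API_KEY}`,
    "Content-Type": "application/json",
  },
  // OpenAI-style payload, per the endpoint description above.
  body: JSON.stringify({
    messages: [{ role: "user", content: "Summarize LLMTune." }],
  }),
});

const { text, tokens, latency } = await res.json();
console.log(text, tokens, latency);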

Fine-Tuning

Launch and monitor fine-tuning jobs.
  • POST /v1/fine-tune – Submit a fine-tune job with base model, dataset location, and hyperparameters.
  • GET /v1/fine-tune/{jobId} – Retrieve job status, metrics, and errors.
See Fine-Tuning Endpoints for body examples.
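A submit-then-poll sketch (the body fields and the jobId response field are assumptions; consult Fine-Tuning Endpoints for the actual schema):

// Submit a job. Body fields here are illustrative placeholders.
const submitted = await fetch("https://api.llmtune.io/v1/fine-tune", {
  method: "POST",
  headers: {
    Authorization: `Bearer ${process.env.LLMTUNE_API_KEY}`,
    "Content-Type": "application/json",
  },
  body: JSON.stringify({
    baseModel: "meta-llama/Llama-3.3-70B-Instruct",
    dataset: "s3://my-bucket/train.jsonl",
    hyperparameters: { epochs: 3 },
  }),
});
const { jobId } = await submitted.json(); // field name is an assumption

// Poll for status, metrics, and errors.
const job = await fetch(`https://api.llmtune.io/v1/fine-tune/${jobId}`, {
  headers: { Authorization: `Bearer ${process.env.LLMTUNE_API_KEY}` },
}).then((r) => r.json());
console.log(job);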

Webhooks

Subscribe to lifecycle events such as training and deployment changes.
  • Events include: training.started, training.completed, training.failed, model.deployed.
  • Configure webhooks from the dashboard and point them at your backend.
See Webhooks for payload examples.
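A minimal receiver sketch in Node (the payload.event field name is an assumption; see Webhooks for the real payload shape):

import { createServer } from "node:http";

// Point your dashboard webhook at this server's URL.
createServer((req, res) => {
  let body = "";
  req.on("data", (chunk) => (body += chunk));
  req.on("end", () => {
    const payload = JSON.parse(body);
    // `payload.event` is assumed; check the Webhooks docs for the actual field.
    switch (payload.event) {
      case "training.completed":
        // e.g. kick off an evaluation run
        break;
      case "training.failed":
        // e.g. alert your team
        break;
    }
    res.writeHead(200).end();
  });
}).listen(3000);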

SDK Compatibility

The inference API is compatible with OpenAI-style clients. For example, using the OpenAI SDK you can point the base URL to LLMTune and pass your LLMTune key:
import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://api.llmtune.io/v1",
  apiKey: process.env.LLMTUNE_API_KEY,
});

const response = await client.chat.completions.create({
  model: "meta-llama/Llama-3.3-70B-Instruct",
  messages: [
    { role: "user", content: "Summarize LLMTune." }
  ],
});

Rate Limits

Rate limits depend on your plan. Generally:
  • Sandbox – Lower request and concurrency limits, great for experimentation.
  • Growth / Production – Higher limits suitable for production traffic.
  • Enterprise – Custom limits, SLAs, and private fleets.
If you exceed your limits, requests return 429 responses; inspect headers such as Retry-After to determine how long to wait before retrying.
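A backoff sketch that honors Retry-After (the one-second fallback when the header is absent is an assumption):

// Retries on 429, waiting Retry-After seconds between attempts.
// Pass a thunk so each attempt builds a fresh request.
async function withBackoff(send: () => Promise<Response>, maxAttempts = 3) {
  for (let attempt = 1; ; attempt++) {
    const res = await send();
    if (res.status !== 429 || attempt === maxAttempts) return res;
    const seconds = Number(res.headers.get("Retry-After") ?? "1");
    await new Promise((resolve) => setTimeout(resolve, seconds * 1000));
  }
}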

Next Steps