Skip to main content
Models is Ageneral’s OpenAI-compatible inference API at https://models.ageneral.ai. Request and response shapes align with common OpenAI client libraries for chat, responses, and listing models. Use Models to:
  • List models available to your credential with GET /v1/models.
  • Run chat workloads with POST /v1/chat/completions.
  • Run Responses-compatible workloads with POST /v1/responses.

How routing and billing work

Models forwards allowed requests to provider backends enabled for your project, records usage, and applies billing to your Ageneral account. Spend checks run when billing gates apply.

Endpoints

MethodPathPurpose
GET/v1/modelsList models for the caller
POST/v1/chat/completionsChat-style completions; supports streaming where configured
POST/v1/responsesResponses-compatible requests
Model identifiers use provider/model syntax, for example openai/gpt-4o-mini. The set returned by GET /v1/models depends on project configuration. See Catalog.

Authentication

  • Send Authorization: Bearer <token> on every request.
  • Use a virtual key (sk-bf-…) when your software calls Models.
  • Ageneral-hosted agents call Models with infrastructure credentials you do not create in the console.
A signed-in browser session on https://ageneral.ai does not authorize Models requests by itself. See Authentication for the full credential matrix.

Next steps

TopicGuide
First requestQuickstart
Listing modelsCatalog
Body fieldsRequest parameters
Credits and errorsUsage and billing
Interactive referenceModels API