Completions

POST /v1/chat/completions

Body
model: string (enum), required
reasoning_effort: string (enum), optional, default: medium
metadata: object, optional
frequency_penalty: number, min -2, max 2, optional, default: 0
max_completion_tokens: number, optional, default: 128
n: number (enum), optional, default: 1
presence_penalty: number, min -2, max 2, optional, default: 0
seed: number, optional, default: 882534
service_tier: string (enum), optional, default: default
stop: string[], optional
stream: boolean, optional, default: false
temperature: number, max 2, optional, default: 1
top_p: number, max 1, optional, default: 1
tool_choice: any of, optional; one accepted form is a string (enum)
parallel_tool_calls: boolean, optional, default: true
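
The numeric bounds listed for the body parameters can be checked on the client before a request is sent. The following is a minimal sketch, not part of the Distribute API: `build_chat_body` and `LIMITS` are hypothetical names, and `messages` is taken from the example request rather than from the parameter list.

```python
from typing import Any, Optional

# (min, max) bounds copied from the parameter list; None means no bound is documented.
LIMITS: dict[str, tuple[Optional[float], Optional[float]]] = {
    "frequency_penalty": (-2, 2),
    "presence_penalty": (-2, 2),
    "temperature": (None, 2),
    "top_p": (None, 1),
}


def build_chat_body(model: str, messages: list[dict[str, Any]], **options: Any) -> dict[str, Any]:
    """Hypothetical helper: assemble a request body and enforce the documented bounds."""
    if not model:
        raise ValueError("model is required")
    body: dict[str, Any] = {"model": model, "messages": messages, **options}
    for name, (low, high) in LIMITS.items():
        if name not in body:
            continue  # omitted options fall back to the server-side defaults
        value = body[name]
        if low is not None and value < low:
            raise ValueError(f"{name}={value} is below the documented minimum {low}")
        if high is not None and value > high:
            raise ValueError(f"{name}={value} is above the documented maximum {high}")
    return body
```

Options that are left out are simply not sent, so the documented defaults (for example `max_completion_tokens` of 128) apply on the server side.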
Responses
200: Default Response (application/json)
400: Default Response (application/json)
404: Default Response (application/json)
500: Default Response (application/json)
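
A client will usually branch on these four status codes. The sketch below assumes only standard HTTP semantics (400 and 404 are client errors, 500 is a server error); the shape of the error bodies is not documented here, so the parsed JSON is passed along untouched. `interpret_response` is a hypothetical helper name.

```python
import json
from typing import Any


def interpret_response(status: int, raw_body: str) -> dict[str, Any]:
    """Hypothetical helper: return the parsed body on 200, raise on other statuses."""
    payload = json.loads(raw_body) if raw_body else {}
    if status == 200:
        return payload
    if status in (400, 404):
        # Client-side problem: retrying the same request will not help.
        raise ValueError(f"client error {status}: {payload}")
    if status == 500:
        # Server-side problem: a caller may choose to retry later.
        raise RuntimeError(f"server error {status}: {payload}")
    raise RuntimeError(f"undocumented status {status}: {payload}")
```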
Example request
POST /v1/chat/completions HTTP/1.1
Host: 
Content-Type: application/json
Accept: */*
Content-Length: 520

{
  "messages": [
    {
      "role": "system",
      "content": "text",
      "name": "text"
    }
  ],
  "model": "Llama-3.1 8B",
  "reasoning_effort": "medium",
  "metadata": {},
  "frequency_penalty": 0,
  "max_completion_tokens": 128,
  "n": 1,
  "modalities": [
    "text"
  ],
  "presence_penalty": 0,
  "seed": 882534,
  "service_tier": "default",
  "stop": [
    "text"
  ],
  "stream": false,
  "stream_options": {
    "include_usage": true
  },
  "temperature": 1,
  "top_p": 1,
  "tools": [
    {
      "type": "function",
      "function": {
        "description": "text",
        "name": "text",
        "parameters": {},
        "strict": false
      }
    }
  ],
  "tool_choice": "none",
  "parallel_tool_calls": true
}
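
The same request can be assembled with Python's standard library. This is a sketch under stated assumptions: the `Host` header in the example is blank, so `BASE_URL` is a placeholder, and the `Authorization: Bearer` scheme is an assumption that should be checked against the Authentication page. `make_request` is a hypothetical helper name.

```python
import json
import urllib.request
from typing import Any

# Placeholder only: the example request does not state a host.
BASE_URL = "https://api.example.invalid"


def make_request(body: dict[str, Any], api_key: str) -> urllib.request.Request:
    """Hypothetical helper: build (but do not send) a POST to /v1/chat/completions."""
    data = json.dumps(body).encode("utf-8")
    return urllib.request.Request(
        url=f"{BASE_URL}/v1/chat/completions",
        data=data,
        method="POST",
        headers={
            "Content-Type": "application/json",
            # Assumed auth scheme, not confirmed by this page.
            "Authorization": f"Bearer {api_key}",
        },
    )
```

Passing the returned object to `urllib.request.urlopen` would send it; `Content-Length` is filled in by the library from the encoded body.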
Example response
{
  "id": "text",
  "choices": [
    {
      "finish_reason": "stop",
      "index": 1,
      "message": {
        "content": "text",
        "tool_calls": [
          {
            "id": "text",
            "type": "function",
            "function": {
              "name": "text",
              "arguments": "text"
            }
          }
        ],
        "role": "system",
        "name": "text"
      }
    }
  ],
  "created": 1,
  "model": "Llama-3.1 8B",
  "service_tier": "text",
  "system_fingerprint": "text",
  "object": "chat.completion",
  "usage": {
    "completion_tokens": 1,
    "prompt_tokens": 1,
    "total_tokens": 1
  }
}
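
The response fields shown above can be pulled out with a few dictionary lookups. A minimal sketch, assuming the response always follows the example's shape; `summarize_completion` is a hypothetical helper name, and `arguments` is left as the raw string the API returns.

```python
import json
from typing import Any


def summarize_completion(raw: str) -> dict[str, Any]:
    """Hypothetical helper: extract the first choice, its tool calls, and token usage."""
    data = json.loads(raw)
    first = data["choices"][0]
    message = first["message"]
    # tool_calls may be absent or null when the model answers with plain text.
    calls = [
        (call["function"]["name"], call["function"]["arguments"])
        for call in message.get("tool_calls") or []
    ]
    return {
        "text": message.get("content"),
        "finish_reason": first["finish_reason"],
        "tool_calls": calls,
        "total_tokens": data["usage"]["total_tokens"],
    }
```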