Developer Documentation

Everything you need to integrate AI Metal Cluster into your applications.

Quick Start

Installation
# Using curl
curl -X POST https://your-cluster.aimetalcluster.com/api/v1/query \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"prompt": "Hello, world!", "model": "llama3.2"}'

Fast Integration

RESTful API compatible with OpenAI's format. Migrate existing code in minutes.

Secure by Default

TLS encryption, API key authentication, and IP whitelisting included.

On-Premise

Your data never leaves your infrastructure. Full sovereignty.

Core API Endpoints

POST /api/v1/query

Send prompts to LLM models running on your cluster.

{
  "prompt": "Explain quantum computing",
  "model": "llama3.2",
  "max_tokens": 1000,
  "temperature": 0.7
}
POST /api/v1/transcribe

Transcribe audio files using MLX Whisper (large-v3).

curl -X POST https://your-cluster/api/v1/transcribe \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -F "[email protected]" \
  -F "language=en"
POST /api/v1/generate-image

Generate images using Flux.1 on Apple Silicon.

{
  "prompt": "A futuristic city at sunset",
  "width": 1024,
  "height": 1024,
  "steps": 20
}
GET /health

Check cluster health and available resources.

{
  "status": "healthy",
  "gpu_memory_available": "245GB",
  "models_loaded": ["llama3.2", "whisper-large-v3"],
  "uptime": "14d 3h 22m"
}

Official SDKs

🐍

Python

pip install aimetalcluster
📦

Node.js

npm i @aimetalcluster/sdk
🦀

Rust

cargo add aimetalcluster
🐹

Go

go get aimetalcluster.com/sdk

Need Help?

Our engineering team is here to support your integration.