Developer Documentation
Everything you need to integrate AI Metal Cluster into your applications.
Quick Start
Installation
# Using curl
curl -X POST https://your-cluster.aimetalcluster.com/api/v1/query \
-H "Authorization: Bearer YOUR_API_KEY" \
-H "Content-Type: application/json" \
-d '{"prompt": "Hello, world!", "model": "llama3.2"}'
Fast Integration
RESTful API compatible with OpenAI's format. Migrate existing code in minutes.
Secure by Default
TLS encryption, API key authentication, and IP whitelisting included.
On-Premise
Your data never leaves your infrastructure. Full sovereignty.
Core API Endpoints
POST
/api/v1/query
Send prompts to LLM models running on your cluster.
{
"prompt": "Explain quantum computing",
"model": "llama3.2",
"max_tokens": 1000,
"temperature": 0.7
}
POST
/api/v1/transcribe
Transcribe audio files using MLX Whisper (large-v3).
curl -X POST https://your-cluster/api/v1/transcribe \
-H "Authorization: Bearer YOUR_API_KEY" \
-F "[email protected]" \
-F "language=en"
POST
/api/v1/generate-image
Generate images using Flux.1 on Apple Silicon.
{
"prompt": "A futuristic city at sunset",
"width": 1024,
"height": 1024,
"steps": 20
}
GET
/health
Check cluster health and available resources.
{
"status": "healthy",
"gpu_memory_available": "245GB",
"models_loaded": ["llama3.2", "whisper-large-v3"],
"uptime": "14d 3h 22m"
}
Official SDKs
🐍
Python
pip install aimetalcluster
📦
Node.js
npm i @aimetalcluster/sdk
🦀
Rust
cargo add aimetalcluster
🐹
Go
go get aimetalcluster.com/sdk