AI on demand: Usage
Latest revision as of 17:17, 14 April 2026
Overview
This page describes some useful commands for listing models and inspecting usage.
Usage
We assume you've received your API key from us in our usual manner.
Usage - List all models
Usage - List all models - Generic
# Set your personal key:
STONEY_KEY=sk-...
# List all generic models:
curl -s https://llm.stoney-cloud.com/v1/models \
-H "Authorization: Bearer $STONEY_KEY" \
| jq .
Example output:
{
"data": [
{
"object": "model",
"id": "allenai/olmOCR-2-7B",
"max_model_len": 8192
},
{
"object": "model",
"id": "MinerU2.5-2509-1.2B",
"max_model_len": 16384
},
{
"object": "model",
"id": "MiniMaxAI/MiniMax-M2.5",
"max_model_len": 196608
},
{
"object": "model",
"id": "NVIDIA/NVIDIA-Nemotron-3-Super-120B-A12B",
"max_model_len": 131072
},
{
"object": "model",
"id": "Qwen/Qwen3-Coder-Next",
"max_model_len": 262144
},
{
"object": "model",
"id": "swiss-ai/Apertus-70B-Instruct-2509",
"max_model_len": 65536
}
],
"object": "list"
}
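To choose a model by context window, the listing can be reduced to a compact table. The following is a minimal sketch, assuming the response has the shape shown in the example output above. The function name `list_model_limits` is hypothetical and not part of the API.

```shell
# Hypothetical helper (not part of the API): reads a /v1/models response
# on stdin and prints one "max_model_len<TAB>id" line per model,
# largest context window first. Assumes every entry has max_model_len.
list_model_limits() {
  jq -r '.data | sort_by(.max_model_len) | reverse | .[] | "\(.max_model_len)\t\(.id)"'
}

# Usage (requires STONEY_KEY to be set as above):
# curl -s https://llm.stoney-cloud.com/v1/models \
#   -H "Authorization: Bearer $STONEY_KEY" | list_model_limits
```

The filter runs locally on the JSON, so it can also be applied to a saved response file.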
Usage - List all models - Audio
# Set your personal key:
STONEY_KEY=sk-...
# List all audio models:
curl -s https://llm.stoney-cloud.com/v1/audio/models \
-H "Authorization: Bearer $STONEY_KEY" \
| jq .
Example output:
{
"data": [
{
"id": "mistralai/Voxtral-Mini-3B-2507",
"object": "model"
}
],
"object": "list"
}
Usage - Inspecting your usage
For now, we issue one key per model, so usage is tracked per model and per key.
# Set your personal key:
STONEY_KEY=sk-...
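# Set the desired month in the form YYYY-MM, for example '2026-03'.
# Here we default to the current month:
MONTH=$(date +%Y-%m)
# Quote the URL so the shell does not treat '?' as a glob character:
curl -s "https://llm.stoney-cloud.com/v1/usage?month=$MONTH" \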
-H "Authorization: Bearer $STONEY_KEY" \
| jq .
Example output:
{
"models": [
{
"cost_chf": 0.000002,
"model": "MinerU2.5-2509-1.2B",
"output": 16,
"input": 70
},
{
"cost_chf": 0.054537,
"model": "MiniMaxAI/MiniMax-M2.5",
"output": 3603,
"input": 10097
},
{
"cost_chf": 0.0016,
"model": "mistralai/Voxtral-Mini-3B-2507",
"seconds": 8
},
{
"cost_chf": 0.003774,
"model": "swiss-ai/Apertus-70B-Instruct-2509",
"output": 551,
"input": 443
}
],
"month": "2026-03",
"total_cost_chf": 0.059913
}
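To see which models account for most of the cost, the usage response can be sorted by `cost_chf`. The following is a minimal sketch, assuming the response has the shape shown in the example output above. The function name `usage_by_cost` is hypothetical and not part of the API.

```shell
# Hypothetical helper (not part of the API): reads a /v1/usage response
# on stdin and prints one "model<TAB>cost_chf" line per model,
# most expensive first.
usage_by_cost() {
  jq -r '.models | sort_by(.cost_chf) | reverse | .[] | [.model, .cost_chf] | @tsv'
}

# Usage (requires STONEY_KEY and MONTH to be set as above):
# curl -s "https://llm.stoney-cloud.com/v1/usage?month=$MONTH" \
#   -H "Authorization: Bearer $STONEY_KEY" | usage_by_cost
```

Note that jq may print very small costs in exponent notation.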