Audio Flamingo 3 NVIDIA-Stack Endpoint Template
This template uses the same core runtime pattern as NVIDIA's Space:
- llava code from nvidia/audio-flamingo-3 (space repo)
- base checkpoint from nvidia/audio-flamingo-3 (model repo)
- optional stage35 think/long adapter
Request contract
{
"inputs": {
"prompt": "Please describe the audio in detail.",
"audio_base64": "<base64 WAV bytes>",
"think_mode": true,
"max_new_tokens": 2048,
"temperature": 0.2
}
}
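The audio_base64 field carries the WAV file as base64 text, since JSON cannot hold raw bytes. A minimal sketch of building this payload on the client side follows; build_payload is a hypothetical helper name, and its default values simply mirror the example request above.

```python
import base64
import json


def build_payload(
    wav_bytes: bytes,
    prompt: str = "Please describe the audio in detail.",
    think_mode: bool = True,
    max_new_tokens: int = 2048,
    temperature: float = 0.2,
) -> dict:
    """Wrap raw WAV bytes in the request shape shown above.

    Hypothetical helper: the defaults are copied from the example request,
    not from the endpoint's handler code.
    """
    # Encode the raw bytes as ASCII-safe base64 text for the JSON body.
    audio_b64 = base64.b64encode(wav_bytes).decode("ascii")
    return {
        "inputs": {
            "prompt": prompt,
            "audio_base64": audio_b64,
            "think_mode": think_mode,
            "max_new_tokens": max_new_tokens,
            "temperature": temperature,
        }
    }
```

To use it, read a local WAV file in binary mode, pass the bytes to build_payload, and serialize the result with json.dumps before sending.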
Response contract
{
"generated_text": "...",
"mode": "think"
}
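One way to call the endpoint and unpack this response, using only the Python standard library, might look like the sketch below. The endpoint URL is whatever your deployment exposes, and the Bearer authorization header is an assumption based on how protected Hugging Face endpoints are usually called; call_endpoint and parse_response are hypothetical helper names.

```python
import json
import urllib.request


def parse_response(body: str) -> tuple[str, str]:
    """Extract the two fields of the response contract from a JSON string."""
    data = json.loads(body)
    return data["generated_text"], data["mode"]


def call_endpoint(
    url: str, token: str, payload: dict, timeout: float = 300.0
) -> tuple[str, str]:
    """POST a request payload to the endpoint and return (generated_text, mode).

    Assumption: the endpoint accepts a JSON body and a Bearer token header.
    """
    request = urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
    # Long audio with think mode may take a while, hence the generous timeout.
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return parse_response(response.read().decode("utf-8"))
```

The timeout of 300 seconds is an arbitrary illustrative value; adjust it to the audio lengths and token budgets you expect.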
Bootstrap command
python scripts/hf_clone.py af3-nvidia-endpoint --repo-id YOUR_USERNAME/YOUR_AF3_NVIDIA_ENDPOINT_REPO
Endpoint settings
- Task: custom
- GPU instance required
- Secrets: HF_TOKEN=<your_token>
Optional env vars
AF3_NV_CODE_REPO_ID=nvidia/audio-flamingo-3
AF3_NV_MODEL_REPO_ID=nvidia/audio-flamingo-3
AF3_NV_CODE_REPO_TYPE=space
AF3_NV_MODEL_REPO_TYPE=model
AF3_NV_DEFAULT_MODE=think
AF3_NV_LOAD_THINK=1
AF3_NV_LOAD_SINGLE=0
By default, the endpoint loads think/long mode, which is intended for higher-quality long-form reasoning.
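The handler code that consumes these variables is not shown here, so the following is only a sketch of how such settings could be read. It assumes the values listed above are the defaults and that the two LOAD flags are parsed as 1/0 booleans; Settings and load_settings are hypothetical names.

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional


@dataclass(frozen=True)
class Settings:
    code_repo_id: str
    model_repo_id: str
    code_repo_type: str
    model_repo_type: str
    default_mode: str
    load_think: bool
    load_single: bool


def _flag(value: str) -> bool:
    # Assumption: "1" enables a flag and "0" disables it, as in the listed values.
    return value.strip() == "1"


def load_settings(env: Optional[Mapping[str, str]] = None) -> Settings:
    """Read the AF3_NV_* variables, falling back to the values listed above."""
    env = os.environ if env is None else env
    return Settings(
        code_repo_id=env.get("AF3_NV_CODE_REPO_ID", "nvidia/audio-flamingo-3"),
        model_repo_id=env.get("AF3_NV_MODEL_REPO_ID", "nvidia/audio-flamingo-3"),
        code_repo_type=env.get("AF3_NV_CODE_REPO_TYPE", "space"),
        model_repo_type=env.get("AF3_NV_MODEL_REPO_TYPE", "model"),
        default_mode=env.get("AF3_NV_DEFAULT_MODE", "think"),
        load_think=_flag(env.get("AF3_NV_LOAD_THINK", "1")),
        load_single=_flag(env.get("AF3_NV_LOAD_SINGLE", "0")),
    )
```

Passing a plain dict instead of relying on os.environ makes the parsing easy to check without touching the process environment.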