Nemotron-Terminal
Collection
We are releasing Nemotron-Terminal models and training datasets. • 7 items • Updated
• 11
Nemotron-Terminal is a family of models specialized for autonomous terminal interaction, fine-tuned from the Qwen3 (8B, 14B, and 32B). Developed by NVIDIA, these models utilize Nemotron-Terminal-Corpus, a large-scale open-source dataset for terminal tasks, to achieve performance that rivals frontier models many times their size.
We release the following variants of the Nemotron-Terminal family:
The Nemotron-Terminal family demonstrates profound leaps in capability compared to the Qwen3 baselines across multiple specialized categories.
| Model | Size | Base Accuracy | Nemotron-Terminal Accuracy |
|---|---|---|---|
| Nemotron-Terminal-8B | 8B | 2.47% | 13.0% |
| Nemotron-Terminal-14B | 14B | 4.04% | 20.2% |
| Nemotron-Terminal-32B | 32B | 3.37% | 27.4% |
The models are trained using the Terminus 2 scaffolding and output a structured JSON format. For evaluation on Terminal Bench 2.0, we encourage using Terminus 2 scaffolding to maintain consistency with training.
{
"analysis": "Analysis of the current terminal state...",
"plan": "Step-by-step plan for the next command...",
"commands": [
{
"keystrokes": "ls -la\n",
"duration": 0.1
}
],
"task_complete": false
}