metadata
license: apache-2.0
language:
- en
base_model:
- ibm-granite/granite-4.0-h-micro
tags:
- conversational
- instruct
- mamba
- hybrid
there was originally going to be a better logo but i couldnt get any image model working. so this is what you all deserve
Info
Lune Mamba 3B is a Claude-OSS series model based on Granite 4.0 H(ybrid) Micro.
Claude-OSS is a (non-affiliated with Anthropic!) attempt to replicate the style of Anthropic's Claude model on top of open source bases.
| Benchmarks | Granite 4.0 H Micro | Lune Mamba 3B | Lune Mamba 3B GRPO_IF |
|---|---|---|---|
| MMLU | 63.7860 | 64.2338 | 64.3443 |
| IFEval* | 80.2218 | 75.0462 | 77.4492 |
| * IFEval numbers calculated from prompt loose accuracy |
Artifacts
- SFT checkpoint: allura-forge/claumba-micro-sft
- KTO checkpoint: You are here!
- GRPO (on IFeval) checkpoint: allura-org/Lune-Mamba-3B-v1-GRPO_IF