Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
FastFlowLM
/
GPT-OSS-20B-NPU2
like
1
Text Generation
Transformers
gpt_oss
conversational
mxfp4
arxiv:
2508.10925
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
5
Deploy
Use this model
main
GPT-OSS-20B-NPU2
14.5 GB
2 contributors
History:
11 commits
FastFlowLM
feat: update layer and version (
#3
)
8c4a40b
verified
2 months ago
.gitattributes
1.97 kB
feat: Faster prefill
3 months ago
README.md
7.12 kB
Create README.md
3 months ago
attn.xclbin
592 kB
xet
feat: upload all xclbins
3 months ago
config.json
1.95 kB
feat: update layer and version (#3)
2 months ago
dequant_mxfp4.xclbin
279 kB
xet
feat: Faster prefill
3 months ago
dequant_q4_1.xclbin
114 kB
xet
feat: Faster prefill
3 months ago
expert.xclbin
146 kB
xet
feat: upload all xclbins
3 months ago
layer.xclbin
453 kB
xet
feat: update layer and version (#3)
2 months ago
lm_head.xclbin
153 kB
xet
feat: upload verified xclbin
3 months ago
mm.xclbin
544 kB
xet
feat: Faster prefill
3 months ago
model.q4nx
14.5 GB
xet
feat: add weights
3 months ago
tokenizer.json
27.9 MB
xet
feat: upload verified xclbin
3 months ago
tokenizer_config.json
21.8 kB
feat: upload verified xclbin
3 months ago