uavleeva/grpo_sql_run_002
Transformers
TensorBoard
Safetensors
English
text-generation-inference
unsloth
qwen2
trl
License: apache-2.0
Files and versions
Branch: main, 662 MB, 1 contributor, History: 6 commits
Latest commit: Update README.md (019dce3, verified) by uavleeva, 19 days ago
File                                                Size            Last commit                        Updated
.gitattributes                                      1.57 kB         Upload model trained with Unsloth  24 days ago
README.md                                           574 Bytes       Update README.md                   19 days ago
adapter_config.json                                 1.21 kB         Upload model trained with Unsloth  24 days ago
adapter_model.safetensors                           646 MB (xet)    Upload model trained with Unsloth  24 days ago
added_tokens.json                                   632 Bytes       Upload model trained with Unsloth  24 days ago
chat_template.jinja                                 2.51 kB         Upload model trained with Unsloth  24 days ago
events.out.tfevents.1770135027.4b475d52079d.1773.0  208 kB (xet)    tfevents                           24 days ago
events.out.tfevents.1770135031.4b475d52079d.1773.1  251 kB (xet)    tfevents                           24 days ago
merges.txt                                          1.67 MB         Upload model trained with Unsloth  24 days ago
special_tokens_map.json                             613 Bytes       Upload model trained with Unsloth  24 days ago
tokenizer.json                                      11.4 MB (xet)   Upload model trained with Unsloth  24 days ago
tokenizer_config.json                               4.89 kB         Upload model trained with Unsloth  24 days ago
vocab.json                                          2.78 MB         Upload model trained with Unsloth  24 days ago