data4elm-SLaM-submission / dataset_info.json
{
  "description": "High-quality text dataset for edge language model training",
  "size": "8.1 GB",
  "instances": "2,604,072",
  "estimated_tokens": "~0.8B",
  "format": "JSONL format"
}