--- language: en license: mit --- # M-1023_longmult__0epoch_longmult3dig-rl ## Model Details - **Training Method**: VeRL Reinforcement Learning (RL) - **Stage Name**: rl - **Experiment**: 1023_longmult__0epoch_longmult3dig - **RL Framework**: VeRL (Versatile Reinforcement Learning) ## Training Configuration ## Experiment Tracking 🔗 **View complete experiment details**: https://huggingface.co/datasets/TAUR-dev/D-ExpTracker__1023_longmult__0epoch_longmult3dig__v1 ## Usage ```python from transformers import AutoTokenizer, AutoModelForCausalLM tokenizer = AutoTokenizer.from_pretrained("TAUR-dev/M-1023_longmult__0epoch_longmult3dig-rl") model = AutoModelForCausalLM.from_pretrained("TAUR-dev/M-1023_longmult__0epoch_longmult3dig-rl") ```