hannahbillo/dpo-llama3-8b-sample-rules

PEFT
TensorBoard
Safetensors
llama
trl
dpo
Generated from Trainer
dpo-llama3-8b-sample-rules / runs
75.3 kB
  • 1 contributor
History: 9 commits
Latest commit: hannahbillo, "Training in progress, step 112", b0f0513 (verified), over 1 year ago
  • Aug18_16-50-14_bf1f810b9c5e
    Training in progress, step 50 over 1 year ago
  • Aug18_16-52-20_bf1f810b9c5e
    Training in progress, step 50 over 1 year ago
  • Aug18_16-53-35_bf1f810b9c5e
    Training in progress, step 50 over 1 year ago
  • Aug18_16-58-26_bf1f810b9c5e
    Training in progress, step 50 over 1 year ago
  • Aug18_17-01-07_bf1f810b9c5e
    Training in progress, step 50 over 1 year ago
  • Aug18_17-03-18_bf1f810b9c5e
    Training in progress, step 112 over 1 year ago
  • Aug18_20-08-45_372fd28787fa
    Training in progress, step 112 over 1 year ago
  • Aug19_23-37-14_ce490d00d11c
    Training in progress, step 112 over 1 year ago