Benjamin-eecs
/

Llama-3.1-8B-Instruct-NLRL-TicTacToe-Policy

Feature Extraction

text-generation-inference

Model card Files Files and versions

Benjamin-eecs commited on Nov 24, 2024

Commit

abe5a38

·

verified ·

1 Parent(s): f0daef2

docs: update README.md

Files changed (1) hide show

README.md +3 -1

README.md CHANGED Viewed

@@ -3,6 +3,8 @@ library_name: transformers
 license: llama3.1
 base_model:
 - meta-llama/Llama-3.1-8B-Instruct
 ---
 # Model Card for Llama-3.1-8B-Instruct-NLRL-TicTacToe-Policy
@@ -62,4 +64,4 @@ Training data consists of state-action pairs collected through NLRL actor-critic
 ```
 ## Model Card Contact
-benjaminliu.eecs@gmail.com

 license: llama3.1
 base_model:
 - meta-llama/Llama-3.1-8B-Instruct
+tags:
+- nlrl
 ---
 # Model Card for Llama-3.1-8B-Instruct-NLRL-TicTacToe-Policy
 ```
 ## Model Card Contact
+benjaminliu.eecs@gmail.com