yujiepan/opt-tiny-2layers-random

Text Generation

text-generation-inference


yujiepan committed on Sep 2, 2023

Commit 3f9bed2 · 1 Parent(s): e2d08cc

Update README.md


README.md +4 -3


@@ -10,7 +10,9 @@ library_name: transformers
 # yujiepan/opt-tiny-2layers-random
-This model is **randomly initialized**, using the config from [https://huggingface.co/facebook/opt-30b], but with following changes:
 ```python
 config.ffn_dim = 32
 config.hidden_size = 8
@@ -36,14 +38,13 @@ config.num_attention_heads = 2
 config.num_hidden_layers = 2
 config.word_embed_proj_dim = 8
-model = transformers.AutoModelForCausalLM.from_config(config, torch_dtype=torch.float16)
 model.save_pretrained(save_path)
 tokenizer = transformers.AutoTokenizer.from_pretrained('facebook/opt-30b')
 tokenizer.save_pretrained(save_path)
 ovmodel = OVModelForCausalLM.from_pretrained(save_path, export=True)
-ovmodel = ovmodel.half()
 ovmodel.save_pretrained(save_path)
 os.system(f'ls -alh {save_path}')

 # yujiepan/opt-tiny-2layers-random
+This model is **randomly initialized**, using the config from [https://huggingface.co/facebook/opt-30b] but the size is smaller.
+Note the model is in float32.
 ```python
 config.ffn_dim = 32
 config.hidden_size = 8
 config.num_hidden_layers = 2
 config.word_embed_proj_dim = 8
+model = transformers.AutoModelForCausalLM.from_config(config, torch_dtype=torch.float32)
 model.save_pretrained(save_path)
 tokenizer = transformers.AutoTokenizer.from_pretrained('facebook/opt-30b')
 tokenizer.save_pretrained(save_path)
 ovmodel = OVModelForCausalLM.from_pretrained(save_path, export=True)
 ovmodel.save_pretrained(save_path)
 os.system(f'ls -alh {save_path}')
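
The practical effect of switching `torch_dtype` from `torch.float16` to `torch.float32` is that each stored weight takes 4 bytes instead of 2, so the raw weight storage roughly doubles. The sketch below only illustrates that arithmetic; `weight_bytes` and the parameter count `example_params` are hypothetical names and values chosen for illustration, not taken from this repository, and real checkpoint files also carry some metadata overhead.

```python
# Bytes per element for the two dtypes named in the diff.
BYTES_PER_ELEMENT = {"float16": 2, "float32": 4}


def weight_bytes(num_params: int, dtype: str) -> int:
    """Return the raw storage needed for num_params weights of the given dtype."""
    if num_params < 0:
        raise ValueError("num_params must be non-negative")
    return num_params * BYTES_PER_ELEMENT[dtype]


# Hypothetical parameter count, used only to illustrate the ratio.
example_params = 400_000
fp16_size = weight_bytes(example_params, "float16")
fp32_size = weight_bytes(example_params, "float32")
print(f"float16: {fp16_size} bytes, float32: {fp32_size} bytes")
```

Comparing the two numbers against the `ls -alh` listing printed by the script is one way to sanity-check which dtype was actually saved.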