Commit 2cc4087 · Parent(s): ec62a6f
Update README.md
README.md CHANGED

````diff
@@ -1,16 +1,23 @@
-# StructBERT:
+# StructBERT: Unofficial Copy
 
 Official Repository Link: https://github.com/alibaba/AliceMind/tree/main/StructBERT
 
+**Disclaimer**
+* This model card is not produced by the [AliceMind Team](https://github.com/alibaba/AliceMind/)
+
 ## Reproduce HFHub models:
-
-
-
+Download the model weights, config and tokenizer vocab:
+```bash
+wget https://raw.githubusercontent.com/alibaba/AliceMind/main/StructBERT/config/large_bert_config.json && mv large_bert_config.json config.json
+wget https://raw.githubusercontent.com/alibaba/AliceMind/main/StructBERT/config/vocab.txt
+wget https://alice-open.oss-cn-zhangjiakou.aliyuncs.com/StructBERT/en_model && mv en_model pytorch_model.bin
+```
 
+```python
 from transformers import AutoConfig, AutoModelForMaskedLM, AutoTokenizer
 
-config = AutoConfig.from_pretrained("./
-model = AutoModelForMaskedLM.from_pretrained("
+config = AutoConfig.from_pretrained("./config.json")
+model = AutoModelForMaskedLM.from_pretrained(".", config=config)
 tokenizer = AutoTokenizer.from_pretrained(".", config=config)
 
 model.push_to_hub("structbert-large")
````
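If `wget` is not available, the same three files can be fetched from Python. This is a minimal sketch, not part of the commit: it uses only the standard library, and the URLs and target filenames are taken verbatim from the README above.

```python
import urllib.request

# (source URL, local filename) pairs, copied from the README's wget commands.
FILES = [
    ("https://raw.githubusercontent.com/alibaba/AliceMind/main/StructBERT/config/large_bert_config.json",
     "config.json"),
    ("https://raw.githubusercontent.com/alibaba/AliceMind/main/StructBERT/config/vocab.txt",
     "vocab.txt"),
    ("https://alice-open.oss-cn-zhangjiakou.aliyuncs.com/StructBERT/en_model",
     "pytorch_model.bin"),
]

for url, filename in FILES:
    print(f"Downloading {filename} ...")
    urllib.request.urlretrieve(url, filename)
```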
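Before pushing, it is worth checking that the rebuilt checkpoint loads and predicts sensibly. The sketch below is not part of the commit; it assumes the three files sit in the current directory, that `transformers` and `torch` are installed, and the test sentence is an arbitrary choice.

```python
from transformers import AutoConfig, AutoModelForMaskedLM, AutoTokenizer, pipeline

# Load exactly as the README does: config, weights and vocab from the
# current directory.
config = AutoConfig.from_pretrained("./config.json")
model = AutoModelForMaskedLM.from_pretrained(".", config=config)
tokenizer = AutoTokenizer.from_pretrained(".", config=config)

# StructBERT keeps BERT's architecture (the commit loads it with a plain
# BERT config), so the stock fill-mask pipeline applies; a plausible top
# prediction suggests the weights were mapped correctly.
fill_mask = pipeline("fill-mask", model=model, tokenizer=tokenizer)
print(fill_mask(f"The capital of France is {tokenizer.mask_token}."))
```

Note that the `push_to_hub` calls require an authenticated session, for example via `huggingface-cli login`. The second hunk of the commit adds the paper's full title ahead of the introduction: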
```diff
@@ -19,6 +26,7 @@ tokenizer.push_to_hub("structbert-large")
 
 [https://arxiv.org/abs/1908.04577](https://arxiv.org/abs/1908.04577)
 
+# StructBERT: Incorporating Language Structures into Pre-training for Deep Language Understanding
 ## Introduction
 We extend BERT to a new model, StructBERT, by incorporating language structures into pre-training.
 Specifically, we pre-train StructBERT with two auxiliary tasks to make the most of the sequential
```