Update README.md
README.md
CHANGED
@@ -98,22 +98,23 @@ The evaluation is conducted using this [code](https://github.com/KaiYin97/DMRETR
 
 ---
 
-## 📦 Model List
-
-| Model
-
-| DMRetriever-33M
-| DMRetriever-33M-PT
-| DMRetriever-109M
-| DMRetriever-109M-PT
-| DMRetriever-335M
-| DMRetriever-335M-PT
-| DMRetriever-596M
-| DMRetriever-596M-PT
-| DMRetriever-4B
-| DMRetriever-4B-PT
-| DMRetriever-7.6B
-| DMRetriever-7.6B-PT
+## 📦 DMRetriever Series Model List
+
+| **Model** | **Description** | **Backbone** | **Backbone Type** | **Hidden Size** | **#Layers** |
+|:--|:--|:--|:--|:--:|:--:|
+| [DMRetriever-33M](https://huggingface.co/DMIR01/DMRetriever-33M) | Base 33M variant | MiniLM | Encoder-only | 384 | 12 |
+| [DMRetriever-33M-PT](https://huggingface.co/DMIR01/DMRetriever-33M-PT) | Pre-trained version of 33M | MiniLM | Encoder-only | 384 | 12 |
+| [DMRetriever-109M](https://huggingface.co/DMIR01/DMRetriever-109M) | Base 109M variant | BERT-base-uncased | Encoder-only | 768 | 12 |
+| [DMRetriever-109M-PT](https://huggingface.co/DMIR01/DMRetriever-109M-PT) | Pre-trained version of 109M | BERT-base-uncased | Encoder-only | 768 | 12 |
+| [DMRetriever-335M](https://huggingface.co/DMIR01/DMRetriever-335M) | Base 335M variant | BERT-large-uncased-WWM | Encoder-only | 1024 | 24 |
+| [DMRetriever-335M-PT](https://huggingface.co/DMIR01/DMRetriever-335M-PT) | Pre-trained version of 335M | BERT-large-uncased-WWM | Encoder-only | 1024 | 24 |
+| [DMRetriever-596M](https://huggingface.co/DMIR01/DMRetriever-596M) | Base 596M variant | Qwen3-0.6B | Decoder-only | 1024 | 28 |
+| [DMRetriever-596M-PT](https://huggingface.co/DMIR01/DMRetriever-596M-PT) | Pre-trained version of 596M | Qwen3-0.6B | Decoder-only | 1024 | 28 |
+| [DMRetriever-4B](https://huggingface.co/DMIR01/DMRetriever-4B) | Base 4B variant | Qwen3-4B | Decoder-only | 2560 | 36 |
+| [DMRetriever-4B-PT](https://huggingface.co/DMIR01/DMRetriever-4B-PT) | Pre-trained version of 4B | Qwen3-4B | Decoder-only | 2560 | 36 |
+| [DMRetriever-7.6B](https://huggingface.co/DMIR01/DMRetriever-7.6B) | Base 7.6B variant | Qwen3-8B | Decoder-only | 4096 | 36 |
+| [DMRetriever-7.6B-PT](https://huggingface.co/DMIR01/DMRetriever-7.6B-PT) | Pre-trained version of 7.6B | Qwen3-8B | Decoder-only | 4096 | 36 |
+
 
 ---
 