FreedomIntelligence
/

Apollo2-3.8B

Question Answering

Model card Files Files and versions

BossRui commited on Oct 15, 2024

Commit

f22f582

·

verified ·

1 Parent(s): 2a1e16f

Update README.md

Files changed (1) hide show

README.md +22 -19

README.md CHANGED Viewed

@@ -76,12 +76,23 @@ Covering 12 Major Languages including English, Chinese, French, Hindi, Spanish,
 * **[2024.10.15]** ApolloMoE repo is published！🎉
 ## Architecture
 <details>
   <summary>Click to view the MoE routing image</summary>
-  ![ApolloMoE](/assets/hybrid_routing.png)
 </details>
@@ -188,17 +199,17 @@ Covering 12 Major Languages including English, Chinese, French, Hindi, Spanish,
    <details><summary>Click to expand</summary>
-   We take Gemma-2b as example
    1. Download Dataset for project:
       ```
-      bash 0.download_data.sh
       ```
-   2. Prepare test and dev for specific model:
-      - Create test data for with special token, you can use ./util/check.ipynb to check models' special tokens
        ```
        bash 1.data_process_test&dev.sh
@@ -206,23 +217,21 @@ Covering 12 Major Languages including English, Chinese, French, Hindi, Spanish,
    3. Prepare train data for specific model (Create tokenized data in advance):
-      - You can adjust data Training order and Training Epoch in this step
        ```
        bash 2.data_process_train.sh
        ```
    4. Train the model
-      - If you want to train in Multi Nodes please refer to ./scripts/multi_node_train_*.sh
        ```
-       bash 3.single_node_train_gemma.sh
        ```
@@ -232,12 +241,6 @@ Covering 12 Major Languages including English, Chinese, French, Hindi, Spanish,
          bash 4.eval.sh
          ```
-   6. Evaluate your model: Play with your ckpts in bash
-         ```
-         python ./src/evaluate/cli_demo.py --model_name='./ckpts/your/path/tfmr'
-         ```
    </details>

 * **[2024.10.15]** ApolloMoE repo is published！🎉
+## Languages Coverage
+12 Major Languages and 38 Minor Languages
+<details>
+  <summary>Click to view the Languages Coverage</summary>
+   ![ApolloMoE](assets/languages.png)
+</details>
 ## Architecture
 <details>
   <summary>Click to view the MoE routing image</summary>
+  ![ApolloMoE](assets/hybrid_routing.png)
 </details>
    <details><summary>Click to expand</summary>
+   We take Apollo2-7B or Apollo-MoE-0.5B as example
    1. Download Dataset for project:
       ```
+      bash 0.download_data.sh
       ```
+   2. Prepare test and dev data for specific model:
+      - Create test data for with special token
        ```
        bash 1.data_process_test&dev.sh
    3. Prepare train data for specific model (Create tokenized data in advance):
+      - You can adjust data Training order and Training Epoch in this step
        ```
        bash 2.data_process_train.sh
        ```
    4. Train the model
+      - If you want to train in Multi Nodes please refer to ./src/sft/training_config/zero_multi.yaml
        ```
+       bash 3.single_node_train.sh
        ```
          bash 4.eval.sh
          ```
    </details>