Puxis97 committed on
Commit ace70ee · verified · 1 Parent(s): 67bd044

Update README.md

Files changed (1)
  1. README.md +24 -7
README.md CHANGED
@@ -1,22 +1,39 @@
  ---
- base_model: unsloth/gpt-oss-20b-unsloth-bnb-4bit
+ # ⚠️ YAML Header
+ # This section defines the model's metadata on Hugging Face
+ base_model: mistralai/Mixtral-8x7B-Instruct-v0.1
  tags:
  - text-generation-inference
  - transformers
  - unsloth
- - gpt_oss
- - trl
+ - mixtral
+ - mixture-of-experts
+ - qlora
+ - code-generation
+ - python-coder
+ - code-alpaca
  license: apache-2.0
  language:
  - en
  ---

- # Uploaded model
+ # Puxis97/Mixtral-8x7B-Python-Coder-CodeAlpaca 🐍
+
+ This model is a **Mixtral 8x7B Instruct** model fine-tuned using **QLoRA** on the **CodeAlpaca 20K** dataset to specialize in **Python code instruction following and generation**.

  - **Developed by:** Puxis97
  - **License:** apache-2.0
- - **Finetuned from model :** unsloth/gpt-oss-20b-unsloth-bnb-4bit
+ - **Finetuned from model :** mistralai/Mixtral-8x7B-Instruct-v0.1
+
+ ### Training Details
+
+ This fine-tuned model was built for high-efficiency using **Unsloth's QLoRA optimizations** and the Hugging Face TRL library, resulting in a powerful, instruction-following code generation model that runs on consumer GPUs.

- This gpt_oss model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
+ | Setting | Value |
+ | :--- | :--- |
+ | **Base Model** | `mistralai/Mixtral-8x7B-Instruct-v0.1` |
+ | **Dataset** | `HuggingFaceH4/CodeAlpaca_20K` |
+ | **Method** | QLoRA (4-bit quantization) |
+ | **Task** | Code Instruction Following / Python Coding |

- [<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
+ [<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
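
The new Training Details text says the 4-bit QLoRA model "runs on consumer GPUs". A back-of-the-envelope check of that claim can be sketched as below. This is only a rough estimate under stated assumptions: the 46.7B total parameter count is the figure Mistral AI has published for Mixtral 8x7B and does not come from this commit, and the calculation covers the weights alone, ignoring quantization metadata, activations, the KV cache, and any LoRA adapter or optimizer state.

```python
# Rough weight-memory estimate for Mixtral 8x7B at different precisions.
# Assumption (not from the commit): 46.7e9 total parameters.
TOTAL_PARAMS = 46.7e9
BITS_PER_WEIGHT_4BIT = 4
BITS_PER_WEIGHT_FP16 = 16


def weight_memory_gb(num_params: float, bits_per_weight: int) -> float:
    """Return memory needed for the weights alone, in GB (10^9 bytes)."""
    bytes_total = num_params * bits_per_weight / 8
    return bytes_total / 1e9


fp16_gb = weight_memory_gb(TOTAL_PARAMS, BITS_PER_WEIGHT_FP16)
four_bit_gb = weight_memory_gb(TOTAL_PARAMS, BITS_PER_WEIGHT_4BIT)

print(f"fp16 weights:  ~{fp16_gb:.0f} GB")
print(f"4-bit weights: ~{four_bit_gb:.0f} GB")
```

Under these assumptions the 4-bit weights alone come to roughly 23 GB, so "consumer GPUs" would in practice mean a card with about 24 GB of memory or more, with little headroom left for context and adapters.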