Puxis97
/

Mixtral-8x7B-Python-Coder-CodeAlpaca

text-generation-inference

mixture-of-experts

code-generation

Model card Files Files and versions

Puxis97 commited on about 1 month ago

Commit

ace70ee

·

verified ·

1 Parent(s): 67bd044

Update README.md

Files changed (1) hide show

README.md +24 -7

README.md CHANGED Viewed

@@ -1,22 +1,39 @@
 ---
-base_model: unsloth/gpt-oss-20b-unsloth-bnb-4bit
 tags:
 - text-generation-inference
 - transformers
 - unsloth
-- gpt_oss
-- trl
 license: apache-2.0
 language:
 - en
 ---
-# Uploaded  model
 - **Developed by:** Puxis97
 - **License:** apache-2.0
-- **Finetuned from model :** unsloth/gpt-oss-20b-unsloth-bnb-4bit
-This gpt_oss model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
-[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)

 ---
+# ⚠️ YAML Header
+# This section defines the model's metadata on Hugging Face
+base_model: mistralai/Mixtral-8x7B-Instruct-v0.1
 tags:
 - text-generation-inference
 - transformers
 - unsloth
+- mixtral
+- mixture-of-experts
+- qlora
+- code-generation
+- python-coder
+- code-alpaca
 license: apache-2.0
 language:
 - en
 ---
+# Puxis97/Mixtral-8x7B-Python-Coder-CodeAlpaca 🐍
+This model is a **Mixtral 8x7B Instruct** model fine-tuned using **QLoRA** on the **CodeAlpaca 20K** dataset to specialize in **Python code instruction following and generation**.
 - **Developed by:** Puxis97
 - **License:** apache-2.0
+- **Finetuned from model :** mistralai/Mixtral-8x7B-Instruct-v0.1
+### Training Details
+This fine-tuned model was built for high-efficiency using **Unsloth's QLoRA optimizations** and the Hugging Face TRL library, resulting in a powerful, instruction-following code generation model that runs on consumer GPUs.
+| Setting | Value |
+| :--- | :--- |
+| **Base Model** | `mistralai/Mixtral-8x7B-Instruct-v0.1` |
+| **Dataset** | `HuggingFaceH4/CodeAlpaca_20K` |
+| **Method** | QLoRA (4-bit quantization) |
+| **Task** | Code Instruction Following / Python Coding |
+[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)