ArtusDev
/

requests-exl

@@ -21,6 +21,12 @@ tags:
     box-shadow: 0 4px 12px rgba(0,0,0,0.3);
     border: 1px solid #3c3c3c;
   }
   .card-dark h1 {
     font-size: 2.2em;
     color: #ffffff;
@@ -70,11 +76,12 @@ tags:
     padding: 3px 6px;
     border-radius: 4px;
     font-family: 'Fira Code', 'Courier New', monospace;
-    color: #4fc1ff;
   }
   a {
     color: #569cd6;
     text-decoration: none;
   }
   a:hover {
     text-decoration: underline;
@@ -89,7 +96,7 @@ tags:
 <div class="container-dark">
-  <div class="card-dark">
     <h1>EXL3 Quantization Requests</h1>
     <p class="subtitle">Community hub for requesting EXL3 quants.</p>
   </div>
@@ -139,11 +146,11 @@ tags:
     <h2>How to Download and Use EXL Quants</h2>
     <p>Each quantization size for a model is stored in a separate HF repository branch. You can download a specific quant size by its branch.</p>
     <p>For example, to download the <code class="inline-code-dark">4.0bpw_H6</code> quant:</p>
-    <p>Install hugginface-cli:</p>
     <pre><code>pip install -U "huggingface_hub[cli]"</code></pre>
-    <p>Download quant by targeting the specific quant size (revision):</p>
     <pre><code>huggingface-cli download ArtusDev/MODEL_NAME --revision "4.0bpw_H6" --local-dir ./</code></pre>
-    <p style="margin-top: 15px;">EXL3 quants can be run with any inference client that supports the EXL3 format, such as <a href="https://github.com/theroyallab/tabbyapi" target="_blank"><b>TabbyAPI</b></a>. Please refer to <a href="https://github.com/theroyallab/tabbyAPI/wiki/01.-Getting-Started" target="_blank">documentation</a> for set up instructions.</p>
   </div>
   <div class="card-dark">

     box-shadow: 0 4px 12px rgba(0,0,0,0.3);
     border: 1px solid #3c3c3c;
   }
+  .card-dark.card-dark-title h1 {
+    font-size: 1.5em;
+    color: #ffffff;
+    text-align: center;
+    margin-bottom: 10px;
+  }
   .card-dark h1 {
     font-size: 2.2em;
     color: #ffffff;
     padding: 3px 6px;
     border-radius: 4px;
     font-family: 'Fira Code', 'Courier New', monospace;
+    color: #c586c0;
   }
   a {
     color: #569cd6;
     text-decoration: none;
+    font-weight: 600;
   }
   a:hover {
     text-decoration: underline;
 <div class="container-dark">
+  <div class="card-dark card-dark-title">
     <h1>EXL3 Quantization Requests</h1>
     <p class="subtitle">Community hub for requesting EXL3 quants.</p>
   </div>
     <h2>How to Download and Use EXL Quants</h2>
     <p>Each quantization size for a model is stored in a separate HF repository branch. You can download a specific quant size by its branch.</p>
     <p>For example, to download the <code class="inline-code-dark">4.0bpw_H6</code> quant:</p>
+    <p><b>1. Install hugginface-cli:</b></p>
     <pre><code>pip install -U "huggingface_hub[cli]"</code></pre>
+    <p><b>2. Download quant by targeting the specific quant size (revision):</b></p>
     <pre><code>huggingface-cli download ArtusDev/MODEL_NAME --revision "4.0bpw_H6" --local-dir ./</code></pre>
+    <p>EXL3 quants can be run with any inference client that supports the EXL3 format, such as <a href="https://github.com/theroyallab/tabbyapi" target="_blank"><b>TabbyAPI</b></a>. Please refer to <a href="https://github.com/theroyallab/tabbyAPI/wiki/01.-Getting-Started" target="_blank">documentation</a> for set up instructions.</p>
   </div>
   <div class="card-dark">