thanhdathoang commited on
Commit
beaf4e1
·
verified ·
1 Parent(s): 6bfc097

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +18 -0
README.md ADDED
@@ -0,0 +1,18 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model:
3
+ - griffith-bigdata/Qwen3-4B-SQL-Writer
4
+ ---
5
+
6
+ # FINER-SQL-4B-BIRD
7
+
8
+ Trained from [`griffith-bigdata/Qwen3-4B-SQL-Writer`](https://huggingface.co/griffith-bigdata/Qwen3-4B-SQL-Writer) using GRPO with two additional dense rewards from the FINER-SQL paper:
9
+
10
+ 🧠 Memory Reward — aligns reasoning with verified traces
11
+ ⚙️ Atomic Reward — measures operation-level SQL overlap
12
+
13
+ ✅ 68.4% EX on BIRD
14
+ ⚡ Runs efficient on a single 24 GB GPU
15
+
16
+ 📄 See other models: https://huggingface.co/collections/griffith-bigdata/finer-sql
17
+
18
+ 📄 Github code: https://github.com/thanhdath/finer-sql/tree/main