sigridjineth/colbert-ko-embeddinggemma-300m

sigridjineth committed on Sep 14

Commit a1b935a (verified) · 1 Parent(s): 65b4710

Update README.md

Files changed (1)

README.md +32 -17


@@ -23,23 +23,38 @@ The model demonstrated stable and consistent improvement throughout the training
 The true power of the fine-tuned model is its ability to understand semantic context beyond simple keyword matching. In the following challenging example, the fine-tuned model correctly infers the answer, while the original base model fails.
-  * **쿼리 (Query):**
-    ```
-    "일론 머스크가 설립한 전기차 회사는 어디야?"
-    ```
-  * **✅ Fine-tuned Model Results:**
-    1.  **`Score: 10.00`**: **테슬라**는 모델 S, 3, X, Y를 생산하며 오토파일럿 기능으로 유명합니다.
-    2.  **`Score: 9.51`**: **스페이스X**는 재사용 가능한 로켓을 개발하여 우주 탐사 비용을 크게 낮췄습니다.
-    3.  **`Score: 8.57`**: 아마존 웹 서비스(AWS)는 클라우드 컴퓨팅 시장의 선두주자입니다.
-  * **❌ Original Model Results:**
-    1.  **`Score: 8.55`**: 수도권 전철은 서울과 주변 도시를 연결하는 중요한 교통수단입니다.
-    2.  **`Score: 8.54`**: **테슬라**는 모델 S, 3, X, Y를 생산하며 오토파일럿 기능으로 유명합니다.
-    3.  **`Score: 8.39`**: **스페이스X**는 재사용 가능한 로켓을 개발하여 우주 탐사 비용을 크게 낮췄습니다.
 **Analysis**: The fine-tuned model correctly identifies 'Tesla' by understanding the semantic relationship between the query and the document, even with no direct keyword overlap. In contrast, the original model is easily confused by distractors and fails to rank the correct answer first, demonstrating the significant impact of the ColBERT fine-tuning process.

 The true power of the fine-tuned model is its ability to understand semantic context beyond simple keyword matching. In the following challenging example, the fine-tuned model correctly infers the answer, while the original base model fails.
+```
+$ python inference.py
+Using device: cuda
+Loading fine-tuned model...
+Fine-tuned model loaded.
+Loading original (pre-trained) model for comparison...
+Original model loaded.
+==================================================
+Query: 일론 머스크가 설립한 전기차 회사는 어디야?
+==================================================
+--- 1. ✅ Fine-tuned Model Results ---
+  Rank 1 (Score: 9.00): 테슬라는 모델 S, 3, X, Y를 생산하며 오토파일럿 기능으로 유명합니다.
+  Rank 2 (Score: 7.92): 스페이스X는 재사용 가능한 로켓을 개발하여 우주 탐사 비용을 크게 낮췄습니다.
+  Rank 3 (Score: 7.72): 아마존 웹 서비스(AWS)는 클라우드 컴퓨팅 시장의 선두주자입니다.
+  Rank 4 (Score: 7.23): 수도권 전철은 서울과 주변 도시를 연결하는 중요한 교통수단입니다.
+  Rank 5 (Score: 5.77): 대한민국의 수도는 서울입니다. 서울은 경제와 문화의 중심지입니다.
+  Rank 6 (Score: 5.43): 일본의 수도는 도쿄입니다. 벚꽃이 아름다운 도시죠.
+  Rank 7 (Score: 5.40): 프랑스의 수도는 파리이며, 에펠탑으로 유명합니다.
+--- 2. ❌ Original Model Results ---
+  Rank 1 (Score: 9.13): 수도권 전철은 서울과 주변 도시를 연결하는 중요한 교통수단입니다.
+  Rank 2 (Score: 8.79): 테슬라는 모델 S, 3, X, Y를 생산하며 오토파일럿 기능으로 유명합니다.
+  Rank 3 (Score: 8.77): 일본의 수도는 도쿄입니다. 벚꽃이 아름다운 도시죠.
+  Rank 4 (Score: 8.71): 대한민국의 수도는 서울입니다. 서울은 경제와 문화의 중심지입니다.
+  Rank 5 (Score: 8.53): 아마존 웹 서비스(AWS)는 클라우드 컴퓨팅 시장의 선두주자입니다.
+  Rank 6 (Score: 8.48): 스페이스X는 재사용 가능한 로켓을 개발하여 우주 탐사 비용을 크게 낮췄습니다.
+  Rank 7 (Score: 8.24): 프랑스의 수도는 파리이며, 에펠탑으로 유명합니다.
+```
 **Analysis**: The fine-tuned model correctly identifies 'Tesla' by understanding the semantic relationship between the query and the document, even with no direct keyword overlap. In contrast, the original model is easily confused by distractors and fails to rank the correct answer first, demonstrating the significant impact of the ColBERT fine-tuning process.
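
For readers unfamiliar with how a ColBERT-style model produces scores like the ones above, the following is a minimal sketch of late-interaction (MaxSim) scoring. It is not the `inference.py` from this repository, whose contents are not shown in this diff. The function names (`normalize`, `maxsim_score`, `rank`), the 2-dimensional toy vectors, and the document labels are all hypothetical and chosen only for illustration. The sketch assumes the score is the sum, over query tokens, of the highest cosine similarity to any document token, which is the standard ColBERT formulation; whether the logged scores for the query 일론 머스크가 설립한 전기차 회사는 어디야? ("Which electric car company did Elon Musk found?") were computed exactly this way is an assumption.

```python
import math


def normalize(vec):
    """Scale a vector to unit length so dot products equal cosine similarity."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)


def maxsim_score(query_tokens, doc_tokens):
    """Late-interaction score: for each query token, take the best-matching
    document token (max cosine similarity), then sum over query tokens."""
    q = [normalize(v) for v in query_tokens]
    d = [normalize(v) for v in doc_tokens]
    total = 0.0
    for qv in q:
        total += max(sum(a * b for a, b in zip(qv, dv)) for dv in d)
    return total


def rank(query_tokens, docs):
    """Return (label, score) pairs sorted from highest to lowest score."""
    scored = [(label, maxsim_score(query_tokens, toks)) for label, toks in docs.items()]
    return sorted(scored, key=lambda item: item[1], reverse=True)


# Hypothetical toy token embeddings (a real model emits one vector per token).
query = [[1.0, 0.0], [0.0, 1.0]]
docs = {
    "doc_a": [[1.0, 0.0], [0.0, 2.0]],  # has a close match for both query tokens
    "doc_b": [[1.0, 0.0], [1.0, 0.0]],  # matches only the first query token
}

ranking = rank(query, docs)
for position, (label, score) in enumerate(ranking, start=1):
    print(f"Rank {position} (Score: {score:.2f}): {label}")
```

Because every query token contributes its own best match, a document can score well without sharing surface keywords with the query, as long as its token embeddings lie close to the query's. Under this formulation the maximum possible score equals the number of query token vectors, so absolute scores are only comparable for the same query and the same model.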