File size: 599 Bytes
721d500
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
# Graphics Guide RAG App Quickstart

## Stack
- **Frontend**: Gradio 4.0+ (ChatInterface with auto API endpoints)
- **Database**: Supabase PGVector (1024-dim embeddings, HNSW index)
- **LLM**: HuggingFace Inference API (Llama-3.1-8B-Instruct)
- **Embeddings**: Jina AI API (jina-clip-v2, 1024-dim)
- **Client**: Supabase Python client + InferenceClient (huggingface_hub)

## Key Parameters
- **Temperature**: 0.2 (low hallucination)
- **Max Tokens**: 800 (moderate responses)
- **Retrieval K**: 5 documents
- **Match Threshold**: 0.5 (cosine similarity)
- **Connection**: Direct via Supabase client