Spaces:
Running
Running
| # Graphics Guide RAG App Quickstart | |
| ## Stack | |
| - **Frontend**: Gradio 4.0+ (ChatInterface with auto API endpoints) | |
| - **Database**: Supabase PGVector (1024-dim embeddings, HNSW index) | |
| - **LLM**: HuggingFace Inference API (Llama-3.1-8B-Instruct) | |
| - **Embeddings**: Jina AI API (jina-clip-v2, 1024-dim) | |
| - **Client**: Supabase Python client + InferenceClient (huggingface_hub) | |
| ## Key Parameters | |
| - **Temperature**: 0.2 (low hallucination) | |
| - **Max Tokens**: 800 (moderate responses) | |
| - **Retrieval K**: 5 documents | |
| - **Match Threshold**: 0.5 (cosine similarity) | |
| - **Connection**: Direct via Supabase client | |