Spaces:
Running
Running
Graphics Guide RAG App Quickstart
Stack
- Frontend: Gradio 4.0+ (ChatInterface with auto API endpoints)
- Database: Supabase PGVector (1024-dim embeddings, HNSW index)
- LLM: HuggingFace Inference API (Llama-3.1-8B-Instruct)
- Embeddings: Jina AI API (jina-clip-v2, 1024-dim)
- Client: Supabase Python client + InferenceClient (huggingface_hub)
Key Parameters
- Temperature: 0.2 (low hallucination)
- Max Tokens: 800 (moderate responses)
- Retrieval K: 5 documents
- Match Threshold: 0.5 (cosine similarity)
- Connection: Direct via Supabase client