graphics-llm / QUICKSTART.md
Tom
Update to Jina-CLIP-v2 embeddings and rebrand to Viz LLM
721d500
|
raw
history blame
599 Bytes

Graphics Guide RAG App Quickstart

Stack

  • Frontend: Gradio 4.0+ (ChatInterface with auto API endpoints)
  • Database: Supabase PGVector (1024-dim embeddings, HNSW index)
  • LLM: HuggingFace Inference API (Llama-3.1-8B-Instruct)
  • Embeddings: Jina AI API (jina-clip-v2, 1024-dim)
  • Client: Supabase Python client + InferenceClient (huggingface_hub)

Key Parameters

  • Temperature: 0.2 (low hallucination)
  • Max Tokens: 800 (moderate responses)
  • Retrieval K: 5 documents
  • Match Threshold: 0.5 (cosine similarity)
  • Connection: Direct via Supabase client