graphics-llm / QUICKSTART.md

Tom

Update to Jina-CLIP-v2 embeddings and rebrand to Viz LLM

721d500 about 1 month ago


# Graphics Guide RAG App Quickstart

## Stack
- Frontend: Gradio 4.0+ (ChatInterface with auto API endpoints)
- Database: Supabase PGVector (1024-dim embeddings, HNSW index)
- LLM: HuggingFace Inference API (Llama-3.1-8B-Instruct)
- Embeddings: Jina AI API (jina-clip-v2, 1024-dim)
- Client: Supabase Python client + InferenceClient (huggingface_hub)
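
As a rough illustration of the embedding step in this stack, the sketch below builds a request for a 1024-dim `jina-clip-v2` embedding and checks the returned vector against the dimension the PGVector column expects. The endpoint URL, the payload shape, the `JINA_API_KEY` environment variable, and all function names are assumptions made for this example and are not taken from the app's code.

```python
import json
import os
import urllib.request

# Assumed values: the endpoint and payload shape follow Jina's public
# embeddings API as understood here; verify against the current API docs.
JINA_URL = "https://api.jina.ai/v1/embeddings"
EMBED_MODEL = "jina-clip-v2"
EMBED_DIM = 1024  # must match the vector column size in PGVector


def build_embedding_request(text: str, api_key: str) -> urllib.request.Request:
    """Build (but do not send) the POST request for a single text input."""
    payload = {"model": EMBED_MODEL, "input": [{"text": text}]}
    return urllib.request.Request(
        JINA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


def parse_embedding(response_body: bytes) -> list[float]:
    """Extract the first embedding and verify it has the expected size."""
    vector = json.loads(response_body)["data"][0]["embedding"]
    if len(vector) != EMBED_DIM:
        raise ValueError(f"expected {EMBED_DIM} dimensions, got {len(vector)}")
    return vector


def embed_query(text: str) -> list[float]:
    """Send the request; JINA_API_KEY is a hypothetical variable name."""
    request = build_embedding_request(text, os.environ["JINA_API_KEY"])
    with urllib.request.urlopen(request, timeout=30) as response:
        return parse_embedding(response.read())
```

Keeping request construction and response parsing separate from the network call makes the dimension check easy to exercise without an API key.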

## Key Parameters
- Temperature: 0.2 (low, to reduce hallucination)
- Max Tokens: 800 (moderate-length responses)
- Retrieval K: 5 documents
- Match Threshold: 0.5 (minimum cosine similarity)
- Connection: Direct via Supabase client
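
To show how these parameters interact, here is a minimal in-memory sketch: documents are scored by cosine similarity, anything below the match threshold is dropped, and at most K of the remainder are kept. In the app itself this filtering presumably happens inside Postgres through PGVector; the `RagParams` class and the function names below are hypothetical and exist only for this example.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class RagParams:
    """Defaults copied from the Key Parameters list above."""
    temperature: float = 0.2
    max_tokens: int = 800
    retrieval_k: int = 5
    match_threshold: float = 0.5


def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine of the angle between two vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def select_matches(
    query_vec: list[float],
    docs: list[tuple[str, list[float]]],
    params: RagParams = RagParams(),
) -> list[tuple[float, str]]:
    """Return up to retrieval_k (score, doc_id) pairs at or above the threshold."""
    scored = [(cosine_similarity(query_vec, vec), doc_id) for doc_id, vec in docs]
    kept = [pair for pair in scored if pair[0] >= params.match_threshold]
    kept.sort(key=lambda pair: pair[0], reverse=True)  # best match first
    return kept[: params.retrieval_k]


def generation_kwargs(params: RagParams = RagParams()) -> dict:
    """Sampling settings in the shape a chat-completion call typically accepts."""
    return {"temperature": params.temperature, "max_tokens": params.max_tokens}
```

The threshold is applied before the top-K cut, so a query with few relevant documents returns fewer than five results instead of padding the context with weak matches.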