refactor(app): improve streaming, background search, dtype fallback, and cleanup :contentReference[oaicite:0]{index=0} 293686e Luigi commited on Apr 22, 2025
usue chat pipeline instead of model and tokenizer individually ac8e9cc Luigi commited on Apr 12, 2025
Improve responsiveness by asynchronously retrieving web search context acda3f1 Luigi commited on Apr 11, 2025