What’s MXFP4? The 4-Bit Secret Powering OpenAI’s GPT‑OSS Models on Modest Hardware (Aug 8)
Assisted Generation: a new direction toward low-latency text generation (May 11, 2023)