
llama.cpp

#8
by PotatoSniffer - opened

Please open a pull request in the official llama.cpp repo.

Note that the current llama.cpp fork doesn't seem to work, at least not on apple silicon: https://huggingface.co/stepfun-ai/Step-3.5-Flash-Int4/discussions/2 (nevermind, my download was corrupted)

Mine's corrupted too.

For anyone who finds this in the future: llama.cpp has been updated to support Step 3.5 Flash, and the implementation works on the latest version of llama.cpp as of this writing. Tested on Apple Silicon at q8 quantization, and it seems to be working well so far!

https://github.com/ggml-org/llama.cpp/pull/19283
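For anyone trying this out, a minimal invocation sketch (the GGUF filename below is hypothetical; substitute your own conversion or download):

```shell
# Build llama.cpp at or after the PR linked above, then run the chat CLI.
# The model path is an assumption -- point it at your own GGUF file.
./llama-cli -m ./step-3.5-flash-q8_0.gguf -p "Hello" -n 64
```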
