I tried loading the model and running a 6653-token prompt (CPU only). It showed a 48 GB RAM spike and then crashed lol. The RAM usage seems unusually high to me.
Updated it! Thank you for reporting the issue! Please try the fixed version.