How to run this?
#3
by Autumnlight
How can I run this? I usually run exllama2 or Koboldcpp, but neither seems to support this.
Hi, we have added instructions to the model card on running the model with vLLM or transformers.
Trying to run the model from transformers with `AutoModelForCausalLM`, I get an authentication error: "Access to model ai21labs/AI21-Jamba-1.5-Mini is restricted. You must be authenticated to access it."
Can you please provide some authentication instructions?
@OmerAtEasily you'll need to:
- accept the model terms in the Model card
- get an Access Token from your huggingface settings page, as explained here: https://huggingface.co/docs/hub/en/security-tokens
- pass the token to `transformers` (explained below)
There are several ways to let transformers use your token:
- use the `huggingface-cli login` CLI command as explained here: https://huggingface.co/docs/huggingface_hub/en/guides/cli#huggingface-cli-login. It will write the token to a file in which `transformers` will automatically look for the token if it exists
- set it in the `HF_TOKEN` env var as explained here: https://huggingface.co/docs/huggingface_hub/en/package_reference/environment_variables#hftoken
- pass the token with the `token=` arg when loading the model/tokenizer as shown here: https://huggingface.co/docs/hub/en/security-tokens#how-to-use-user-access-tokens
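A minimal sketch of the env-var and `token=` options above, assuming `transformers` is installed and `hf_...` is replaced with a real token from your settings page (the token value here is a placeholder, not a working token):

```python
import os

# Option 2: expose the token via the HF_TOKEN environment variable,
# where transformers/huggingface_hub will pick it up automatically.
# "hf_..." is a placeholder -- paste your real access token.
os.environ["HF_TOKEN"] = "hf_..."

# Option 3: pass the token explicitly when loading (shown commented out
# since the actual download requires network access and accepted terms):
#
# from transformers import AutoModelForCausalLM, AutoTokenizer
# model_id = "ai21labs/AI21-Jamba-1.5-Mini"
# tokenizer = AutoTokenizer.from_pretrained(model_id, token=os.environ["HF_TOKEN"])
# model = AutoModelForCausalLM.from_pretrained(model_id, token=os.environ["HF_TOKEN"])
```

Either way, the token only works after you have accepted the model terms on the model card.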