runtime error
Exit code: 1. Reason:
safetensors:   0%|          | 0.00/3.67G [00:00<?, ?B/s]
model-00004-of-00004.safetensors:   2%|█         | 67.0M/3.67G [00:05<05:17, 11.3MB/s]
model-00004-of-00004.safetensors:   5%|█         | 201M/3.67G [00:07<01:44, 33.1MB/s]
model-00004-of-00004.safetensors:   9%|█         | 317M/3.67G [00:09<01:20, 41.4MB/s]
model-00004-of-00004.safetensors:  29%|███       | 1.05G/3.67G [00:10<00:14, 178MB/s]
model-00004-of-00004.safetensors:  60%|██████    | 2.19G/3.67G [00:11<00:03, 381MB/s]
model-00004-of-00004.safetensors:  93%|██████████| 3.40G/3.67G [00:12<00:00, 569MB/s]
model-00004-of-00004.safetensors: 100%|██████████| 3.67G/3.67G [00:12<00:00, 290MB/s]
The following generation flags are not valid and may be ignored: ['cache_implementation']. Set `TRANSFORMERS_VERBOSITY=info` for more details.
Loading checkpoint shards:   0%|          | 0/4 [00:00<?, ?it/s]
Loading checkpoint shards: 100%|██████████| 4/4 [00:00<00:00, 10817.03it/s]
generation_config.json:   0%|          | 0.00/223 [00:00<?, ?B/s]
generation_config.json: 100%|██████████| 223/223 [00:00<00:00, 1.06MB/s]
Traceback (most recent call last):
  File "/app/app.py", line 9, in <module>
    model = AutoModelForCausalLM.from_pretrained(
  File "/usr/local/lib/python3.10/site-packages/transformers/models/auto/auto_factory.py", line 604, in from_pretrained
    return model_class.from_pretrained(
  File "/usr/local/lib/python3.10/site-packages/transformers/modeling_utils.py", line 277, in _wrapper
    return func(*args, **kwargs)
  File "/usr/local/lib/python3.10/site-packages/transformers/modeling_utils.py", line 5140, in from_pretrained
    dispatch_model(model, **device_map_kwargs)
  File "/usr/local/lib/python3.10/site-packages/accelerate/big_modeling.py", line 504, in dispatch_model
    raise ValueError(
ValueError: You are trying to offload the whole model to the disk. Please use the `disk_offload` function instead.
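
Note on the ValueError above: the message suggests that the device map resolved during `from_pretrained` assigned every module of the model to disk. That usually means neither GPU nor CPU memory was judged large enough for the checkpoint, though the arguments passed in `/app/app.py` are not shown in the log, so this is an inference. The sketch below is a minimal, stdlib-only illustration of that condition. The helper names (`is_whole_model_on_disk`, `summarize_device_map`) and the example maps are hypothetical and are not part of transformers or accelerate.

```python
from typing import Dict, Mapping, Union

# A device map assigns module names to a target: a GPU index, "cpu", or "disk".
Device = Union[int, str]


def is_whole_model_on_disk(device_map: Mapping[str, Device]) -> bool:
    """Return True when every module in the map is assigned to "disk".

    Hypothetical helper: it mirrors the situation described by the
    ValueError in the log, it is not a library function.
    """
    targets = set(device_map.values())
    return bool(targets) and targets == {"disk"}


def summarize_device_map(device_map: Mapping[str, Device]) -> Dict[str, int]:
    """Count how many modules land on each target device."""
    counts: Dict[str, int] = {}
    for device in device_map.values():
        key = str(device)
        counts[key] = counts.get(key, 0) + 1
    return counts


# Illustrative maps only; the module names are assumptions, not from the log.
all_disk = {"model.embed_tokens": "disk", "model.layers.0": "disk", "lm_head": "disk"}
mixed = {"model.embed_tokens": "cpu", "model.layers.0": "cpu", "lm_head": "disk"}

if __name__ == "__main__":
    for name, dmap in (("all_disk", all_disk), ("mixed", mixed)):
        print(name, summarize_device_map(dmap), is_whole_model_on_disk(dmap))
```

If a check like this returns True for the map that would be used, the likely remedies are running on hardware with more RAM or a GPU, loading a smaller or quantized checkpoint, or following the error message and using accelerate's `disk_offload`.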