Update README.md
## Deploy the model on a SageMaker LMI Endpoint

**Important Note** - It is recommended to run the Jupyter notebook below on a [SageMaker notebook instance](https://docs.aws.amazon.com/sagemaker/latest/dg/nbi.html). Please make sure the IAM role for the notebook instance has the [AmazonEC2ContainerRegistryFullAccess](https://docs.aws.amazon.com/AmazonECR/latest/userguide/security-iam-awsmanpol.html) managed policy attached.

Please refer to this [Jupyter Notebook](https://github.com/awslabs/extending-the-context-length-of-open-source-llms/tree/main/long-llava-qwen2-7b/notebooks/deploy-on-aws-sagemaker-long-llava-qwen2-7b.ipynb) to see how to deploy the model with a SageMaker [large model inference (LMI) container](https://docs.aws.amazon.com/sagemaker/latest/dg/large-model-inference-container-docs.html).
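For orientation before opening the notebook: an LMI endpoint is typically configured through a `serving.properties` file packaged alongside the model artifacts. The sketch below uses standard LMI option keys, but the specific values (model ID, GPU count) are illustrative assumptions, not the exact settings used in the notebook:

```properties
# Inference engine used by DJL Serving inside the LMI container
engine=Python
# Hugging Face model ID or S3 path to the model artifacts (assumed value)
option.model_id=long-llava-qwen2-7b
# Number of GPUs to shard the model across (assumed value)
option.tensor_parallel_degree=1
# Load weights in half precision to reduce GPU memory usage
option.dtype=fp16
```

The notebook handles packaging this configuration and creating the endpoint; consult it for the exact container image URI and instance type.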