A deployment is in the state "Deployment Pending" until it has been successfully deployed on your cluster and is ready to accept requests.
- View the logs of your deployment endpoint: The logs will show you if deployment is pending because there is a syntax error, or for example a missing Python package.
- View the Cluster Status tab: The cluster status will show if there are issues with running the deployment. A common issue is your loading a large model in-memory but the deployment doesn't have enough memory reserved causing it to fail on an out-of-memory (OOM) exception.
Comments
0 comments
Please sign in to leave a comment.