Severing a Model Version
Model serving is the next step of model deployment. After deploying a model version, we can optionally start serving it. This creates a Model Endpoint which is a stable URL associated with a model, of the following format:
For example a Model named my-model
within Project named my-project
with the base domain models.id.merlin.dev
will have a Model Endpoint which look as follows:
Having a Model Endpoint makes it easy to keep updating the model (creating a new model version, running it and then serving it) without having to modify the model URL used by the called system.
Serving a Model Version
A model version can be served via the SDK or the UI.
Serving a Model Version via SDK
To serve a model version, you can call serve_traffic()
function from Merlin Python SDK.
Serving a Model Version via UI
Once a model version is deployed (i.e., it is in the Running state), the Serve option can be selected from the model versions view.
Last updated