I have 6 different models trained. I registered models and created endpoints separately in azure ml and able to access endpoints and getting outputs. But noticed that each endpoint took one virtual machine separately. 6 endpoints took 6 virtual machines.
How can create or deploy my 6 endpoints to a single virtual machine. Because in future my models may increase.
Note: all my 6 models are different and each endpoint output is different.
If you have more than one model, then one possible alternate solution can be by creating a
multi-model deployment.With this all-model file are registered as a single model asset on Azure and loaded simultaneously in the scoring script and the scoring script parses each request for a "model" field and routes the payload accordingly.
Below is sample code snippet for the custom scoring script:
For detailed example code, you can check this notebook.