Serving "Frankenstein" (combined) models at scale

46 Views Asked by bli00 At 28 July 2025 at 02:21

I have a tensorflow model that's combined with a clustering algorithm in (HDBSCAN). Both have been trained/fitted separately but they work together (tf -> hdbscan). I'm looking to serve predictions on GCP at scale.

Currently, I've created a custom serving container that stitches the models together in python, but you can imagine that this isn't very performant, especially since the tf model is loaded in eager mode. Are there canonical solutions to this problem?

An idea I have is to run the canonical tf server detached inside the container and have a outside facing server that intercepts request, passes it to the local tf server, then run the clustering algorithm on the tf server response, but I'm not sure how well this will work or if there's better ways.

Original Q&A

Serving "Frankenstein" (combined) models at scale

There are 0 best solutions below

Related Questions in TENSORFLOW

Related Questions in SCIKIT-LEARN

Related Questions in TENSORFLOW-SERVING

Related Questions in HDBSCAN

Trending Questions

Popular # Hahtags

Popular Questions