While deploying your models (either as-a-service or self-hosted) can help reduce costs and improve operations and scalability (and are almost always a must for production use), there are plenty of instances where it may be preferential to run models locally (i.
Source link