Posts for: #AI

Why I Run Local Models

Why I Run Local Models

Yet again, Claude is down. Yet again, my local models just keep working. A case study in why the complexity of self-hosted AI is worth it.

[Read more]

AI on-demand with Kubernetes (Part 2)

AI on-demand with Kubernetes (Part 2)

Part two of the quest to deliver AI applications on-demand from Kubernetes. In this one, we’ll deploy a typical AI application, and configure it to scale up (and down to zero) on demand.

[Read more]

Inference in the Cloud with Modal

Playing with contemporary machine-learning models can demand hardware with a pretty hefty pricetag. Modal lets you do it in the cloud with a much more reasonable pricing model than the big Cloud Compute providers.

[Read more]