Part two of the quest to deliver AI applications on-demand from Kubernetes. In this one, we’ll deploy a typical AI application, and configure it to scale up (and down to zero) on demand.
Posts for: #AI
AI on-demand with Kubernetes (Part 1)
Extending my Kubernetes cluster with a GPU/CUDA node to deliver on-demand AI applications
Local Llama3 Assistant in JetBrains
Notes on integrating a local LLM with JetBrains IDEA IDEs
Using multi-modal AI for better image captions
Using a genuine multi-modal AI model to generate photo captions gives a huge leap in quality over my previous efforts.
Inference in the Cloud with Modal
Playing with contemporary machine-learning models can demand hardware with a pretty hefty pricetag. Modal lets you do it in the cloud with a much more reasonable pricing model than the big Cloud Compute providers.
After the AI Apocalypse
A second attempt at using AI to generate some captions for my photographs
Bucharest Botanic Gardens
Photographs taken in Bucharest’s Dimitrie Brândză Botanical Garden, captioned by AI
The Internet’s AI Bomb Spike
I don’t know if you’ve noticed, but search engines got a lot less useful lately…