Banana provides serverless GPU infrastructure for ML inference, built around its open-source HTTP framework called Potassium. Developers define model initialization and request handler functions using Potassium, which Banana then deploys and scales dynamically. The platform exposes a full Automation API with SDKs and a CLI for integrating deployments into existing pipelines. It supports GitHub integration, rolling deploys, branch deployments, environment management, and customizable inference queues for advanced workloads.