RunPod
Squaring up to Modal with a decorator-based Python SDK while seeding a creator marketplace for AI models.
◆Recent moves
- 2mo ago
Flash beta: Run Python functions on cloud GPUs
⚡ SPARKFlash brings a Modal-style decorator-based Python SDK to Runpod's serverless GPUs — wrap a function with @Endpoint, declare GPU type and dependencies, and the function runs remotely with auto-scaling and per-datacenter network volumes. This is the move that makes Runpod a direct competitor to Modal and Beam, not just a GPU rental service.
View source ↗ - 3mo ago
New Public Endpoints and expanded examples
A wide drop of Public Endpoints across modalities — SORA 2 / SORA 2 Pro, Kling 2.1/2.6, WAN 2.6 for video; Seedream 4.0 for images; Qwen3 32B and IBM Granite 4.0 for text; Chatterbox Turbo for audio. Plus a Vercel AI SDK provider package and configuration guides for OpenCode, Cursor, and Cline. The integration angle matters: Runpod is positioning to be the default model provider inside the AI coding tool stack.
View source ↗ - 4mo ago
GitHub release rollback GA and load balancing Serverless repos in beta
GitHub release rollback hits GA, letting users restore an earlier Serverless build directly from the console without waiting on a fresh release. Hub listings can now declare load-balanced Serverless deployment via hub.json, and downstream users choose between autoscaling Serverless or dedicated Pod deployment from the same listing. Operational maturity work for serious customers.
View source ↗ - 5mo ago
Pod migration in beta and Serverless development guides
Pod migration enters beta — when a stopped Pod's GPU is occupied by another tenant, Runpod can provision an equivalent Pod on a free machine and transfer data automatically. Solves one of the most painful workflow gaps for Pod users who pause and resume work across sessions. Plus a comprehensive new Serverless development-and-debugging guide set.
View source ↗ - 8mo ago
Slurm Clusters GA, cached models in beta, and new Public Endpoints available
Slurm Clusters reach GA, opening Runpod up to traditional HPC and distributed-training workloads with on-demand multi-node provisioning and pay-as-you-go billing. Cached models enter beta to eliminate worker startup model-download time. Public Endpoints add WAN 2.5 and Nano Banana for video and image-merging.
View source ↗ - 9mo ago
Hub revenue sharing launches and Pods UI gets refreshed
⚡ SPARKHub revenue share lets repo authors earn up to 7% of compute revenue when others deploy their listings, with credits auto-deposited monthly. This is the economic primitive that converts Runpod's Hub from a community catalogue into a creator marketplace. The Pods UI refresh alongside is cosmetic; the revenue share is the real move.
View source ↗