Modal Labs is a serverless platform purpose-built for machine learning. Instead of managing Kubernetes clusters or EC2 instances, you write Python functions, decorate them with @app.function(), and Modal handles deployment, scaling, and GPU allocation. Modal excels at inference (running pre-trained models), batch processing, and on-demand workloads. You pay only for compute time. Model loading happens once (containerized), then requests are routed to warm instances.