AWS Lambda executes code without provisioning servers, but performance depends on configuration. Cold starts add latency (first invocation). Memory affects CPU allocation and cost. Concurrency limits throttle requests. Optimization means tuning memory, reducing package size, managing initialization, and using Provisioned Concurrency for critical paths. Lambda is ideal for bursty, short-lived workloads. For sustained, long-running APIs, containers are often cheaper.