Model Serving Infrastructure Patterns: How AI Agencies Deploy Models That Scale