Running a model on a large dataset in a single scheduled job rather than in real time. Batch inference is more cost-efficient than real-time serving for use cases such as nightly report generation, bulk document classification, or periodic customer scoring.
Book a 30-minute call to discuss how these AI concepts translate to your specific industry and business challenges.