技术方法

Knowledge Distillation

定义

A training technique where a smaller "student" model is trained to replicate the behaviour of a larger "teacher" model. Distillation produces compact, fast models suitable for latency-sensitive or resource-constrained deployments without sacrificing too much quality.

相关术语

Quantization

A model compression technique that reduces the numerical precision of model weights—for example, from 32-bit floats to 8-bit integers—shrinking memory requirements and accelerating inference with minimal accuracy loss. Quantization is essential for deploying LLMs on-premise or at the edge.

Model Compression

A set of techniques—including quantization, distillation, pruning, and low-rank factorisation—that reduce model size and computational requirements while preserving performance. Model compression is essential for deploying powerful models on edge hardware or within cost budgets.

Fine-tuning

The process of taking a pre-trained AI model and further training it on a specific dataset to adapt it for a particular task or domain. Fine-tuning can deliver significantly better performance than prompting alone for specialised enterprise workflows.

了解术语只是第一步，将其落地应用才是第二步。

预约一次 Physical AI 适配性沟通，探讨这些 AI 概念如何转化到您所在的具体行业与业务挑战中。