Techniques

LoRA (Low-Rank Adaptation)

Definition

A parameter-efficient fine-tuning method that freezes the pre-trained weights and injects small trainable low-rank matrices into transformer layers instead of updating all model weights. LoRA enables cost-effective domain adaptation of large models on modest hardware, making enterprise fine-tuning accessible without full GPU clusters.
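The low-rank update can be sketched in a few lines of plain Python. This is a minimal illustration, not a production implementation: the helper names (`matvec`, `lora_forward`), the toy dimensions, and the `alpha / r` scaling convention are assumptions chosen for the example. `W` stands in for a frozen pre-trained weight matrix, while `A` and `B` are the small trainable matrices.

```python
import random

def matvec(M, x):
    """Multiply matrix M (a list of rows) by vector x."""
    return [sum(m * v for m, v in zip(row, x)) for row in M]

def lora_forward(W, A, B, x, alpha, r):
    """Compute h = W x + (alpha / r) * B (A x).

    W: frozen pre-trained weight, shape (d_out, d_in)
    A: trainable down-projection, shape (r, d_in)
    B: trainable up-projection, shape (d_out, r)
    """
    base = matvec(W, x)               # output of the frozen layer
    update = matvec(B, matvec(A, x))  # low-rank correction, rank at most r
    scale = alpha / r
    return [b + scale * u for b, u in zip(base, update)]

# Toy sizes (illustrative only; real layers are far larger).
random.seed(0)
d_in, d_out, r, alpha = 8, 6, 2, 4

W = [[random.gauss(0, 1) for _ in range(d_in)] for _ in range(d_out)]
A = [[random.gauss(0, 0.01) for _ in range(d_in)] for _ in range(r)]
B = [[0.0] * r for _ in range(d_out)]  # zero init: the adapter starts as a no-op
x = [random.gauss(0, 1) for _ in range(d_in)]

h = lora_forward(W, A, B, x, alpha, r)

full_params = d_out * d_in        # weights a full fine-tune would update
lora_params = r * (d_in + d_out)  # weights LoRA trains instead
```

With `B` initialised to zero, the adapted layer reproduces the frozen layer exactly before training starts, and only `A` and `B` would receive gradient updates. The saving grows with layer size, since the trainable count scales with `r * (d_in + d_out)` rather than `d_in * d_out`.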

Related Terms

PEFT (Parameter-Efficient Fine-Tuning)

A family of techniques, including LoRA, prefix tuning, and adapters, that adapt large pre-trained models to new tasks by training only a small fraction of parameters. Because only that small set of parameters is trained and stored for each task, PEFT sharply reduces compute and storage costs compared to full fine-tuning.
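To make "a small fraction of parameters" concrete, the arithmetic for a LoRA-style setup can be sketched as follows. Every number here is a hypothetical configuration picked for illustration (hidden size, layer count, rank, number of adapted matrices, and total model size), not a measurement of any specific model.

```python
def lora_trainable_params(d_model, n_layers, rank, matrices_per_layer):
    """Count trainable parameters when each adapted d_model x d_model
    matrix gets one (rank x d_model) and one (d_model x rank) adapter."""
    per_matrix = rank * (d_model + d_model)
    return n_layers * matrices_per_layer * per_matrix

# Hypothetical configuration, for illustration only.
total_model_params = 7_000_000_000  # assumed size of the frozen base model
lora_params = lora_trainable_params(
    d_model=4096,
    n_layers=32,
    rank=8,
    matrices_per_layer=2,  # e.g. adapting two projection matrices per layer
)

fraction = lora_params / total_model_params
print(f"trainable parameters: {lora_params:,}")
print(f"share of the base model: {fraction:.4%}")
```

Under these assumptions the adapter holds a few million parameters, well under a tenth of a percent of the base model, which is why per-task checkpoints stay small.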

Fine-tuning

The process of taking a pre-trained AI model and further training it on a specific dataset to adapt it for a particular task or domain. Fine-tuning can deliver significantly better performance than prompting alone for specialised enterprise workflows.
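The idea of continuing training from existing weights can be shown with a deliberately tiny example. This is a toy sketch under stated assumptions: the "model" is a single weight `w` in `y = w * x`, the starting value stands in for pre-trained weights, and `domain_data`, the learning rate, and the step count are all made up for illustration.

```python
def mse(w, data):
    """Mean squared error of the one-parameter model y = w * x."""
    return sum((w * x - y) ** 2 for x, y in data) / len(data)

def fine_tune(w, data, lr=0.05, steps=50):
    """Continue gradient descent on new data, starting from existing w."""
    for _ in range(steps):
        grad = sum(2 * (w * x - y) * x for x, y in data) / len(data)
        w -= lr * grad
    return w

pretrained_w = 1.0  # stands in for weights learned during pre-training
domain_data = [(1.0, 3.0), (2.0, 6.0), (3.0, 9.0)]  # new task follows y = 3x

tuned_w = fine_tune(pretrained_w, domain_data)

print("loss before fine-tuning:", mse(pretrained_w, domain_data))
print("loss after fine-tuning: ", mse(tuned_w, domain_data))
```

Real fine-tuning applies the same loop to millions or billions of weights with a task-specific loss, but the structure is the same: start from the pre-trained values rather than from random initialisation, then keep optimising on the new dataset.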

Transformer

The dominant neural network architecture for language, vision, and multimodal AI, introduced in the 2017 "Attention Is All You Need" paper. Transformers use self-attention to process all tokens in parallel, enabling training on internet-scale data and powering nearly every major LLM in use today.
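Self-attention can be sketched as scaled dot-product attention over a handful of token vectors. This is a simplified, single-head illustration: it uses the token vectors directly as queries, keys, and values (real transformers first apply learned projections), and the function names and toy inputs are assumptions for the example.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def self_attention(X):
    """Single-head scaled dot-product attention with Q = K = V = X.

    X: list of token vectors, each of length d.
    Returns (output vectors, attention weights).
    """
    d = len(X[0])
    weights = []
    for q in X:
        # Similarity of this token to every token, scaled by sqrt(d).
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d) for k in X]
        weights.append(softmax(scores))
    # Each output is a weighted average of all token vectors.
    output = [
        [sum(w * v[j] for w, v in zip(row, X)) for j in range(d)]
        for row in weights
    ]
    return output, weights

tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]  # three toy 2-d token vectors
out, attn = self_attention(tokens)
```

The loops are written out for readability. In practice the same computation is expressed as a few matrix multiplications, which is what allows every token position to be processed at once on parallel hardware.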
