Capability

Language Model Pretraining And Fine Tuning

20 artifacts provide this capability.

Want a personalized recommendation?

Top Matches

via “fine-tuning with causal language modeling objective”

text-generation model by undefined. 1,42,05,413 downloads.

Unique: Supports both full fine-tuning and LoRA-based parameter-efficient adaptation, with HuggingFace Trainer integration providing distributed training, mixed precision, and gradient checkpointing out-of-the-box for 124M-parameter models

vs others: Smaller and faster to fine-tune than GPT-3 (which requires API calls), but less capable at few-shot learning — requires more task-specific data to match GPT-3's zero-shot performance

Language Model Pretraining And Fine Tuning

Top Matches

Also Known As

Company