Capability
Dataset Formatting And Preprocessing Utilities
4 artifacts provide this capability.
Top Matches
via “dataset-formatting-and-preprocessing-utilities”
Train transformer language models with reinforcement learning.
Unique: Provides task-specific data collators (SFT, RLHF, DPO) that handle padding, truncation, and format conversion automatically, so common training objectives need no hand-written preprocessing code.
vs others: More integrated than generic data loaders, because it understands TRL's training objectives and formats data to match them; more flexible than fixed-format datasets, because it supports multiple input formats.
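To make "handle padding, truncation, and format conversion" concrete, here is a minimal plain-Python sketch of what an SFT-style collator typically does to a batch of tokenized examples. It is an illustration only and does not use or reproduce TRL's own collator classes: the function name `collate_sft_batch` and its parameters are hypothetical, and a real collator would return tensors rather than lists. The one external fact it relies on is that PyTorch's cross-entropy loss ignores label `-100` by default.

```python
from typing import Dict, List

# Label value skipped by PyTorch's cross-entropy loss (its default ignore_index).
IGNORE_INDEX = -100


def collate_sft_batch(
    examples: List[List[int]],
    pad_token_id: int,
    max_length: int,
) -> Dict[str, List[List[int]]]:
    """Hypothetical helper: truncate, right-pad, and build masks and labels.

    `examples` is a list of token-id sequences (assumed already tokenized).
    """
    # Truncation: drop any tokens beyond max_length.
    truncated = [ids[:max_length] for ids in examples]
    # Padding target: the longest sequence left in this batch.
    width = max(len(ids) for ids in truncated)

    batch: Dict[str, List[List[int]]] = {
        "input_ids": [],
        "attention_mask": [],
        "labels": [],
    }
    for ids in truncated:
        n_pad = width - len(ids)
        batch["input_ids"].append(ids + [pad_token_id] * n_pad)
        # 1 marks a real token, 0 marks padding.
        batch["attention_mask"].append([1] * len(ids) + [0] * n_pad)
        # Padding positions get IGNORE_INDEX so they add nothing to the loss.
        batch["labels"].append(ids + [IGNORE_INDEX] * n_pad)
    return batch


batch = collate_sft_batch([[5, 6, 7, 8, 9], [3, 4]], pad_token_id=0, max_length=4)
print(batch["input_ids"])  # [[5, 6, 7, 8], [3, 4, 0, 0]]
print(batch["labels"])     # [[5, 6, 7, 8], [3, 4, -100, -100]]
```

A task-specific collator goes further than this sketch, for example by masking prompt tokens in the labels for SFT or by pairing chosen and rejected responses for DPO, which is the part a generic data loader leaves to the user.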