v0.7.0: NeMo PPO, PEFT Migration, and Fixes

@jon-tow jon-tow released this 23 Jun 22:21

The v0.7.0 release includes several new features, bug fixes, and overall improvements to the codebase. Here are the key changes:

🐠 NeMo PPO and SFT support

This release introduces NeMo-backed PPO and SFT implementations, adding new capabilities and improving system performance in large-scale training.

🦆 PEFT Migration

trlx now supports parameter-efficient tuning methods via the peft library, which we hope will provide greater access to RLHF training in low-resource settings.
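To give a rough sense of why parameter-efficient methods help in low-resource settings, here is a minimal sketch that counts trainable parameters for LoRA, one of the methods available in peft, on a single weight matrix. It uses no trlx or peft calls; the helper names, the layer size, and the rank are illustrative assumptions, not values taken from this release.

```python
def full_params(d_in: int, d_out: int) -> int:
    """Parameters updated when fine-tuning a dense d_out x d_in weight directly."""
    return d_in * d_out


def lora_trainable_params(d_in: int, d_out: int, rank: int) -> int:
    """Parameters in the two low-rank LoRA factors: A (rank x d_in) and B (d_out x rank)."""
    return rank * (d_in + d_out)


# Hypothetical sizes chosen for illustration only.
hidden = 4096  # assumed width of a square projection matrix
rank = 8       # assumed LoRA rank

full = full_params(hidden, hidden)
lora = lora_trainable_params(hidden, hidden, rank)
fraction = lora / full

print(f"full fine-tuning: {full:,} trainable parameters")
print(f"LoRA (rank {rank}): {lora:,} trainable parameters")
print(f"fraction trained: {fraction:.4%}")
```

Under these assumed sizes, the low-rank factors hold well under 1% of the parameters of the original matrix, which is where the savings in optimizer state and gradient memory come from.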

Fixes and more!

New Contributors

Full Changelog: v0.6.0...v0.7.0