Atropos RL Pipeline
AI/ML
Reinforcement learning training pipeline, integrated into the Hermes ecosystem, for fine-tuning and aligning models using RLHF (reinforcement learning from human feedback) and RLAIF (reinforcement learning from AI feedback) workflows.
Features
- RL training
- RLHF/RLAIF
- Batch processing
- Model fine-tuning