Atropos RL Pipeline
AI/ML
Reinforcement learning training pipeline by Nous Research. Fine-tune and align models with RLHF/RLAIF workflows integrated into the Hermes ecosystem.
Features
- ✓RL training
- ✓RLHF/RLAIF
- ✓Batch processing
- ✓Model fine-tuning
Reinforcement learning training pipeline by Nous Research. Fine-tune and align models with RLHF/RLAIF workflows integrated into the Hermes ecosystem.