Rocket introduces a self-play reinforcement learning (RL) framework for automated hyperparameter optimization that handles mixed hyperparameter types without requiring priors. It scales to large datasets via reward approximation, achieving expert-level performance while cutting time and cost in real-world deployments.
- Zhanzhan Cheng
- Yuyi Cheng
- Shuigeng Zhou