Molecular design with data-driven generative models is prone to reward hacking in multi-objective settings. Here, the authors propose a framework that automatically adjusts the reliability level for each objective in order to design promising molecules.
- Tatsuya Yoshizawa
- Shoichi Ishida
- Kei Terayama