The Role of Human-in-the-Loop Preferences in Reward Function Learning for Humanoid Tasks
Table of Links
Abstract and Introduction
Related Work
Problem Definition
Method
Experiments
Conclusion and References
\
A. Appendix
A.1. Full Prompts and A.2 ICPL Details
A. 3 Baseline Details
A.4...