Publications

Publications by category in reverse chronological order. Generated by jekyll-scholar.

2024

  1. MHFAIA
    Comparing Comparisons: Informative and Easy Human Feedback with Distinguishability Queries
    Xuening Feng, Zhaohui Jiang, Timo Kaufmann, and 3 more authors
    In ICML 2024 Workshop on Models of Human Feedback for AI Alignment (MHFAIA), 2024
  2. MHFAIA
    Relatively Rational: Learning Utilities and Rationalities Jointly from Pairwise Preferences
    Taku Yamagata, Tobias Oberkofler, Timo Kaufmann, and 3 more authors
    In ICML 2024 Workshop on Models of Human Feedback for AI Alignment (MHFAIA), 2024
  3. arXiv
    Inverse Constitutional AI: Compressing Preferences into Principles
    Arduin Findeis, Timo Kaufmann, Eyke Hüllermeier, and 2 more authors
    2024
  4. RLBRew
    OCALM: Object-Centric Assessment with Language Models
    Timo Kaufmann, Jannis Blüml, Quentin Delfosse, and 2 more authors
    In ICML 2024 Workshop on Reinforcement Learning Beyond Rewards (RLBRew), 2024

2023

  1. arXiv
    A Survey of Reinforcement Learning from Human Feedback
    Timo Kaufmann, Paul Weng, Viktor Bengs, and 1 more author
    2023
  2. HLDM
    On the Challenges and Practices of Reinforcement Learning from Real Human Feedback
    Timo Kaufmann, Sarah Ball, Jacob Beck, and 2 more authors
    2023
    Accepted at the ECML-PKDD HLDM’23 Workshop.
  3. Reinforcement Learning from Human Feedback for Cyber-Physical Systems: On the Potential of Self-Supervised Pretraining
    Timo Kaufmann, Viktor Bengs, and Eyke Hüllermeier
    In Proceedings of the International Conference on Machine Learning for Cyber-Physical Systems (ML4CPS), 2023