arxiv:2408.16961

The Future of Open Human Feedback

Published on Aug 15


Authors: Ben Burtenshaw, Ramon Fernandez Astudillo, Cailean Osborne, Mimansa Jaiswal, Wenting Zhao, Mikhail Yurochkin, Atoosa Kasirzadeh, Yangsibo Huang, Tatsunori Hashimoto, Yacine Jernite, Daniel Vila-Suero, Jennifer Ding, Sara Hooker, Leshem Choshen

Abstract

Human feedback on conversations with large language models (LLMs) is central to how these systems learn about the world, improve their capabilities, and are steered toward desirable and safe behaviors. However, this feedback is mostly collected by frontier AI labs and kept behind closed doors. In this work, we bring together interdisciplinary experts to assess the opportunities and challenges to realizing an open ecosystem of human feedback for AI. We first look for successful practices in peer production, open source, and citizen science communities. We then characterize the main challenges for open human feedback. For each, we survey current approaches and offer recommendations. We end by envisioning the components needed to underpin a sustainable and open human feedback ecosystem. At the center of this ecosystem are mutually beneficial feedback loops between users and specialized models, which incentivize a diverse stakeholder community of model trainers and feedback providers to support a general open feedback pool.


Community

borgr

Paper author 14 days ago

This was such an exciting collaboration. I learnt so much from the discussions, as well as from the experts in fields I am not an expert in.
This piece is dense with their thoughts (and mine), and of course the next step is to make all of it true, so
AMA.