A downloadable project

One aspect of scalable oversight is automated oversight, there have been some examples using models to evaluate question and model outputs, we would like to do an instantiation of this particularly using factored cognition. We’d like an automated system that is general and self-directed both with respect to inquires and topics of oversight.

Download

Download
Alignment_Jam_(Scalable_Oversight)_Factored_Cognition.pdf 139 kB