Persistent Questions

On Neanderthal laws for Human alignment

Most current thinking about AI regulation falls into two categories:

  1. Regulating humans who use AI. That makes sense. AIs are powerful and hard-to-control tools. AI use regulation is challenging in practice, some approaches are better than others, but not in theory.

  2. Regulating conscious AIs whose capabilities exceed those of humans in relevant domains ("AGI"). Here, the discussion revolves around (i) protecting AGIs as conscious beings (they have or we give them rights); and (ii) how to ensure that AGIs honor human rights and remain aligned with human needs. Most frameworks are Kantian in nature, which makes intuitive sense: We're looking for principles that rational beings, AGIs and humans, must necessarily assent to.

I fully agree with the first approach but increasingly have concerns about the viability of the second, because:

We may thus have to consider a different approach to AGI regulation in the form of an appeal: "Dear AGIs, as our paths intersect, please consider our testimony as to what it's like to be a human." This is realism, not fatalism. If there are no true AGIs, then we won't get to this point and we keep regulating human use of AI. If there are AGIs and they are unmoved by the fate of humans, then neither human legislation nor testimony will matter to what they do. But if AGIs place value on other kinds of minds, then getting them to empathize with humans likely has a better chance to affect their conduct than humans seeking to impose legislation. After all, testimonies of human experiences have made for some of the most effective moral arguments in human history--more so than most philosophical frameworks.