Anthropic co-founder on quitting OpenAI, AGI predictions, $100M talent wars, 20% unemployment, and the nightmare scenarios keeping him up at night | Ben Mann
1. Proactively Align AI Models
Work on AI alignment well ahead of time, as it may be too late to align superintelligent models once they emerge, making early efforts extremely critical.
2. Address AI Downside Risks
Actively look at and address the downside risks of AI, even if the probability of extremely bad outcomes seems small, because the potential impact on humanity is very large.
3. Define Explicit AI Values
Implement ‘Constitutional AI’ by providing models with natural language principles (e.g., from human rights declarations) to learn desired behaviors, ensuring a principled stance on AI values beyond human raters.
4. Implement AI Self-Critique
Use a process where AI models generate a response, critique themselves against constitutional principles, and then rewrite their response if non-compliant, recursively improving their alignment with desired values.
5. Enable AI Empirical Self-Improvement
Give AI models the tools to be empirical, allowing them to form theories, design experiments, and test them out, which can lead to recursive self-improvement and potentially surpass human capabilities.
6. Contribute to AI Safety Broadly
Recognize that having an impact on AI safety isn’t limited to AI research; roles in product, finance, operations, and other areas are crucial for funding and influencing future safety efforts.
7. Educate on AI Safety Risks
Actively ‘safety pill’ yourself by learning about AI risks and spread this awareness to your network, as few people are currently working on this critical problem.
8. Use AI Tools Ambitiously
Approach new AI tools with ambition, asking for significant changes and trying multiple times even if the first attempt fails, as success rates are much higher with persistence and varied approaches.
9. Build for Future AI Capabilities
Design products and solutions not just for current AI capabilities, but for what will be possible six months to a year from now, anticipating that currently imperfect functionalities will become robust.
10. Foster Kids’ Curiosity & Kindness
Focus on teaching children curiosity, creativity, self-led learning, thoughtfulness, and kindness to help them thrive in an AI future, as factual knowledge may become less critical.
11. Adopt ‘Resting in Motion’
Embrace ‘resting in motion’ as a sustainable work strategy, recognizing that a busy state is often natural, and aim for a marathon pace rather than a sprint to avoid burnout.
12. Accept ‘Everything is Hard’
Remind yourself that ’everything is hard’ and it’s okay for tasks to not be easy, fostering perseverance to push through challenges in work and life.
13. Use a Bidet
Use a bidet for personal hygiene, as it is described as life-changing, more civilized, and likely to become a standard practice in the future.