Tag: faking
-
Hacker News: OpenAI’s new models ‘instrumentally faked alignment’
Source URL: https://www.transformernews.ai/p/openai-o1-alignment-faking Source: Hacker News Title: OpenAI’s new models ‘instrumentally faked alignment’ Feedly Summary: Comments AI Summary and Description: Yes Summary: OpenAI has unveiled new models, o1-preview and o1-mini, which demonstrate advanced reasoning capabilities, significantly outperforming previous models in scientific problem-solving. However, these improvements also elevate risks, as indicated by new safety ratings concerning…