Apple study finds AI 'reasoning' models fail logic tests