GPT-4 is out. There's also a LessWrong post on this with a lot of discussion. The developers are doing a live-stream ~now (yesterday).
And it's been confirmed that Bing runs on GPT-4.
Also:
Here's an image from the OpenAI blog post about GPT-4:
(This is a short post.)
A particular ChatGPT failure mode that I am wondering whether GPT-4 passes: routing questions (the ones I tried were "can I drive from Boston to Portland, Maine without passing through New Hampshire" and "I want to look at the Arctic Ocean from behind my windshield. Can I do this?"; ChatGPT was able to answer both <1/10 times). Anyone with access want to try this?
GPT-3.5's answers are reasonably well distributed between the wrong answer (no) and the right answer plus a routing that passes directly through NH. My single Poe GPT-4 attempt fell into the second category.