ChatGPT broke the Turing test — the race is on for new ways to assess AI
ChatGPT broke the Turing test — the race is on for new ways to assess AI
www.nature.com ChatGPT broke the Turing test — the race is on for new ways to assess AI
Large language models mimic human chatter, but scientists disagree on their ability to reason.

You're viewing a single thread.
All comments
199
comments
How does ChatGPT do with the Winograd schema? That's a lot harder to fake: https://en.m.wikipedia.org/wiki/Winograd_schema_challenge
4 0 ReplyI dont remember the numbers but iirc it was covered by one of the validation datasets and GPT 4 did quite well on it
2 0 ReplyYeah, but did it do well on the specific examples from the Winograd paper? Because ChatGPT probably just learned those since they are well known and oft repeatef. Or does it do well on brand new sentences made according to the Winograd scheme?
1 0 Reply
199
comments
Scroll to top