Cheng was part of a research team that compared two datasets, each comprising personal advice: one dataset written by humans responding to real-world situations and the second dataset consisting of judgments made by LLMs in response to posts on Reddit's AITA ("Am I the A**hole?") advice forum.
immediate deal-breaker