Emergent Analogical Reasoning in Large Language Models

Emergent Analogical Reasoning in Large Language Models


Taylor Webb, Keith J. Holyoak, Hongjing Lu, arXiv, Jan 04, 2023


What’s really interesting to me about GPT-3 and other large language models (LLM) is that they are not programmed with rules or categories, but instead create them out of the data they’re given. As this paper (27 page PDF) argues, "GPT-3 appears to display an emergent ability to reason by analogy, matching or surpassing human performance across a wide range of problem types." The authors continue, "The deep question that now arises is how GPT-3 achieves the analogical capacity that is often considered the core of human intelligence." Now many of the criticisms of LLM point to errors in these pattern recognition capabilities. They sometimes get basic facts wrong, and don’t seem to (yet) understand what types of things some things are. But as the authors write, "regardless of the extent to which GPT-3 employs human-like mechanisms to perform analogical reasoning, we can be certain that it did not acquire these mechanisms in a human-like manner." We don’t actually teach an LLM the way we would, say, a child. But suppose we did…

Web: [Direct Link] [This Post]


via Stephen’s Web ~ OLDaily http://www.downes.ca/

January 4, 2023 at 06:45PM