En skeptisk syn på Artificiell Generell Intelligens (AGI)

Hur tusan utvärderar man om en maskin är intelligent? Tong-testet visar sig vara en intressant variant.

Turing-testet känner de flesta till. Det går ut på se ifall en maskin kan lura en människa att den pratar med en människa, lite raljant formulerat. Men det säger inte så mycket om maskinen faktiskt är intelligent, självmedveten eller klok. Bara att den åtminstone är en god nog imitatör att den kan övertyga människor.

I det senaste avsnittet av The Skeptics Guide to the Universe, cirka trekvart in, när de på sitt sedvanliga vis bearbetade vetenskapsnyheter, men Tong-testet som jag inte tidigare hört talas om. Ett sätt att inte fastna i en filosofisk zombie utan något mer meningsfullt.

Tong-testet

Skrivelsens abstrakt:

“The release of the generative pre-trained transformer (GPT) series has brought artificial general intelligence (AGI) to the forefront of the artificial intelligence (AI) field once again. However, the questions of how to define and evaluate AGI remain unclear. This perspective article proposes that the evaluation of AGI should be rooted in dynamic embodied physical and social interactions (DEPSI). More specifically, we propose five critical characteristics to be considered as AGI benchmarks and suggest the Tong test as an AGI evaluation system. The Tong test describes a value- and ability-oriented testing system that delineates five levels of AGI milestones through a virtual environment with DEPSI, allowing for infinite task generation. We contrast the Tong test with classical AI testing systems in terms of various aspects and propose a systematic evaluation system to promote standardized, quantitative, and objective benchmarks and evaluation of AGI.”
– The Tong Test: Evaluating Artificial General Intelligence Through Dynamic Embodied Physical and Social Interactions