Measuring an Artificial Intelligence System's Performance on a Verbal IQ Test For Young Children

09/11/2015
by Stellan Ohlsson, et al.

We administered the Verbal IQ (VIQ) part of the Wechsler Preschool and Primary Scale of Intelligence (WPPSI-III) to the ConceptNet 4 AI system. The test questions (e.g., "Why do we shake hands?") were translated into ConceptNet 4 inputs using a combination of the simple natural language processing tools that come with ConceptNet and short Python programs that we wrote. The question answering used a version of ConceptNet based on spectral methods. ConceptNet achieved a WPPSI-III VIQ score that is average for a four-year-old child but below average for five- to seven-year-olds. Large variations among subtests indicate potential areas of improvement. In particular, results were strongest for the Vocabulary and Similarities subtests, intermediate for the Information subtest, and lowest for the Comprehension and Word Reasoning subtests. Comprehension is the subtest most strongly associated with common sense. The large variations among subtests, together with ordinary common sense, strongly suggest that the WPPSI-III VIQ results do not show that "ConceptNet has the verbal abilities of a four-year-old." Rather, children's IQ tests offer one objective metric for the evaluation and comparison of AI systems. This work also continues previous research on Psychometric AI.


