b'Ernest Davis'

research

∙ 08/10/2023

Testing GPT-4 with Wolfram Alpha and Code Interpreter plug-ins on math and science problems

This report describes a test of the large language model GPT-4 with the ...

0 Ernest Davis, et al. ∙

research

∙ 02/09/2023

Benchmarks for Automated Commonsense Reasoning: A Survey

More than one hundred benchmarks have been developed to test the commons...

0 Ernest Davis, et al. ∙

research

∙ 01/23/2023

Mathematics, word problems, common sense, and artificial intelligence

The paper discusses the capacities and limitations of current artificial...

0 Ernest Davis, et al. ∙

research

∙ 08/14/2022

Limits of an AI program for solving college math problems

Drori et al. (2022) report that "A neural network solves, explains, and ...

0 Ernest Davis, et al. ∙

research

∙ 04/25/2022

A very preliminary analysis of DALL-E 2

The DALL-E 2 system generates original synthetic images corresponding to...

47 Gary Marcus, et al. ∙

research

∙ 04/03/2022

Pragmatic constraints and pronoun reference disambiguation: the possible and the impossible

Pronoun disambiguation in understanding text and discourse often require...

0 Ernest Davis, et al. ∙

research

∙ 01/22/2022

Physical Reasoning in an Open World

Most work on physical reasoning, both in artificial intelligence and in ...

0 Zhuoran Zeng, et al. ∙

research

∙ 01/07/2022

The Defeat of the Winograd Schema Challenge

The Winograd Schema Challenge – a set of twin sentences involving pronou...

4 Vid Kocijan, et al. ∙

research

∙ 12/08/2021

Deep Learning and Mathematical Intuition: A Review of (Davies et al. 2021)

A recent paper by Davies et al (2021) describes how deep learning (DL) t...

0 Ernest Davis, et al. ∙

research

∙ 05/24/2021

A Flawed Dataset for Symbolic Equation Verification

Arabshahi, Singh, and Anandkumar (2018) propose a method for creating a ...

0 Ernest Davis, et al. ∙

research

∙ 08/01/2020

The test set for the TransCoder system

The TransCoder system translates source code between Java, C++, and Pyth...

0 Ernest Davis, et al. ∙

research

∙ 04/23/2020

A Review of Winograd Schema Challenge Datasets and Approaches

The Winograd Schema Challenge is both a commonsense reasoning and natura...

5 Vid Kocijan, et al. ∙

research

∙ 12/12/2019

The Use of Deep Learning for Symbolic Integration: A Review of (Lample and Charton, 2019)

Lample and Charton (2019) describe a system that uses deep learning tech...

0 Ernest Davis, et al. ∙

research

∙ 08/05/2016

Winograd Schemas and Machine Translation

A Winograd schema is a pair of sentences that differ in a single word an...

0 Ernest Davis, et al. ∙

research

∙ 06/16/2015

The Scope and Limits of Simulation in Cognitive Models

It has been proposed that human physical reasoning consists largely of r...

0 Ernest Davis, et al. ∙

research

∙ 11/06/2014

The Limitations of Standardized Science Tests as Benchmarks for Artificial Intelligence Research: Position Paper

In this position paper, I argue that standardized tests for elementary s...

0 Ernest Davis, et al. ∙

Ernest Davis

Featured Co-authors

Sign in with Google

Consider DeepAI Pro