Emergence of Concepts in DNNs?

11/11/2022
by   Tim Räz, et al.
0

The present paper reviews and discusses work from computer science that proposes to identify concepts in internal representations (hidden layers) of DNNs. It is examined, first, how existing methods actually identify concepts that are supposedly represented in DNNs. Second, it is discussed how conceptual spaces – sets of concepts in internal representations – are shaped by a tradeoff between predictive accuracy and compression. These issues are critically examined by drawing on philosophy. While there is evidence that DNNs able to represent non-trivial inferential relations between concepts, our ability to identify concepts is severely limited.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset