
Comprehensive assessment methods are key to progress in deep learning

Published online by Cambridge University Press:  06 December 2023

Michael W. Spratling
Affiliation: Department of Informatics, King's College London, London, UK
michael.spratling@kcl.ac.uk
https://nms.kcl.ac.uk/michael.spratling/

Abstract

Bowers et al. eloquently describe issues with current deep neural network (DNN) models of vision, claiming that there are deficits both with the methods of assessment and with the models themselves. I agree with both claims, but propose a different recipe from the one outlined in the target article for overcoming these issues.

Type: Open Peer Commentary
Copyright © The Author(s), 2023. Published by Cambridge University Press

