
József Fiser, Ph.D.
Assistant Professor of Psychology

Visual information processing and learning
Ph.D., University of Southern California

Contact information
781-736-3253

Lab Website

My research focuses on the acquisition of structured visual information and the conversion of this information into sophisticated internal representations for controlling behavior. We use an integrated approach with three main components: human visual and learning experiments, computational modeling of learning, and multi-electrode recording from behaving animals. The recurring theme of our work is the pursuit of a statistically based and biologically sound framework that links low-level visual mechanisms (e.g., adaptation) with the development and learning of higher-level complex features and constancies for efficient visual representations of objects and scenes.

During development, humans and animals learn to understand their visual environment based on their sensory experience. Despite decades of research, it is still not clear what representations the brain uses in this process and how it acquires them. We follow a systematic research program to clarify these issues. Recently, we have conducted a series of adult and infant experiments showing that, from a very early age, humans possess a fundamental ability to automatically extract the statistical regularities of unfamiliar visual scenes in both time and space. We argue that this basic ability is key to the formation of visual representations, from the simplest level of luminance changes to the level of conscious memory traces. Currently, we are investigating the interaction between this learning ability and the perceptual constraints (arising from, e.g., eye movements, clutter, occlusion, and the hierarchical embeddedness of features) that make such learning feasible. Using fMRI, we have also identified the brain structures involved in this learning and made predictions about the nature of the process.

Our computational modeling work interprets our experimental data in a Bayesian framework. We have demonstrated that learning based on selection among generative statistical models captures well the human behavior observed in our experiments. This suggests that humans interpret their sensory input through an "unconscious inference" process that follows precisely the statistical structure of the environment but aims at the simplest possible internal description of the input. We have shown that this framework gives a statistically based interpretation of empirical Gestalt rules and chunking, and that it provides a tightly coupled explanation for visual recognition and visual learning.

The Bayesian framework requires a continuous reciprocal interaction between groups of elements at different levels of the hierarchical representation encoded in the brain. This dynamic collective coding contrasts with the traditional feedforward view of how visual information is processed in the cortex. We have shown that, both in primary visual cortex and in higher areas, the representation of visual information is best described as the activity pattern of cell assemblies rather than as a set of individual feature detectors. We have also shown that the precise developmental pattern and the correlational structure of cell responses in the primary visual cortex call into question the notion that ongoing cortical activity is accidental noise unrelated to visual coding. Instead, we suggest that ongoing activity is the manifestation of internal states of the brain that express knowledge of the world relevant for perception, and that sensory input only modulates these states. This view supports Hebb's original notion that internal dynamical states are crucial for integrating cognitive processes beyond simple stimulus-response associations, and it can potentially close the gap between response functions and behavior.

Representative publications:

Orbán, G., Fiser, J., Aslin, R.N., & Lengyel, M. (2008). Bayesian learning of visual chunks by human observers. Proceedings of the National Academy of Sciences, 105: 2745-2750.

Fiser, J., Scholl, B.J., & Aslin, R.N. (2007). Perceived object trajectories during occlusion constrain visual statistical learning. Psychonomic Bulletin & Review, 14: 173-178.

Fiser, J., & Aslin, R.N. (2005). Encoding multi-element scenes: Statistical learning of visual feature hierarchies. Journal of Experimental Psychology: General, 134: 521-537.

Aslin, R.N., & Fiser, J. (2005). Methodological challenges for understanding cognitive development in infants. Trends in Cognitive Sciences, 9: 92-98.

Fiser, J., Chiu, C., & Weliky, M. (2004). Small modulation of ongoing cortical dynamics by sensory input during natural vision. Nature, 431: 573-578.

Fiser, J., Bex, P.J., & Makous, W.L. (2003). Contrast conservation in human vision. Vision Research, 43: 2637-2648.

Weliky, M., Fiser, J., Hunt, H.R., & Wagner, D.N. (2003). Coding of natural scenes in primary visual cortex. Neuron, 37: 703-718.

Fiser, J., & Aslin, R.N. (2002). Statistical learning of new visual feature combinations by infants. Proceedings of the National Academy of Sciences, 99: 15822-15826.

Fiser, J., & Aslin, R.N. (2002). Statistical learning of higher-order temporal structure from visual shape sequences. Journal of Experimental Psychology: Learning, Memory, and Cognition, 28: 458-467.

Fiser, J., & Aslin, R.N. (2001). Unsupervised statistical learning of higher-order spatial structures from visual scenes. Psychological Science, 12: 499-504.

Fiser, J., Subramaniam, S., & Biederman, I. (2001). Size tuning in the absence of spatial frequency tuning in object recognition. Vision Research, 41: 1931-1950.

Atkins, J., Fiser, J., & Jacobs, R.A. (2001). Experience-dependent visual cue integration based on consistencies between visual and haptic percepts. Vision Research, 41: 449-461.

Fiser, J., & Biederman, I. (2001). Invariance of long-term visual priming to scale, reflection, translation and hemisphere. Vision Research, 41: 221-234.

Mel, B.W., & Fiser, J. (2000). Minimizing binding errors using learned conjunctive features. Neural Computation, 12: 731-762.

Dobbins, A.C., Jeo, R.M., Fiser, J., & Allman, J.M. (1998). Distance modulation of neural activity in the visual cortex. Science, 281: 552-555.

 


Last review: August 27, 2008.