My research focuses on the acquisition of structured visual
information and the conversion of this information into
sophisticated internal representations for controlling behavior.
We use an integrated approach with three main components,
human visual and learning experiments, computational modeling
of learning, and multi-electrode recording from behaving
animals. The recurrent theme of our work is the pursuit
of a statistically based and biologically sound framework
to link low-level visual mechanisms (e.g., adaptation) with
the development and learning of higher level complex features
and constancies for efficient visual representations of
objects and scenes.
During development, humans and animals learn to understand
their visual environment based on their sensory experience.
Despite decades of research, it is still not clear what
representations the brain uses in this process and how it
acquires them. We follow a systematic research program to
clarify these issues. Recently, we have conducted a series
of adult and infant experiments showing that humans possess
a fundamental ability to extract statistical regularities
of unknown visual scenes automatically both in time and
space from a very early age. We argue that this basic ability
is key in the formation of visual representations from the
simplest levels of luminance changes to the level of conscious
memory traces. Currently we are in the process of investigating
the interaction between this learning ability and various
perceptual constraints due to e.g., eye movements, clutter,
occlusion, and the hierarchical embeddedness of features,
that make such learning feasible. Using fMRI, we have also
identified the brain structures involved in this learning
and made predictions about the nature of the process.
Our computational modeling work interprets our experimental
data in a Bayesian framework. We have demonstrated that
generative statistical model selection learning can well
capture human behavior observed in our experiments. This
suggests that humans interpret their sensory input through
an "unconscious inference" process that follows precisely
the statistical structure of the environment but aims at
the simplest possible internal description of the input.
We have shown that this framework gives a statistically
based interpretation of empirical Gestalt rules and chunking
as well as provides a tightly coupled explanation for visual
recognition and visual learning.
The Bayesian framework requires a continuous reciprocal
interaction between groups of elements at different levels
of the hierarchical representation encoded in the brain.
This dynamic collective coding is in contrast with the traditional
feed forward view of how visual information is processed
in the cortex. We have shown that both at the level of primary
visual cortex and at higher areas the representation of
visual information is best described as the activity pattern
of cell assemblies rather than a set of individual feature
detectors. We have also shown that the precise developmental
pattern and the correlational structure of cell responses
in the primary visual cortex calls in question the notion
that ongoing cortical activity is accidental noise unrelated
to visual coding. Instead, we suggest that ongoing activity
is the manifestation of internal states of the brain that
expresses relevant knowledge of the world for perception,
and sensory input only modulates these states. This view
supports Hebb's original notion of internal dynamical states
being crucial for integrating cognitive processes beyond
simple stimulus-response associations, and it can potentially
close the gap between response functions and behavior.
Representative publications:
Fiser, J., & Aslin, R.N. (2005). Encoding multi-element
scenes: Statistical learning of visual feature hierarchies.
Journal of Experimental Psychology: General, 134:
521-537. [abstract]
Aslin, R. N., & Fiser, J. (2005). Methodological challenges
for understanding cognitive development in infants. Trends
in Cognitive Sciences, 9: 92-98. [abstract]
Fiser, J., Chiu, C., & Weliky, M. (2004). Small modulation
of ongoing cortical dynamics by sensory input during natural
vision. Nature, 431: 573-578. [abstract]
Fiser, J., Bex, P.J., & Makous, W.L. (2003). Contrast conservation
in human vision. Vision Research, 43: 2637-2648.
[abstract]
Weliky, M., Fiser, J., Hunt, H.R., & Wagner, D.N. (2003).
Coding of natural scenes in primary visual cortex. Neuron,
37: 703-718. [abstract]
Fiser, J., & Aslin, R.N. (2002). Statistical learning of
new visual feature combinations by infants. Proceedings
of the National Academy of Sciences, 99: 15822-15826.
[abstract]
Fiser, J., & Aslin, R.N. (2002). Statistical learning of
higher-order temporal structure from visual shape sequences.
Journal of Experimental Psychology: Learning Memory & Cognition,
28, pp. 458-467. [abstract]
Fiser, J., & Aslin, R.N. (2001). Unsupervised statistical
learning of higher-order spatial structures from visual
scenes. Psychological Science, 12: 499-504. [abstract]
Fiser, J., Subramaniam, S. and Biederman, I. (2001). Size
tuning in the absence of spatial frequency tuning in object
recognition. Vision Research, 41: 1931-1950. [abstract]
Atkins, J., Fiser, J. and Jacobs, R.A. (2001). Experience-dependent
visual cue integration based on consistencies between visual
and haptic percepts. Vision Research, 41: 449-461.
[abstract]
Fiser, J. & Biederman, I. (2001). Invariance of long-term
visual priming to scale, reflection, translation and hemisphere.
Vision Research, 41: 221-234. [abstract]
Mel, B.W. & Fiser, J. (2000). Minimizing binding errors
using learned conjunctive features. Neural Computation,
12: 731-762. [abstract]
Dobbins, A.C., Jeo, R.M., Fiser, J. and Allman, J.M. (1998).
Distance modulation of neural activity in the visual cortex.
Science, 281: 552-555. [abstract]
Last review: February 21, 2007. E-mail comments
or questions to the webmaster.