Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Feb 13;15(1):5319.
doi: 10.1038/s41598-025-89732-4.

Coordinating multiple mental faculties during learning

Affiliations

Coordinating multiple mental faculties during learning

Xiaoliang Luo et al. Sci Rep. .

Abstract

Complex behavior is supported by the coordination of multiple brain regions. How do brain regions coordinate absent a homunculus? We propose coordination is achieved by a controller-peripheral architecture in which peripherals (e.g., the ventral visual stream) aim to supply needed inputs to their controllers (e.g., the hippocampus and prefrontal cortex) while expending minimal resources. We developed a formal model within this framework to address how multiple brain regions coordinate to support rapid learning from a few example images. The model captured how higher-level activity in the controller shaped lower-level visual representations, affecting their precision and sparsity in a manner that paralleled brain measures. In particular, the peripheral encoded visual information to the extent needed to support the smooth operation of the controller. Alternative models optimized by gradient descent irrespective of architectural constraints could not account for human behavior or brain responses, and, typical of standard deep learning approaches, were unstable trial-by-trial learners. While previous work offered accounts of specific faculties, such as perception, attention, and learning, the controller-peripheral approach is a step toward addressing next generation questions concerning how multiple faculties coordinate.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
The controller-peripheral architecture provides a general framework for how different brain regions coordinate while performing a task. (A) Peripherals aim to supply their controllers with the information they require while expending minimal resources (i.e., costly energy principle). Here, we illustrate a number of possible arrangements of controllers and peripherals. (ii) A single controller with multiple peripherals could offer an account of multi-modal integration for the convergence of visual and somatosensory signals in parietal cortex or semantic hubs in the anterior temporal lobe. (iii) Conversely, multiple controllers with a single peripheral could model eye movements in which multiple controllers related to visual search, obstacle avoidance, social cognition, etc. share this perceptual resource. (iv) The controller and peripheral can share reciprocal connections, allowing each to serve both roles interchangeably. This arrangement may be particularly useful for modeling the interaction between the prefrontal cortex (PFC) and premotor cortex in movement execution and recalibration. Controllers and peripherals can be arranged hierarchically as in (v). This arrangement is consistent with hierarchical accounts of the ventral visual stream in object recognition. (B) We use the controller-peripheral architecture to develop a model that can learn concepts from a few visual examples. To simplify, we assume a single controller involving the hippocampus and ventromedial prefrontal cortex (vmPFC) and a single peripheral involving the ventral visual stream. The model captures how higher-level goals and outcomes shape activity throughout the ventral visual stream, which aims to provide its controller with needed information while minimizing resource expenditure (i.e., the costly energy principle).
Fig. 2
Fig. 2
The controller-peripheral framework consisting of a clustering module capturing HPC and PFC and a DNN module capturing ventral visual stream, captures human category learning behavior in. (A) Six category learning tasks where human participants learnt to classify geometric shapes into one of two categories where stimuli were made up of three binary-valued features (color, shape and size). Typically, models operate over hand-coded three-dimensional vectors. Instead, we trained on actual images, in this case using the insect stimuli from, a replication of these classic learning problems with image stimulus inputs. The peripheral part of the model, reflecting the ventral visual stream, extracts these three higher-level dimensions for the controller (Fig. 1B). (B) The model fits on the left were from’s replication of the six categorization tasks using simple stimuli as shown in Fig. 2A. Our model (right) captured the same difficulty ordering of the six problems using image stimuli from with equivalent problem structure. Probability of error is plotted as a function of learning block for each problem type. (C) The controller exhibited the same attention strategies as SUSTAIN, solving Type I by attending to one dimension, Type II by attending to two dimensions and Type III–VI by attending to all three dimensions.
Fig. 3
Fig. 3
(A) A whole-brain voxel-wise linear mixed effects regression was performed in, which revealed a vmPFC region that showed a significant interaction between learning block and problem complexity. Neural compression increased over learning blocks and was higher for learning problems with fewer relevant dimensions (each fMRI run consists of four learning blocks; see the original paper for more details). (B) Functional correspondence between the clustering module of the controller-peripheral system and vmPFC in the human brain. The clustering module deploys attention strategies (in terms of attention compression) that track the degree of neural compression in vmPFC across category learning tasks over learning across category structure complexity.
Fig. 4
Fig. 4
Performance of the DNN peripheral and its relation to LOC activity during learning. (A) Following the controller’s needs and the costly energy principle, task-relevant features (shaded) are more precisely coded than task-irrelevant features (unshaded). (B) The error-rate for a classifier applied to LOC activity to discriminate (decode) between pairs of stimuli mirrored the precision of the peripheral’s feature outputs, consistent with our claim that the peripheral’s advanced layers correspond to LOC. (C) Following the costly energy principle, the fewer relevant features for a learning problem (VI>II>I), the more zero-valued peripheral attention weights there are.

References

    1. Locke, J. An essay concerning human understanding. Read. Hist. Psychol.1, 55–68 (1960).
    1. Hume, D. An enquiry concerning human understanding. Essays and treatises on several subjects, Vol 2: Containing An enquiry concerning human understanding, A dissertation on the passions, An enquiry concerning the principles of morals, and The natural history of religion. 3–212 (1779). /record/2008-16196-001.
    1. Kieras, D. E. An overview of the EPIC architecture for cognition and performance with application to human-computer interaction. Human-Comput. Interact.12, 391–438 (1997) (/record/1998-00672-004.).
    1. Young, R. M. & Lewis, R. L. The Soar Cognitive Architecture and Human Working Memory. Models Work. Memory 224–256 (1999). /record/1999-02490-006
    1. Anderson, J. R., Michael, M. & Lebiere, C. ACT-R: A theory of higher level cognition and its relation to visual attention. Human-Comput. Interact.12, 439–462 (1997).

LinkOut - more resources