Chapter 11 Predictive Processing and Active Inference
11.1 Chapter Overview
Predictive Processing and Active Inference are among the most influential frameworks in contemporary cognitive science and theoretical neuroscience. They propose that the brain is not a passive receiver of sensory information, but an active prediction-generating system that continuously attempts to explain the causes of its sensory input [@friston2010; @clark2016].
According to this framework, perception, action, attention, learning, emotion, bodily regulation, selfhood, and perhaps consciousness itself emerge through ongoing attempts to reduce prediction error. Prediction error is the mismatch between what the brain expects and what sensory systems actually receive.
Rather than building conscious perception from raw sensory input alone, Predictive Processing proposes that the brain generates hypotheses about the world and then updates those hypotheses when prediction errors occur. Perception is therefore not passive reception. It is active inference: the brain’s best current interpretation of the causes of sensory input [@helmholtz1867; @rao1999; @friston2005].
This framework became highly influential because it offers a unified way to explain perception, action, attention, learning, hallucination, emotion, embodiment, psychiatric symptoms, and adaptive behaviour. It also connects naturally with Bayesian inference, machine learning, computational neuroscience, robotics, and philosophy of mind [@hohwy2013; @clark2013; @parr2022].
At the same time, major philosophical questions remain. Predictive Processing may explain how organisms model the world and regulate behaviour, but does it fully explain subjective experience? Does prediction-error minimization explain consciousness itself, or does it primarily explain cognition, perception, and adaptive control? These questions connect Predictive Processing directly to the hard problem of consciousness [@chalmers1995; @chalmers1996].
This chapter examines the historical development, conceptual foundations, hierarchical structure, active inference mechanisms, empirical support, philosophical implications, strengths, criticisms, and unresolved questions surrounding Predictive Processing and Active Inference.
11.2 Learning Objectives
After reading this chapter, the reader should be able to:
- Define the central claims of Predictive Processing and Active Inference.
- Explain prediction-error minimization.
- Describe hierarchical predictive processing.
- Distinguish passive perception from predictive perception.
- Explain the relationship between perception, action, and active inference.
- Understand generative models and precision weighting.
- Analyze predictive accounts of hallucinations and altered states.
- Compare Predictive Processing with other theories of consciousness.
- Evaluate strengths and criticisms of the framework.
- Discuss implications for embodiment, selfhood, and artificial intelligence.
11.3 Why Predictive Processing Became Influential
Predictive Processing became influential because it offers a broad framework for connecting many domains that were often studied separately. Perception, action, attention, learning, emotion, bodily regulation, and selfhood can all be interpreted as parts of a single inferential process.
The framework helps explain why perception is shaped by expectation and context. Human beings do not experience the world as a neutral recording of sensory input. What we perceive is influenced by prior knowledge, memory, attention, bodily state, emotional significance, and current goals. Predictive Processing explains this by proposing that perception depends on the interaction between top-down predictions and bottom-up prediction errors.
This approach also helps explain illusions, hallucinations, dreaming, psychosis, psychedelic states, motor control, anxiety, interoception, and adaptive behaviour. In each case, perception and action depend on how the brain balances prior expectations against incoming sensory evidence [@hohwy2013; @clark2016; @seth2021].
Predictive Processing also became influential because it fits naturally with computational neuroscience. It provides a formal framework for understanding perception and action in probabilistic terms. The brain is treated as a system that continuously updates its internal models in response to uncertainty.
11.4 Core Idea in One Picture
Figure @ref(fig:fig-pp) summarizes the major conceptual structure of Predictive Processing and Active Inference.
Figure 11.1: Predictive Processing and Active Inference. Panel A illustrates hierarchical predictive processing and prediction-error minimization. Panel B contrasts passive perception with predictive perception. Panel C illustrates the active inference loop. Panel D applies predictive processing to hallucinations and altered states. Panel E compares predictive processing with other major theories of consciousness.
As Figure @ref(fig:fig-pp) illustrates, Predictive Processing proposes that perception emerges through continuous interaction between top-down predictions and bottom-up prediction errors. The brain does not merely wait for sensory input. It actively anticipates input and updates its internal models when expectations fail.
The figure also highlights one of the framework’s central philosophical implications: perception is constructed, not passively received. However, it also points to a major unresolved question. Prediction and inference may explain perception and adaptive control, but they do not automatically explain why perception is accompanied by subjective experience.
11.5 Historical Development
The intellectual roots of Predictive Processing extend across several traditions. Hermann von Helmholtz argued that perception involves unconscious inference: the mind interprets sensory signals by inferring their likely causes [@helmholtz1867]. Later developments in cybernetics, Bayesian inference, information theory, machine learning, and computational neuroscience provided formal tools for developing this idea.
In the late twentieth and early twenty-first centuries, predictive coding models became increasingly important in neuroscience. Rao and Ballard proposed influential models of predictive coding in visual cortex, where higher levels generate predictions and lower levels return prediction errors [@rao1999].
Karl Friston’s Free Energy Principle then provided a broad mathematical framework for understanding perception, action, and biological self-organization [@friston2005; @friston2010]. According to this framework, biological systems must resist disorder by reducing uncertainty, surprise, or prediction error in relation to their sensory states.
Andy Clark later helped popularize Predictive Processing as a broad theory of perception, cognition, action, and embodied mind [@clark2013; @clark2016]. Jakob Hohwy also developed an influential philosophical interpretation of the predictive mind [@hohwy2013]. Together, these developments made Predictive Processing one of the most important frameworks in contemporary philosophy of mind and cognitive neuroscience.
11.6 The Brain as a Prediction Machine
The central claim of Predictive Processing is that the brain is a prediction machine. It constantly generates expectations about sensory input, bodily conditions, environmental changes, social interactions, and future outcomes.
Incoming sensory signals are compared with these predictions. When there is a mismatch, the system generates prediction error. The brain then attempts to reduce this error by updating its internal model, shifting attention, revising expectations, changing bodily state, or acting on the world.
This framework reverses the traditional picture of perception. Perception is not simply a process in which the world sends information through the senses and the brain builds an internal image. Instead, the brain actively predicts what sensory input should occur and uses error signals to adjust its predictions.
In this sense, perception is controlled hallucination constrained by sensory evidence. The brain generates models of the world, but those models are corrected by incoming signals [@seth2021].
11.7 Generative Models
Predictive Processing proposes that the brain constructs generative models. A generative model is an internal model that attempts to explain how sensory input is generated by causes in the world, body, and environment.
For example, when seeing a moving object, the brain does not simply register changing light patterns. It infers that there is an object, that the object has a shape, that it is moving in space, and that it may continue moving. These inferences depend on prior expectations and sensory evidence.
Generative models allow organisms to anticipate events, reduce uncertainty, guide behaviour, and maintain adaptive contact with the environment. They also explain why perception can be mistaken. If the brain’s model gives too much weight to expectation or too little weight to sensory correction, perception can become distorted.
This is one reason Predictive Processing has become important in accounts of hallucination, dreaming, and altered states. Experience depends not only on sensory input, but on how the brain interprets that input through its generative model.
11.8 Prediction-Error Minimization
Prediction-error minimization is the central computational principle of Predictive Processing. The brain continuously compares predicted sensory input with actual sensory input. The difference between them is prediction error.
Prediction error can be reduced in two main ways. First, the brain can update its internal model so that future predictions better match sensory input. Second, the organism can act on the world so that sensory input better matches predictions. The first process is perception and learning. The second process is active inference.
For example, if a person expects to see their keys on the table but does not see them, the mismatch produces prediction error. The person may update their belief by thinking the keys are elsewhere. Or they may act by moving objects, turning their head, or searching the room to generate new sensory input.
Prediction-error minimization therefore links perception, action, and learning. The organism is not merely representing the world. It is continuously regulating its relation to the world.
11.9 Passive Perception and Predictive Perception
Predictive Processing changes the traditional understanding of perception. A passive model treats perception as a bottom-up process:
world → senses → perception
Predictive Processing proposes a more interactive model:
brain predictions ↔ sensory input
In this model, perception is not the direct reading of sensory data. It is the brain’s best current inference about the causes of sensory data. Sensory input constrains perception, but it does not determine perception by itself.
This helps explain why context and expectation shape experience. A sound may be heard differently depending on what one expects. A vague image may be interpreted differently depending on surrounding context. Bodily sensations may feel more threatening when a person is anxious. Perception is therefore inferential, active, and context-sensitive.
This does not mean perception is arbitrary. Predictions are continuously corrected by prediction errors. Conscious experience results from the dynamic balance between expectation and sensory evidence.
11.10 Hierarchical Predictive Processing
Predictive Processing proposes that the brain is organized hierarchically. Higher levels generate more abstract predictions about objects, meanings, contexts, goals, and causes. Lower levels process more immediate sensory features such as edges, motion, sounds, colours, pressure, and bodily signals.
Predictions flow downward through the hierarchy. Prediction errors flow upward. Higher levels attempt to explain lower-level signals, while lower levels report mismatches between prediction and input.
This hierarchical structure allows the brain to integrate perception, memory, attention, action, and interpretation. A visual experience, for example, is not only a set of colours and lines. It is interpreted as an object in a context, with meaning and possible relevance for action.
Hierarchical prediction also helps explain why high-level beliefs can influence low-level perception. Expectations, emotions, and prior knowledge can shape what is perceived, especially under uncertainty.
11.11 Precision Weighting
A major concept in Predictive Processing is precision weighting. Not all prediction errors are treated equally. The brain must estimate which signals are reliable and which are noisy or irrelevant.
Precision refers to the estimated reliability or importance of a signal. A high-precision prediction error strongly influences belief updating. A low-precision prediction error is given less weight.
Attention is often interpreted as a mechanism for adjusting precision. When attention is directed toward a stimulus, prediction errors related to that stimulus may be given greater weight. This allows the brain to prioritize relevant information.
Abnormal precision weighting may help explain hallucinations, psychosis, anxiety, and altered states. If high-level predictions are given too much precision, they may dominate sensory evidence. If sensory prediction errors are given too much precision, the world may feel chaotic, unstable, or overly salient [@friston2010; @barrett2017].
Precision weighting is important for consciousness because it may help explain why some signals become prominent in experience while others remain background or unconscious.
11.12 Active Inference
Active Inference extends Predictive Processing beyond perception. According to Active Inference, organisms reduce prediction error not only by changing their beliefs, but also by acting on the world [@friston2010; @friston2013; @parr2022].
If the brain predicts a certain sensory outcome, the organism may move in ways that make the prediction true. For example, if one expects to see an object more clearly, one may move closer, turn on a light, or shift gaze. Action becomes a way of sampling the world to reduce uncertainty.
This creates a continuous loop:
- The brain generates predictions.
- The organism acts in the world.
- Action changes sensory input.
- Prediction errors are computed.
- Internal models are updated.
- Further action is guided by the updated model.
Active Inference therefore dissolves a sharp boundary between perception and action. Perception guides action, and action shapes perception. The organism is not a detached observer, but an active participant in its environment.
11.13 Consciousness and Predictive Processing
A major question is whether Predictive Processing explains consciousness specifically or cognition more generally. The framework is powerful for explaining perception, expectation, uncertainty, learning, hallucination, sensory integration, and adaptive behaviour. However, it remains debated whether these mechanisms fully explain phenomenal consciousness.
Some theorists suggest that consciousness may correspond to stable, integrated, precision-weighted predictive models. Others emphasize interoceptive prediction and bodily regulation as central to conscious feeling and selfhood [@seth2013; @seth2021]. On these views, consciousness may arise when predictive models become sufficiently integrated, embodied, and self-related.
However, critics argue that predictive inference alone may not explain subjective experience. A system could generate predictions, minimize error, and guide action without there necessarily being something it is like to be that system. This concern connects Predictive Processing to the same hard problem faced by functionalism, Global Workspace Theory, and Higher-Order theories [@chalmers1995; @chalmers1996].
Predictive Processing may therefore be best understood as a powerful framework for explaining the structure and dynamics of conscious perception, while leaving open whether it fully explains why experience exists.
11.14 Hallucinations and Altered States
Predictive Processing has become especially influential in explaining hallucinations, dreaming, psychosis, psychedelic states, and altered perception.
Hallucinations may occur when top-down predictions dominate sensory evidence. In such cases, the brain’s expectations may generate perceptual experience even when external input is weak or absent. Psychosis has sometimes been interpreted in terms of disturbed precision weighting, where unusual signals or beliefs are given excessive importance [@friston2010].
Dreaming can also be interpreted predictively. During dreams, internally generated models dominate experience while external sensory input is reduced. The brain continues to generate a world, but that world is less constrained by external input.
Psychedelic states have been interpreted as involving altered precision weighting and weakened high-level priors. According to one influential view, psychedelics relax the grip of high-level beliefs, allowing perception, emotion, and self-experience to become more flexible, uncertain, or unconstrained [@carhartHarris2019].
These examples show one of Predictive Processing’s greatest strengths. It explains altered states not as random errors, but as changes in the balance between prediction, sensory evidence, uncertainty, and precision.
11.15 Selfhood and Interoception
Predictive Processing also offers an important account of selfhood. The self is not treated as a simple inner object. Instead, it may arise from predictive regulation of bodily and interoceptive signals.
Interoception refers to the sensing of internal bodily states such as heartbeat, breathing, hunger, pain, temperature, and visceral condition. The brain continuously predicts and regulates these signals. According to predictive approaches, the sense of being an embodied self may arise from stable models of the body’s internal condition [@seth2013; @seth2021].
This connects consciousness with emotion and affect. Feelings may depend on the brain’s predictions about bodily state and physiological regulation. Anxiety, for example, may involve predictions about bodily threat and uncertainty. Emotional experience may therefore be deeply connected to interoceptive inference [@barrett2017].
This embodied approach is important because it moves Predictive Processing beyond abstract cognition. Consciousness is not only about representing the external world. It is also about regulating a living body.
11.16 Predictive Processing Compared with Other Theories
Predictive Processing differs from several major theories discussed elsewhere in this book, although it also overlaps with them.
11.16.1 Relation to Global Workspace Theory
Global Workspace Theory explains consciousness in terms of global broadcasting and cognitive access [@baars1988; @dehaene2011]. Predictive Processing explains perception and cognition in terms of hierarchical prediction and error correction. The two theories may be complementary. Global broadcasting could make certain predictive contents widely available, while predictive processing could explain how those contents are generated.
11.16.2 Relation to Integrated Information Theory
Integrated Information Theory emphasizes intrinsic causal integration and irreducibility [@tononi2004; @oizumi2014]. Predictive Processing emphasizes inference, generative models, precision, and action. IIT asks what kind of system has experience in itself. Predictive Processing asks how organisms model and regulate their world.
11.16.3 Relation to Higher-Order Thought Theory
Higher-Order Thought Theory explains consciousness through awareness of mental states [@rosenthal2005; @lau2011]. Predictive Processing can incorporate higher-order models by treating metacognition as prediction about one’s own mental states. However, HOT places higher-order awareness at the center, while Predictive Processing places hierarchical inference at the center.
11.16.4 Relation to Recurrent Processing Theory
Recurrent Processing Theory emphasizes recurrent neural activity within sensory systems [@lamme2006]. Predictive Processing also emphasizes recurrent hierarchical exchange, but it interprets this exchange as prediction and error correction.
11.16.5 Relation to Attention Schema Theory
Attention Schema Theory explains consciousness through the brain’s model of attention [@graziano2013]. Predictive Processing can interpret attention as precision weighting. Both theories involve internal modeling, but they explain consciousness through different mechanisms.
11.16.6 Relation to Embodied and Enactive Theories
Predictive Processing overlaps strongly with embodied and enactive theories when it emphasizes action, bodily regulation, and organism-environment interaction [@varela1991; @thompson2007]. However, some enactive theorists argue that Predictive Processing remains too brain-centered unless it fully incorporates lived embodiment and environmental coupling.
11.16.7 Relation to Consciousness-First Theories
Predictive Processing differs from consciousness-first theories such as panpsychism, cosmopsychism, idealism, or Taheri’s T-Consciousness. Predictive Processing begins with biological and computational systems that generate models and reduce prediction error. Consciousness-first theories treat consciousness as fundamental or prior to material organization. These frameworks will be discussed later in the book.
11.17 Empirical Support
Predictive Processing is supported by a wide range of empirical research. Perceptual illusion studies show that expectation and context shape experience. Mismatch negativity experiments show that the brain responds strongly when expected patterns are violated. Predictive coding models have been used to explain visual perception, auditory processing, motor control, and hierarchical inference [@rao1999; @friston2005].
Research on hallucinations, psychosis, and altered states also supports the importance of prediction and precision weighting. Psychedelic neuroscience, in particular, has increasingly used predictive frameworks to interpret changes in perception, selfhood, and belief [@carhartHarris2019].
Active Inference has also influenced robotics and computational psychiatry. It provides a framework for understanding how agents act to reduce uncertainty and maintain adaptive engagement with their environment [@parr2022].
However, empirical support for Predictive Processing is complex. Many findings are consistent with predictive models, but consistency is not the same as exclusive confirmation. Competing theories may explain some of the same data differently.
11.18 Strengths of Predictive Processing
Predictive Processing has several major strengths. First, it provides a unified framework for perception, action, attention, learning, embodiment, and adaptive behaviour.
Second, it has a strong computational foundation. It connects consciousness research with Bayesian inference, probabilistic modeling, machine learning, and computational neuroscience.
Third, it explains why perception is shaped by expectation, context, and uncertainty. This makes it especially powerful for understanding illusions, hallucinations, dreams, anxiety, psychosis, and psychedelic states.
Fourth, it integrates body and world. Through Active Inference and interoceptive prediction, the framework connects perception with action, bodily regulation, emotion, and selfhood.
Fifth, it is broadly applicable. Predictive approaches have influenced neuroscience, psychiatry, robotics, artificial intelligence, philosophy of mind, and embodied cognition.
11.19 Weaknesses and Criticisms
Predictive Processing also faces several criticisms. One concern is overgeneralization. Because many processes can be described as prediction-error minimization, critics argue that the framework can become too broad. If everything is prediction, the theory risks explaining too much too easily.
A second concern is vagueness. Predictive Processing sometimes functions as a broad research program rather than a single precise theory. Different authors interpret prediction, inference, precision, and free energy in different ways.
A third concern is empirical underdetermination. Many findings interpreted through Predictive Processing may also be explained by other models. The framework needs clear, risky predictions that distinguish it from competing theories.
A fourth concern is neural implementation. Although predictive coding has strong computational appeal, it remains debated how exactly predictive hierarchies are implemented in biological neural circuits.
The deepest criticism concerns consciousness itself. Predictive Processing may explain perception, cognition, action, and adaptive regulation, but it may not fully explain why any of these processes are accompanied by subjective experience. This is the hard problem in predictive form.
11.20 Predictive Processing and Artificial Intelligence
Predictive Processing has important implications for artificial intelligence. If cognition depends on generative modeling, prediction-error minimization, and active inference, then artificial systems with these capacities may become more adaptive and intelligent.
In robotics, active inference suggests that agents should not merely process inputs and generate outputs. They should actively sample the world, predict sensory consequences, and update internal models through action.
However, prediction alone may not be sufficient for consciousness. Many machine-learning systems generate predictions without having subjective experience. A predictive model may classify, forecast, and adapt without there being anything it is like to be that system.
From a consciousness perspective, the key questions are whether artificial systems can develop integrated predictive models, embodied regulation, self-modeling, affective significance, and autonomous agency. Recent discussions of AI consciousness include predictive processing among several possible theoretical indicators, but they do not treat prediction alone as decisive [@butlin2023].
Predictive Processing therefore contributes important tools to AI and robotics, but it does not by itself settle the question of machine consciousness.
11.21 Open Questions
Several major questions remain unresolved. Why should predictive inference generate subjective experience? Does Predictive Processing explain consciousness itself or only perception and cognition? Is consciousness reducible to probabilistic modeling? How should conscious experience be measured in predictive terms? Can artificial predictive systems become genuinely conscious? Is embodiment necessary? How does precision weighting relate to phenomenology? Can predictive processing explain the unity and qualitative character of experience?
These questions show that Predictive Processing is powerful but incomplete as a theory of consciousness. It provides a rich account of how experience is shaped, but it may not fully explain why experience exists.
11.22 Evaluation
Predictive Processing and Active Inference are among the most influential frameworks in modern cognitive science. They offer a unified account of perception, action, attention, learning, embodiment, hallucination, selfhood, and adaptive behaviour.
Their greatest strength is explanatory integration. They show how perception is shaped by prediction, how action reduces uncertainty, how attention regulates precision, and how selfhood may depend on bodily prediction.
Their greatest limitation is that they may explain cognition more clearly than consciousness itself. Predictive inference can explain how perceptual contents are constructed, corrected, and regulated. It does not automatically explain why those contents are subjectively experienced.
For this reason, Predictive Processing is best understood as a powerful framework for explaining the dynamics and structure of conscious perception, embodiment, and self-modeling. Whether it provides a complete theory of phenomenal consciousness remains unresolved.
11.23 Chapter Summary
Predictive Processing proposes that the brain is a prediction-generating system. It continuously generates hypotheses about the causes of sensory input and updates those hypotheses through prediction-error minimization.
Active Inference extends this framework by showing how organisms act on the world to reduce prediction error. Perception and action form a continuous loop: the organism predicts, acts, samples sensory input, updates its model, and acts again.
The framework explains perception as active inference rather than passive reception. It emphasizes generative models, hierarchical prediction, prediction error, precision weighting, interoception, embodiment, and adaptive regulation.
Predictive Processing is especially strong in explaining perception, illusions, hallucinations, dreaming, psychedelic states, selfhood, emotion, and action. Its major challenge is whether prediction and inference fully explain subjective experience.
The central unresolved question is whether consciousness is itself a form of predictive inference, or whether predictive inference explains only the structure and dynamics of conscious cognition.