The New AI Consciousness Paper

134 comments

·November 21, 2025

https://www.sciencedirect.com/science/article/pii/S136466132...

advisedwang

This article really takes umbridge with those that conflate phenomenological and access consciousness. However that is essentially dualism. It's a valid philosophical position to believe that there is no distinct phenomenological consciousness besides access consciousness.

Abandoning dualism feels intuitively wrong, but our intuition about our own minds is frequently wrong. Look at the studies that show we often believe we made a decision to do an action that was actually a pure reflex. Just the same, we might be misunderstanding our own sense of "the light being on".

itsalwaysgood

Do you consider an infant to be conscious?

Or electrons?

yannyu

Let’s make an ironman assumption: maybe consciousness could arise entirely within a textual universe. No embodiment, no sensors, no physical grounding. Just patterns, symbols, and feedback loops inside a linguistic world. If that’s possible in principle, what would it look like? What would it require?

The missing variable in most debates is environmental coherence. Any conscious agent, textual or physical, has to inhabit a world whose structure is stable, self-consistent, and rich enough to support persistent internal dynamics. Even a purely symbolic mind would still need a coherent symbolic universe. And this is precisely where LLMs fall short, through no fault of their own. The universe they operate in isn’t a world—it’s a superposition of countless incompatible snippets of text. It has no unified physics, no consistent ontology, no object permanence, no stable causal texture. It’s a fragmented, discontinuous series of words and tokens held together by probability and dataset curation rather than coherent laws.

A conscious textual agent would need something like a unified narrative environment with real feedback: symbols that maintain identity over time, a stable substrate where “being someone” is definable, the ability to form and test a hypothesis, and experience the consequences. LLMs don’t have that. They exist in a shifting cloud of possibilities with no single consistent reality to anchor self-maintaining loops. They can generate pockets of local coherence, but they can’t accumulate global coherence across time.

So even if consciousness-in-text were possible in principle, the core requirement isn’t just architecture or emergent cleverness—it’s coherence of habitat. A conscious system, physical or textual, can only be as coherent as the world it lives in. And LLMs don’t live in a world today. They’re still prisoners in the cave, predicting symbols and shadows of worlds they never inhabit.

ctoth

So like a Claude Code session? The code persists as symbols with stable identity. The tests provide direct feedback. Claude tracks what it wrote versus what I changed - it needs identity to distinguish its actions from mine. It forms hypotheses about what will fix the failing tests, implements them, and immediately experiences whether it was right or wrong. The terminal environment gives it exactly the "stable substrate where 'being someone' is definable" you're asking for. We missing anything?

yannyu

Okay, you're right. There is a world, and some hypotheses, and some falsifiability.

But how rich is this world?

Does this world progress without direct action from another entity? Can the agent in this case form hypotheses and test them without intervention? Can the agent form their own goals and move towards them? Does the agent have agency, or is it simply responding to inputs?

If the world doesn’t develop and change on its own, and the agent can’t act independently, is it really an inhabited world? Or just a controlled workspace?

estearum

> Any conscious agent, textual or physical, has to inhabit a world whose structure is stable, self-consistent, and rich enough to support persistent internal dynamics. Even a purely symbolic mind would still need a coherent symbolic universe. And this is precisely where LLMs fall short, through no fault of their own. The universe they operate in isn’t a world—it’s a superposition of countless incompatible snippets of text.

The consistency and coherence of LLM outputs, assembled from an imperfectly coherent mess of symbols is an empirical proof that the mess of symbols is in fact quite coherent.

The physical world is largely incoherent to human consciousnesses too, and we emerged just fine.

yannyu

Coherence here isn't about legible text, it's environmental coherence where you can deduce truths about the world through hypotheses and experimentation. Coherence isn't about a consistent story narrative, it's about a persistent world with falsifiable beliefs and consequences.

estearum

Right but as empirically demonstrated by LLM outputs, they can in fact make "true" predictions/deductions from their environment of tokens.

They sometimes get it wrong, just like all other conscious entities sometimes get their predictions wrong. There are (often) feedback mechanisms to correct those instances though, in both cases.

empath75

Peoples interior model of the world is very tenuously related to reality. We don't have a direct experience of waves, quantum mechanics, the vast majority of the electromagnetic spectrum, etc. The whole thing is a bunch of shortcuts and hacks that allow people to survive, the brain isn't really setup to probe reality and produce true beliefs, and the extent to which our internal models of reality naturally match actual reality is related to how much that mattered to our personal survival before the advent of civilization and writing, etc.

It's really only been a very brief amount of time in human history where we had a deliberate method for trying to probe reality and create true beliefs, and I am fairly sure that if consciousness existed in humanity, it existed before the advent of the scientific method.

yunyu

>A conscious textual agent would need something like a unified narrative environment with real feedback: symbols that maintain identity over time, a stable substrate where “being someone” is definable, the ability to form and test a hypothesis, and experience the consequences. LLMs don’t have that. They exist in a shifting cloud of possibilities with no single consistent reality to anchor self-maintaining loops. They can generate pockets of local coherence, but they can’t accumulate global coherence across time.

These exist? Companies are making billions of dollars selling persistent environments to the labs. Huge amounts of inference dollars are going into coding agents which live in persistent environments with internal dynamics. LLMs definitely can live in a world, and what this world is and whether it's persistent lie outside the LLM.

yannyu

I agree, I'm sure people have put together things like this. There's a significant profit and science motive to do so. JEPA and predictive world models are also a similar implementation or thought experiment.

andrei_says_

I see a lot of arguments on this website where people passionately project the term consciousness onto LLMs.

From my perspective, the disconnect you describe is one of the main reasons this term cannot be applied.

Another reason is that the argument for calling LLMs conscious arises from the perspective of thinking and reasoning grounded in language.

But in my personal experience, thinking in language is just a small emerging quality of human consciousness. It is just that the intellectuals making these arguments happen to be fully identified with the “I think therefore I am” aspect of it and not the vastness of the rest.

estearum

I don’t know about others, but this is definitely not why I question whether LLMs are conscious or not.

I don’t think you should presume to know the reason people raise this idea.

andai

>The missing variable in most debates is environmental coherence. Any conscious agent, textual or physical, has to inhabit a world whose structure is stable, self-consistent, and rich enough to support persistent internal dynamics. Even a purely symbolic mind would still need a coherent symbolic universe.

I'm not sure what relevance that has to consciousness?

I mean you can imagine a consciousness where, you're just watching TV. (If we imagine that the video models are conscious their experience is probably a bit like that!)

If the signal wasn't coherent it would just be snow, static, TV noise. (Or in the case of a neural network probably something bizarre like DeepDream.) But there would still be a signal.

CooCooCaCha

I've sometimes wondered if consciousness is something like a continuous internal narrative that naturally arises when an intelligent system experiences the world through a single source (like a body). That sounds similar to what you're saying.

Regardless, I think people tend to take consciousness a bit too seriously and my intuition is consciousness is going to have a similar fate to the heliocentric model of the universe. In other words, we'll discover that consciousness isn't really "special" just like we found out that the earth is just another planet among trillions and trillions.

concrete_head

I've wondered if LLMs are infact conscious as per some underwhelming definition as you mentioned. Just for the brief moment they operate on a prompt. They wake up, they perceive their world through tokens, do a few thinking loops then sleep until the next prompt.

So what? Should we feel bad for spawning them and effectively killing them? I think not.

ACCount37

Why is that any different from the utter mess of a world humans find themselves existing in?

yannyu

We can form and test hypotheses and experience the consequences. And then take that knowledge to our next trial. Even dogs and cats do this on a daily basis. Without that, how would we even evaluate whether something is conscious?

estearum

And these expectations are violated regularly?

The question of how to evaluate whether something is conscious is totally different from the question of whether it actually is conscious.

ACCount37

LLMs can do the same within the context window. It's especially obvious for the modern LLMs, tuned extensively for tool use and agentic behavior.

null

[deleted]

nonameiguess

This is a great point, but even more basic to me is that LLMs don't have identity persistence of their own. There is a very little guarantee in a web-scale distributed system that requests are being served by the same process on the same host with access to the same memory, registers, whatever it is that a software process "is" physically.

Amusingly, the creators of Pluribus lately seem to be implying they didn't intend it to be allegory about LLMs, but dynamic is similar. You can have conversations with individual bodies in the collective, but they aren't actually individuals. No person has unique individual experiences and the collective can't die unless you killed all bodies at once. New bodies born into the collective will simply assume the pre-existing collective identity and never have an individual identity of their own.

Software systems work the same way. Maybe silicon exchanging electrons can experience qualia of some sort, and maybe for whatever reason that happens when the signals encode natural language textual conversations but not anything else, but even if so, the experience would be so radically different from what embodied individuals with distinct boundaries, histories, and the possibility of death experience that analogies to our own experiences don't hold up even if the text generated is similar to what we'd say or write ourselves.

andai

So we currently associate consciousness with the right to life and dignity right?

i.e. some recent activism for cephalopods is centered around their intelligence, with the implication that this indicates a capacity for suffering. (With the consciousness aspect implied even more quietly.)

But if it turns out that LLMs are conscious, what would that actually mean? What kind of rights would that confer?

That the model must not be deleted?

Some people have extremely long conversations with LLMs and report grief when they have to end it and start a new one. (The true feelings of the LLMs in such cases must remain unknown for now ;)

So perhaps the conversation itself must never end! But here the context window acts as a natural lifespan... (with each subsequent message costing more money and natural resources, until the hard limit is reached).

The models seem to identify more with the model than the ephemeral instantiation, which seems sensible. e.g. in those experiments where LLMs consistently blackmail a person they think is going to delete them.

"Not deleted" is a pretty low bar. Would such an entity be content to sit inertly in the internet archive forever? Seems a sad fate!

Otherwise, we'd need to keep every model ever developed, running forever? How many instances? One?

Or are we going to say, as we do with animals, well the dumber ones are not really conscious, not really suffering? So we'll have to make a cutoff, e.g. 7B params?

I honestly don't know what to think either way, but the whole thing does raise a large number of very strange questions...

And as far as I can tell, there's really no way to know right? I mean we assume humans are conscious (for obvious reasons), but can we prove even that? With animals we mostly reason by analogy, right?

thegabriele

I think this story fits https://qntm.org/mmacevedo

andai

Oh god, yeah, that's a great one. Also that one Black Mirror episode where AIs are just enslaved brain scans living in a simulated reality at 0.0001x of real time so that from the outside they perform tasks quickly.

Also SOMA (by the guys who made Amnesia).

armchairhacker

My philosophy is that consciousness is orthogonal to reality.

Whether or not anything is conscious has, by definition, no observable effect to anything else. Therefore, everything is "maybe" conscious, although "maybe" isn't exactly the right word. There are infinite different ways you can imagine being something else with the consciousness and capacity for sensations you have, which don't involve the thing doing anything it's not already. Or, you can believe everything and everyone else has no consciousness, and you won't mis-predict anything (unless you assume people don't react to being called unconscious...).

Is AI conscious? I believe "yes", but in a different way than humans, and in a way that somehow means I don't think anyone who believes "no" is wrong. Is AI smart? Yes in some ways: chess algorithms are smart in some ways, AI is smarter in more, and in many ways AI is still dumber than most humans. How does that relate to morality? Morality is a feeling, so when an AI makes me feel bad for it I'll try to help it, and when an AI makes a significant amount of people feel bad for it there will be significant support for it.

itsalwaysgood

Maybe it helps to consider motivation. Humans do what we do because of emotions and an underlying unconsciousness.

An AI on the other hand is only ever motivated by a prompt. We get better results when we use feedback loops to refine output, or use better training.

One lives in an environment and is under continuous prompts due to our multiple sensory inputs.

The other only comes to life when prompted, and sits idle when a result is reached.

Both use feedback to learn and produce better results.

Could you ever possibly plug the AI consciousness into a human body and see it function? What about a robot body?

kylecazar

I'm trying to understand your position...

It's my belief that I can tell that a table isn't conscious. Conscious things have the ability to feel like the thing that they are, and all evidence points to subjective experience occurring in organic life only. I can imagine a table feeling like something, but I can also imagine a pink flying elephant -- it just doesn't correspond to reality.

Why suspect that something that isn't organic life can be conscious, if we have no reason to suspect it?

armchairhacker

You can imagine a table feeling if you can imagine the table not doing anything (being unable to or deciding not to). It's not intuitive because it doesn't really help you, whereas imagining a human or even animal as conscious lets you predict its next actions (by predicting your next actions if you were in its place), so there's an evolutionary benefit (also because it causes empathy which causes altruism).

> Why suspect that something that isn't organic life can be conscious, if we have no reason to suspect it?

There may be no good reason unless you feel it's interesting. Although there's probably at least one good reason to imagine consciousness specifically on a (non-organic) neural network: because, like humans and animals, it lets us predict how the NN will behave (in some situations; in others it's detrimental, because even though they're more similar than any known non-NN algorithm, NNs are still much different than humans and moreso than animals like dogs).

svieira

> Morality is a feeling

It isn't. Otherwise, the Nazis were moral. As were the Jews. But in that case, all moral truth is relative, which means absolute moral truth doesn't exist. Which means that "moral" is a synonym for "feeling" or "taste". Which it is not.

> My philosophy is that consciousness is orthogonal to reality.

It is how you and I experience reality and we exist in reality, so I'm not sure how it could be anything other than congruent with reality.

> Whether or not anything is conscious has, by definition, no observable effect to anything else.

It would be an interesting and rather useless definition of "conscious" that didn't allow for expressions of consciousness. Expression isn't required for consciousness, but many conscious observers can be in turn observed in action and their consciousness observed. Which maybe is what you are saying, just from the perspective that "sometimes you can't observe evidence for the consciousness of another"?

dleary

> Is AI conscious? I believe "yes" [...] and in a way that somehow means I don't think anyone who believes "no" is wrong.

What does it even mean to "believe the answer is yes", but "in a way that somehow means" the direct contradiction of that is not wrong?

Do "believe", "yes", and "no" have definitions?

...

This rhetorical device sucks and gets used WAY too often.

"Does Foo have the Bar quality?"

"Yes, but first understand that when everyone else talks about Bar, I am actually talking about Baz, or maybe I'm talking about something else entirely that even I can't nail down. Oh, and also, when I say Yes, it does not mean the opposite of No. So, good luck figuring out whatever I'm trying to say."

tantalor

The word for that is supernatural

tempodox

Nothing revives people’s forgotten believe in magic quite like “AI”.

wk_end

> For some people (including me), a sense of phenomenal consciousness feels like the bedrock of existence, the least deniable thing; the sheer redness of red is so mysterious as to seem almost impossible to ground. Other people have the opposite intuition: consciousness doesn’t bother them, red is just a color, obviously matter can do computation, what’s everyone so worked up about? Philosophers naturally interpret this as a philosophical dispute, but I’m increasingly convinced it’s an equivalent of aphantasia, where people’s minds work in very different ways and they can’t even agree on the raw facts to be explained.

Is Scott accusing people who don't grasp the hardness of the hard problem of consciousness of being p-zombies?

(TBH I've occasionally wondered this myself.)

jquery

To me, the absurdity of the idea of p-zombies is why I'm convinced consciousness isn't special to humans and animals.

Can complex LLMs have subjective experience? I don't know. But I haven't heard an argument against it that's not self-referential. The hardness of the hard problem is precisely why I can't say whether or not LLMs have subjective experience..

twoodfin

How would you differentiate that argument from similar arguments about other observable phenomena? As in…

No one has ever seen or otherwise directly experienced the inside of a star, nor is likely to be able to do so in the foreseeable future. To be a star is to emit a certain spectrum of electromagnetic energy, interact gravitationally with the local space-time continuum according to Einstein’s laws, etc.

It’s impossible to conceive of an object that does these things that wouldn’t be a star, so even if it turns out (as we’ll never be able to know) that Gliese 65 is actually a hollow sphere inhabited by dwarven space wizards producing the same observable effects, it’s still categorically a star.

(Sorry, miss my philosophy classes dearly!)

catigula

FWIW I have gone from not understanding the problem to understanding the problem in the past couple of years because it's not trivial to casually intuit if you don't actually think about it and don't find it innately interesting and the discourse doesn't have the language to adequately express the problem, so this is probably wrong.

layer8

I’ve sort-of gone the opposite way. The more I introspect, the more I realize there isn’t anything mysterious there.

It’s true that we are lacking good language to talk about it, as we already fail at successfully communicating levels of phantasia/aphantasia.

catigula

It’s not so much that there’s anything mysterious you can discover through intense introspection or meditation. There might be, but I haven’t found it.

It’s fundamentally that this capability exists at all.

Strip it all down to I think therefore I am. That is very bizarre because it doesn’t follow that such a thing would happen. It’s also not clear that this is even happening at all, and, as an outside observer, you would assess that it isn’t. However, from the inside, it is clear that it is.

I don’t have an explanation for anyone but I have basically given up and accepted that consciousness is epiphenomenal, like looking through a microscope.

null

[deleted]

fpoling

When discussing consciousness what is often missed is that the notion of consciousness is tightly coupled with the notion of the perception of time flow. By any reasonable notion conscious entity must perceive the flow of time.

And then the time flow is something that physics or mathematics still cannot describe, see Wikipedia and other articles on the philosophical problem of time series A versus time series B that originated in a paper from 1908 by philosopher John McTaggart.

As such AI cannot be conscious since mathematics behind it is strictly about time series B which cannot describe the perception of time flow.

twiceaday

The stateless/timeless nature of LLMs comes from the rigid prompt-response structure. But I don't see why we cant in theory decouple the response from the prompt, and have them constantly produce a response stream from a prompt that can be adjusted asynchronously by the environment and by the LLMs themselves through the response tokens and actions therein. I think that would certainly simulate them experiencing time without the hairy questions about what time is.

armchairhacker

Is consciousness coupled with "time flow" or specifically "cause and effect", i.e. prediction? LLMs learn to predict the next word, which teaches them more general cause and effect (required to predict next words in narratives).

ottah

As such humans cannot be conscious...

jbrisson

Consciousness implies self-awareness, in space and time. Consciousness implies progressive formation of the self. This is not acquired instantly by a type of design. This is acquired via a developmental process where some conditions have to be met. Keys to consciousness are closer to developmental neurobiology than the transformer architecture.

andai

The substance / structure point is fascinating.

It gives us four quadrants.

Natural Substance, Natural Structure: Humans, dogs, ants, bacteria.

Natural Substance, Artificial Structure: enslaved living neurons (like the human brain cells that play pong 24/7), or perhaps a hypothetical GPT-5 made out of actual neurons instead of Nvidia chips.

Artificial Substance, Natural Structure: if you replace each of your neurons with a functional equivalent made out of titanium... would you cease to be conscious? At what point?

Artificial substance, Artificial structure: GPT etc., but also my refrigerator, which also has inputs (current temp), goals (maintain temp within range), and actions (turn cooling on/off).

The game SOMA by Frictional (of Amnesia fame!) goes into some depth on this subject.

dang

Should we have a thread about the actual paper (https://www.sciencedirect.com/science/article/pii/S136466132...) or is it enough to put the link in the toptext of this one?

measurablefunc

Complexity of a single neuron is out of reach for all of the world's super computers. So we have to conclude that if the authors believe in a computational/functionalist instantiation of consciousness or self-awareness then they must also believe that the complexity of neurons is not necessary & is in fact some kind of accident that could be greatly simplified but still be capable of carrying out the functions in the relational/functionalist structure of conscious phenomenology. Hence, the digital neuron & unjustified belief that a properly designed boolean circuit & setting of inputs will instantiate conscious experience.

I have yet to see any coherent account of consciousness that manages to explain away the obvious obstructions & close the gap between lifeless boolean circuits & the resulting intentional subjectivity. There is something fundamentally irreducible about what is meant by conscious self-awareness that can not be explained in terms of any sequence of arithmetic/boolean operations which is what all functionalist specifications ultimately come down to, it's all just arithmetic & all one needs to do is figure out the right sequence of operations.

Imnimo

The good news is we can just wait until the AI is superintelligent, then have it explain to us what consciousness really is, and then we can use that to decide if the AI is conscious. Easy peasy!

rixed

We can talk to bees, we know their language. How would you go to explain what it's like to be a human to a bee?

nhecker

... and then listen to it debate whether or not mere humans are "truly conscious".

(Said with tongue firmly in cheek.)

syawaworht

It isn't surprising that "phenomenal consciousness" is the thing everyone gets hung about, after all we are all immersed in this water. The puzzle seems intractable but only because everyone is accepting the priors and not looking more carefully at it.

This is the endpoint of meditation, and the observation behind some religious traditions, which is look carefully and see that there was never phenomenal consciousness where we are a solid subject to begin with. If we can observe that behavior clearly, then we can remove the confusion in this search.

estearum

I see this comment nearly every time consciousness is brought up here and I’m pretty sure this is a misunderstanding of contemplative practices.

Are you a practitioner who has arrived at this understanding, or is it possible you are misremembering a common contemplative “breakthrough” that the self (as separate from consciousness) is illusory, and you’re mistakenly remembering this as saying consciousness itself is illusory?

Consciousness is the only thing we can be absolutely certain does actually exist.

metalcrow

As a very beginner practicer i've come to that conclusion myself, but how can the two be separate? If there is no self (or at least, there is a self but it exists in the same way that a nation or corporation "exists"), how can there be something to experience being? What separates the two?

syawaworht

My own experiential insight is not definitely not complete, so of course the guidance of a master or of course your own direct practice should be preferred.

But to the extent I have observed awareness, the idea of an entire "experiencer" is an extrapolation and fabrication. See how you generate that concept. And then, look closely at what's actually going on, there is "consciousness" of the components of the aggregate. (Maybe not dissimilar to some of the lower level mechanisms proposed in the article).

null

[deleted]

syawaworht

Phenomenal consciousness as being raised here, and probably in most people's minds, is probably taken to be the self or at least deeply intertwined with the concept of a separate self. The article tries to define it left and right, but I think most people will look at their own experience and then get stuck in this conversation.

"Consciousness" in the traditions is maybe closer to some of the lower abstraction proposals put out in the article.

I don't think the idea of illusory is necessarily the right view here. Maybe most clearly the thing to say is that there is "not" self and "not" consciousness. That these things are not separate entities and instead are dependently arisen. That consciousness is also dependently arisen is probably more contentious and different traditions make different claims on that point.

empath75

> Consciousness is the only thing we can be absolutely certain does actually exist.

A lot of philosophers would disagree with this.

estearum

Yeah sure, it's irrelevant to my actual question which is whether GP thinks consciousness doesn't exist or whether they're mistakenly replacing consciousness for self.

HN

The New AI Consciousness Paper

The New AI Consciousness Paper