Geometry of Human Perceptual Domains Emerges Transiently in LLM Representations

Simardeep Singh & Paras Chopra

LLMs are trained purely on text — yet their internal representations spontaneously develop geometric structures that mirror human perception across color, pitch, emotion, and taste. These structures emerge in intermediate layers before attenuating in deeper representations.

arXiv (PDF) alphaXiv Google Scholar Explore Results ↓

Key Findings

Three things we discovered

By extracting layer-wise representations from open-weight language models and comparing them to human perceptual baselines, we uncovered a consistent pattern across domains and model families.

1

Perceptual geometry emerges without supervision

Color wheels, pitch spirals, emotion manifolds, and taste maps form spontaneously inside LLMs trained only on text.

2

Each domain follows a unique emergence profile

Taste peaks early and fades fast. Emotion peaks later and persists deep. Pitch and color lie in between, each with their own curve.

3

The trajectory is rise → peak → fade

Early layers are diffuse, intermediate layers crystallize human-like manifolds, later layers dissolve them as models shift to task-specific computation.

Overview

Perceptual geometry across four domains

The figure below shows the central result: for color (LLaMA-3-8B), pitch (Qwen-3-4B), and emotion (Gemma-7B), we display the human perceptual baseline, the model's peak-alignment geometry, and the layer-wise alignment profile.

Overview figure showing perceptual geometry emergence across color, pitch, and emotion domains

Figure 2. Each row is a perceptual domain. Left column: human perceptual baseline. Middle: peak-alignment model geometry via MDS. Right: RSA and GPA alignment scores across layers, showing the rise–peak–fade trajectory.

Comparison of human taste perception map and Gemma-7B peak-layer representation

Figure 3. Human perceptual map (top left) vs. Gemma-7B peak-layer LLM representation for taste (top right). Bottom: layer-wise RSA and GPA alignment scores.

The Pattern

Rise → peak → fade

Layer-wise alignment profile (schematic)

Color — peaks at early-intermediate layers, fades gradually
Pitch — localized emergence, deforms in deeper layers
Emotion — peaks later, stays comparatively stable
Taste — emerges early, degrades rapidly
Early layers Intermediate Late layers

Results by modality

Each domain, in depth

Color · LLaMA-3-8B

The color wheel emerges without grounding

At peak alignment layers, color representations organize into a smooth circular manifold closely resembling the human perceptual color wheel — arising from purely linguistic statistics.

The depth-wise pattern is consistent across all four architectures: a clear rise–peak–fall profile where color alignment peaks at an intermediate layer before late-layer attenuation. Qwen-3-4B shows a brief late-layer rebound before final degradation.

Color geometry emergence in LLaMA-3-8B across model depth

Fig. 6. Layer-wise emergence of 2D color geometry in LLaMA-3-8B. Human baseline (top-left), early layer (top-right), peak layer (bottom-left), final layer (bottom-right).

Pitch · Qwen-3-4B

A smooth arc encodes ordinal pitch relations

The peak-layer geometry reveals a smooth arc-like organization consistent with continuous, ordinal human pitch perception. No discrete clusters — pitch is represented relationally, as a continuous manifold.

Early layers show weak partial ordering; intermediate layers undergo a structural transition; later layers progressively deform this organization.

3D pitch geometry emergence in Qwen-3-4B across model depth

Fig. 17. Layer-wise emergence of 3D Pitch geometry in Qwen-3-4B. Human baseline (top-left), early layer (top-right), peak layer (bottom-left), final layer (bottom-right).

Emotion · Gemma-7B

Affective geometry persists through depth

The peak-layer representation recovers a well-organized affective manifold aligned with the human valence–arousal structure. Unlike color, this alignment remains comparatively stable across later layers.

This persistence suggests that emotional geometry is more deeply encoded — perhaps because affective concepts are more densely represented in language than sensory modalities.

Emotion geometry emergence in Gemma-7B across model depth

Fig. 11. Layer-wise emergence of emotion geometry in Gemma-7B. Human baseline (top-left), early layer (top-right), peak layer (bottom-left), final layer (bottom-right).

Taste · Gemma-7B

Early emergence, rapid degradation

Taste representations recover a qualitatively well-formed manifold with strong geometric alignment (high GPA). The relative ordering of primary tastes and mixtures broadly matches human perceptual arrangement.

However, taste diverges from other domains: lower RSA scores and noisier layer-wise profiles indicate less precise pairwise relations, and the geometry degrades more rapidly after its peak.

Taste geometry emergence in Gemma-7B across model depth

Fig. 13. Layer-wise emergence of Taste geometry in Gemma-7B. Human baseline (top-left), early layer (top-right), peak layer (bottom-left), final layer (bottom-right).

Methodology

A fully intrinsic approach

No probing classifiers. No fine-tuning. We extract layer-wise residual stream representations, construct geometric maps, and compare them to human perceptual baselines using two complementary metrics.

01 · Stimuli

Minimal structured prompts

Each stimulus (e.g. #9B081A, afraid, 261 Hz) is embedded in a short template designed to avoid semantic bias.

02 · Extract

Layer-wise hidden states

Last-token hidden state activations are extracted at every transformer layer from the residual stream.

03 · Geometry

MDS + Isomap

Pairwise cosine dissimilarities are projected into low-dimensional space via MDS, verified with Isomap to rule out projection artifacts.

04 · Compare

RSA + Procrustes (GPA)

Representational Similarity Analysis and Generalized Procrustes Analysis measure alignment against human perceptual baselines at each layer.

Paper summary

The key ideas, condensed

Citation

How to cite this work

This paper is currently under review at ICML Mechanistic Interpretability workshop 2026. If you use this work, please cite:

@misc{singh2026geometryhumanperceptualdomains,
      title={Geometry of Human Perceptual Domains Emerges Transiently in LLM Representations}, 
      author={Simardeep Singh and Paras Chopra},
      year={2026},
      eprint={2605.27970},
      archivePrefix={arXiv},
      primaryClass={cs.AI},
      url={https://arxiv.org/abs/2605.27970}, 
}