Your Eyes & Brain
Photons β perception β understanding
Photons hit your retina, rod and cone cells fire, the optic nerve carries signals to the visual cortex, and your brain assembles meaning. You see the page. You understand. You act again.
How It Works
The display emits photons that pass through your cornea, lens, and vitreous humor to hit your retina β a thin layer of neural tissue at the back of your eye. The retina contains ~120 million rod cells (sensitive to brightness, used in low light) and ~6 million cone cells (sensitive to color β three types for red, green, and blue wavelengths). These cells convert photons into electrical signals through a photochemical process called phototransduction.
The signals from ~126 million photoreceptor cells are compressed by retinal processing into ~1.2 million nerve fibers in the optic nerve β about 10 megabits per second of information. These signals reach the primary visual cortex (V1) in about 30β50 ms. V1 detects edges and orientations. V2 processes contours and depth. V4 handles color and shape recognition. Higher areas integrate this into objects, text, and meaning. The entire process β from photon to conscious perception β takes about 100β150 ms. And then the cycle begins again: you see, you decide, you act.
The Signal Flow
Key Concepts
Rod cells detect brightness (scotopic vision, ~120M per eye). Cone cells detect color: S-cones (blue, ~420nm), M-cones (green, ~530nm), L-cones (red, ~560nm) β roughly 6M per eye. The fovea (center of vision) has the highest cone density β that's why you look directly at text to read it.
~1.2 million nerve fibers transmit about 10 Mbps of processed visual information from each eye β already heavily compressed from the ~126M photoreceptors by retinal circuits.
V1 (primary visual cortex) detects edges and orientations. V2 adds contour ownership and depth. V4 processes color constancy and shape. The fusiform face area recognizes faces. The visual word form area recognizes text. Each level builds more abstract representations.
Photon to V1: ~30β50 ms. V1 to conscious perception: ~50β100 ms more. Total: ~100β150 ms from photon to 'I see it.' This is why 100 ms response times feel instant β the response arrives before you've finished processing the previous frame.
Deep Dive
Why 60 fps looks smooth
The human visual system doesn't see in discrete frames β it integrates light continuously. But there's a temporal resolution limit: most people stop perceiving improvement above 60β90 fps for typical content (some competitive gamers perceive differences up to 240 fps for fast motion). Below 24 fps, motion appears jerky. Between 30β60 fps, motion blur masks individual frames. The web's 60 fps target (16.67 ms per frame) is a balance between visual smoothness and computational cost β high enough that motion appears fluid, low enough that modern hardware can sustain it.
The perception-action cycle
Vision isn't passive reception β it's an active process. Your eyes make 3β5 saccades per second (rapid jumps between fixation points), each lasting 20β200 ms. Between saccades, your brain suppresses visual input (saccadic masking) so you don't see blur. You read a web page through a series of fixations, each lasting 200β300 ms, with your fovea (sharp center of vision) landing on the most informative parts. UI design that respects this β clear visual hierarchy, adequate spacing, F-pattern or Z-pattern layouts β works with your visual system, not against it.