Why should we believe what we can’t consciously perceive—that language and music mimic nature at all but the highest hierarchical level? Why not go all the way and make language and music completely like nature?
Let’s not forget that language and music are not merely trying to mimic nature. They have jobs to do: writing is for putting thoughts on the record, speech is for transmitting thoughts to others, and music is perhaps for something like evoking feelings in others. Language and music want to capture as much of the structure of nature as they can so that they have an easy ride into our brains, but they must serve their purpose, and will have to sacrifice nature-mimicry when it is necessary to do so.
So one can see how sacrifices of nature-mimicry may sometimes be part of doing business. But why should the sacrifices be up near the top, where we have greater conscious access? The principal reason for this is that if the earlier regions of the hierarchy receive stimuli that they can’t make any sense of, then they will output garbage to the next higher level, and so all levels above the unhappy level will be unhappy. Breaking nature-mimicry at one level will break it at all higher levels
For example, I have argued in earlier research and in The Vision Revolution that writing looks like nature. In particular, I have suggested that written words look like visual objects. But words do not necessarily look natural at all levels up the hierarchy. Strokes look like contours, and letters look like object junctions; and thus the lower and middle levels of your visual hierarchy are happy. But because in alphabetic writing systems the letters in a word depend on how it is spoken, there is no effective way to make entire words look like objects. (For example, the junction-like letters in the words you are currently reading are simply placed side by side, which is not the way junctions in scenes are spatially related.) Your highest-level regions, of which you are most directly aware, only notice the nonnatural look of written words. And when visual signs do more closely match the visual structure of objects at the highest levels, people do see the resemblance to nature—this is why trademark logos and logographic writing systems like Chinese look (to your conscious self) much more object-like than the words you’re reading here.
My claim in this book that language and music mimic nature must be understood in this light. I claim that they mimic nature, indeed, but not necessarily “all the way up.” The reason why writing, speech, and music don’t obviously seem like nature is that nature is not being injected at the higher levels, perhaps—as we’ve seen with writing—in order to better accomplish the functions they are designed to carry out.
We see, then, why it is that the nature-mimicry in language and music has remained a secret for so many millennia. If only your lower-level visual and auditory areas could speak! They’d have long ago let you know that language and music are built like nature. Because those lower homunculi are part of you, there is a sense in which you have known about this ancient, deep secret code all along. Pieces of meat inside you knew the secret, but weren’t telling. In this light, one can view this book as a kind of psychoanalysis—if you’re into that—digging up the homunculus-knowledge you already have deep inside you, and working through the ways it shaped who you are today.
Nature’s Hard Core
“Language and music mimic nature.”
I will try to convince you of that over the course of this book. But we humans are very bad judges of whether a stimulus is natural or not, as we just finished discussing. How, then, are we to have any idea whether language and music do or do not mimic nature? And isn’t the phrase “mimicking nature” awfully imprecise?
Indeed, it is imprecise. Part of the book’s point is to make this more precise—to say specifically which aspects of nature are mimicked by language and music. But can this really be done? How can we possibly know the details of the natural world our ancestors experienced? It is, after all, that version of nature that our brains evolved to be good at processing, and that was a long time ago. In order to show that language and music mimic the primordial natural habitats that shaped our brains, it would seem that we have no choice but to start familiarizing ourselves with the sights and sounds of the savanna, and wherever else our ancestors hung out.
A field trip is in order. Like all natural habitats, savannas feature a hodgepodge of characters and settings, and we must take a visual and auditory inventory. We will need images from all over the savanna, including acacia trees, tall grass, sunsets, rocks, giraffes, lions’ manes, and termite mounds. And we will need to record the sounds within those habitats, including wind, rustling leaves, bird calls, insect buzzes, and rhino grunts. Once we have built an encyclopedia of how our ancestral world looked and sounded, we can then ask whether language and music mimic nature.
Hold it right there! Sorry, but I’m pulling your leg. That is definitely not what we’re going to do. Trekking across the Serengeti with a camera and tape recorder sounds grand, but it isn’t the route to a compelling scientific explanation. We want generalizations about what visual nature looked and sounded like for our ancestors, not merely an itemized list of all the furniture in the savanna. We need to grasp the fundamental “grammar” of nature. We need to pick apart nature, carve it at its joints, and elegantly summarize its structure. With such a “grammar” in hand, we will be able to make powerful predictions about what a nature-mimicker should be like . . . and thus, what language and music should be like.
Might there be fundamental regularities that cut across a wide swath of terrestrial habitats? Could it be that, although there are large salient differences across habitats, there are nevertheless deeper respects in which they are all similar? Although the savanna and any other specific habitats shaping our ancestors have hosts of peculiar features, might the sights and sounds of these habitats nevertheless have the same fundamental “core grammar”?
If there is a “core grammar” to nature, then this core would have been a highly steady invariant over our evolutionary history, and would thus have been a strong shaper of our visual and auditory systems. Sensory structure specific to the savanna or other particular habitats would, on the other hand, have been highly variable and intermittent over evolutionary time, and consequently less important for understanding what our visual and auditory systems became good at.
It’s this hard core of nature that we want. But is there one? Yes, indeed. There are solid core grammars underlying the structure of visual and auditory nature—there are “universals” in the structure of nature. That’s what I’ll endeavor to show you in this book; and then I’ll show you that language and music mimic these cores. I’ll give you a preview of these hard cores in the next section, when I introduce the central tenets of the theory—that is, when I first reveal the big secrets.
The Reveal
At this point, we have discussed the possibility that language’s and music’s mimicry of nature could be what enabled us humans to acquire language and music. We also took up why, if this were so, it would not be obvious to us. And in the section just above, I made myself clearer about the role that “nature” will play in understanding the origins of language and music: the goal is to find fundamental principles underlying the structure of nature (as opposed to a catalog of savanna paraphernalia), so that we’ll be in a strong position to ask whether these fundamental principles underlie language and music.