Dancing (movement with sound) is universal in human cultures, but the form it takes is not.
There is an argument to made against the proposition that language exists, or at least exists in the sense of being something that can be studied. It says that individual languages can be studied, and abstract generalizations can be made, but those generalizations don’t have predictive power about any particular language. They cannot because each language has its own constelation of unpredictable properties..
This argument is set forth by Morten Christiansen in a paper he has posted on the internet. In it he treats language as purely a cultural phenomenon. “Language has been shaped by cultural evolution to fit domain-general constraints deriving from the human brain” (p. 2). Domain-general constraints include “socio-pragmatic considerations, the nature of our thought processes, perception-motor factors, as well as cognitive limitations on learning, memory and processing” (4). In other words, there is no faculty of language in the narrow sense of some brain module that contributes only to linguistic production.
Furthermore, he rejects Chomsky’s distinction between internalized language (I-language) and externalized language (E-language). Christiansen argues that what is internalized comes from listening and whatever neural wiring there is governing language “cannot be understood independently of the input that gave rise to [it]” (7).
Fifty plus years after Chomsky sank Skinner the argument is still with us—is language instinctive or is it learned? Christiansen’s argument is far more sophisticated than Skinner’s and this sophistication is due very much to Chomsky’s intervention which forced everybody to grant that language is much more complex than can be allowed by learning by conditioning based solely on stimuli and rewards. In Skinner’s day, the language learner was considered a “black box,” an entity whose internal workings were unknown and irrelevant to the learning process. Today’s learning-based approach gets rid of the black box and puts heavy emphasis on what the brain has to contribute generally to learning.
So how do children learn to talk if they are not pre-wired for it? Christiansen proposes two answers. First, the language is adapted to the brain. This idea has been around since at least the early 1990s and was given wide prominence by Terrence Deacon. It states that people learn what they can learn and forget what they cannot; thus, by its very nature, anything passed down from generation to generation will be learnable. This idea has been discussed many times on this blog and I have also reported a variety of experiments in which “generations” begin with difficult-to-learn languages and turn them into much more regular, readily learned systems.
A second aid to learning proposed by Christinasen is a set of cues that help a listener, including a learner, follow spoken language. I was surprised to see this line of argumentation as it means Christiansen has given up on the idea that adapting to the brain is sufficient to explain language’s learnability.
A similar, yet different, idea was introduced some years ago back when Chomsky’s forces recognized that languages are too different for everything about them to be built into the brain. Chomskyans proposed a series of parameters that could define any language, but that system failed when parameters proved insufficient as predictors of the idiosyncrasies in an entire language. Now Christiansen proposes that cues such as the number of phonemes in a word or an initial th phoneme can be informative to English speakers.
I’m not going to discuss Christiansen's experiments and calculations because this is an idea that requires extensive examination and testing. At this point it is sufficient to note that he argues “every language today incorporates its own unique constellation of probabilistic cues to signal different aspects of structure” (8), e.g., some cues signal an English learner that a word is a noun and some a verb. I do wonder how those distinctions mean anything to a three-year-old.
In this approach “natural language is a culturally evolved multiple-cue integration system” (12). That description puts the theory at the polar opposite of the Chomskyan approach that sees natural language as an inborn biological faculty. Both parties leave room for the other side to play a role. Thus, Christiansen says is account “does not preclude that there may have been biological adaptations for aspects of language that would improve learning and use” (21). Meanwhile, Chomsky and his fellows grant that the language faculty may include domain-general capacities, and of course they grant that each language has a cultural history of its own. But in both cases their focus is on one side or t’other of the biology/culture divide. They are not focused on a co-evolutionary hypothesis.
My favorite argument against cultural evolution alone is the adaptation to vocalization shown by the human body. The control of lips and tongue is quite unlike the less precise control available to apes. The loss of air sacs in the throa, common to all apes,t widened the range of sounds available to speakers. The FOXP2 gene supports delicate motor control of vocalizations. Work with songbirds has shown that birds have special modules for learning their local song, and their foxp2 contributes to this process. That work makes for a very strong hypothesis that humans learn the phonology of their local language through a combination of cultural input and specific biological adaptations that promote the learning. The biological and cultural components are so mixed that neither leads, neither trails.
The co-evolution approach seems like the only way we shall ever escape the learning vs instinct battle that rages interminably in linguistics. However, this approach is not neutral in its implications. Co-evolution requires a long time, much longer than the 200 thousand years of Homo sapiens. In a co-evolutionary scenario, some biological adaptations are adaptations to existing speech.
For example, we know from work with Washoe and other signing chimps that apes were smart enough to make basic phrases, but no ape has come close to signing a sentence formed by subject + predicate. Thus, there is no reason to assume that the first speakers could unite two distinct foci of attention by means of a verb.
The move from phrases to sentences is hard to justify In either a culture-alone or a biology-alone scenario. Cultural selection should reduce an innovation as intellectually unusual as a true sentence because too many people will be unable to reproduce it accurately. Biological evolution has a similar problem. Why would an accidental mutation permitting the creation of a true sentence? You are not allowed to give a cultural rreason for selection, some sort of group use or benefit. Biolog-alone, remember! Some kind of survival or reproductive success is required. You can try to argue that better thinking leads to better survival (or even better sex) but it is very hard to prove such a thing. Meanwhile a co-evolutionary scenario can posit a human lineage speaking phrases (like a two-year old) and then undergoes biological changes that permit constructionn of subject + predicate sentences while culture finds a way to make use of the biological change. In co-evolution one side can produce a change while the other side can provides a basis for selection.
Besides language, there are many traits that seem distinctively and universally human (suggestive of biological support) and yet are highly variable within the human world (suggestive of cultural shaping). Take your pick—music, dance, humor, morals, language, mythology, family relationships—other animals don’t have them but every culture does. I’m betting on co-evolution as the key to understanding what mamkes humans tick..
If I am reading this correctly, your preferred story of linguistic evolution is via the Baldwin Effect.
A lot of linguists are sympathetic to this point of view (Pinker and Jakendoff being vocal supporters of it). However, it needs to be acknowledged that just like the other two approaches/views you mention (a genetic leap vs. primarily cultural evolution), the coevolution/baldwin effect view point has some very troubling aspects to it.
In fact Christiansen and colleagues have some papers arguing against coevolution [Here's a ref: http://www.pnas.org/content/early/2009/01/21/0807191106.abstract]. The crux of their argument is that there needs to be temporary stability for each step of coevolution to occur; however, the coevolution scenarios sketched miss exactly that requirement. So, even though co-evolution is theoretically possible, the time-line AND stages usually proposed are not sufficient.
One way out, of course if to expand the time-line (as you do in many places) and say in fact that the process started much earlier and lasted much longer than we normally presume it to. This would still leave us to figure out the intermediate stages that allow the possibility to unfold. Whether the stages are sufficient for an account of modern language, and whether they are realistic/supportable or not are the main issues for this line of thinking.
Of course this also means we find a new hero to credit with the "great (sociological/cultural) leap forward" that seems to have happened around 50k years back, instead of language, i.e., if you grant the sociological/cultural leap the status of a fact.
P.S. - from my discussions with (some) generative linguists, many seem to be unwedded to any one of these evolutionary possibilities given that there are serious issues in identifying which of them is the correct view. Which is one of the things that led Chomsky and many others, for a long time to avoid, the issue of how language evovled at all, and instead concentrate on the "let's have a linguistic theory first" approach. This silence on the part of generativists has been intrepreted as a nod towards a massive genetic jump that accounts for all universal properties of languages (related to phonology, morphology, syntax, semantics, pragmatics...); however, in reality, it appears to be that most acknowledge that there seems to be no clear path to distinguish between the many possibilities.
Posted by: Karthik Durvasula | August 29, 2011 at 04:29 PM
Similar to the argument you and Christiansen make, Skinner predicted in 1957 that language would prove intractable if studied as a stand-alone process. That is to say, language does not exist as an orderly phenomenon outside of the social and cultural context of its use. Moreover, the acquisition and manifestation of language is a function of a history of social transactions. Consistent with this view, new research reveals that learning according to Bayesian statistical principles is powerful enough to give rise to breathtakingly complex behavior (Tenenbaum et al., 2011). These Bayesian learning mechanisms appear to be general rather than specific. Interestingly, they apparently apply across activities that are topographically distinct, yet were united by Skinner under the term, Verbal Behavior (behavior mediated by a social context). Many of these activities appear in your list of distinctively and universally human traits: music, dance, humor, morals, language, mythology, family relationships.
Skinner argued that the items on your list are in essence one thing in that they are all governed by a single set of laws. At this stage of the game, are you willing to bet against him?
Posted by: Paul Strand | August 29, 2011 at 05:45 PM
If, say, from next generation human start looses his ability to talk due to some mutation, what going happen with Language?
Is it going parish together with culture and all sorts of linguistic theories?
If your answer is No, you have to think again about eminence, prime and special position of speech/vocal manifestation of Language facility and about the relevance of FOXP2 to this facility.
Investigate Language ability on the merit of ability to speak seems equally unproductive as on ability to write.
-------------------------------------
BLOGGER: I'm more interested in what happened than in what will happen. We ended up as a talking species that includes some signers and writers. How did that happen?
Posted by: Jerry Moore | August 30, 2011 at 02:42 AM
Edmund says: "My favorite argument against cultural evolution alone is the adaptation to vocalization shown by the human body. The control of lips and tongue is quite unlike the less precise control available to apes. The loss of air sacs in the throat, common to all apes, widened the range of sounds available to speakers. The FOXP2 gene supports delicate motor control of vocalizations. Work with songbirds has shown that birds have special modules for learning their local song, and their foxp2 contributes to this process.
This assumes that language (at least in a comparatively full sense) developed alongside, and as a driver to, increased vocalisation. But while different languages use different aspects of vocalisation the full range of human sound controls are far wider than is necessary for language – so could there be a reason for vocalisation developing before a comprehensive language.
Edmund points out that the FOXP2 contributes to both human vocalisation and the ability of songbirds to sing, and this could be very relevant. When our ancestors first descended from the trees into more open areas and switched to hunting they would have needed communication skills to, for instance, coordinate an ambush style hunt. I have just posted "Babel's Dawn and the Evolution of Vocalisation" (http://trapped-by-the-box.blogspot.com/2011/08/babels-dawn-and-evolution-of.html) in which I discuss the possible role of using sounds in hunting – including the imitation of bird calls as signals, and animal calls as a lure. This approach links the need for better vocalisation with the need to get a full belly – which is an excellent evolutionary drive. Of course once our ancestors evolved more local skills they were in a position to use them for other means of communication, initially I suspect, to improve their hunting prowess still further.
Chris Reynolds
Posted by: CodilLanguage | August 30, 2011 at 09:38 AM
I concur with the main thrust of Paul Strand’s earlier comment. Skinner’s conception of verbal behavior as behavior mediated by social context, that is, other people, encompasses ritual behavior, dance, kin and other interrelationships, manner of dress, humor, tool making, religion, moral behavior, etc. Speech (overt and covert), listening, writing, reading, signing, and other linguistic or communicative activities are naturally included and get discussed the most by Skinner. However, I would be rather reticent to concur that a Bayesian probabilistic learning model really captures verbal behavior. It might serve possibly are a descriptive model, but not an explanatory one. In an interview published in book form, entitled B. F. Skinner: The Man and His Ideas, in discussing scientific methodology, Skinner said, “I don’t accept most current analyses of scientific methodology, however, or statistical methods which are taught as if they were the way scientists think.” In his orientation toward the behavior of the individual, he felt that statistical inferences about scientific methodology were a posteriori reconstructions. He would no doubt say the same for putatively claiming that children as well as adults have built-in Bayesian machines that calculated the probability that a certain hypothesis about “how language works” was more likely than another. All that this does is turn Chomsky’s formalistic language acquisition device into a probabilistic language acquisition device. Skinner recognized that real contingencies were involved in verbal behavior, but didn't think any statistical model could adequately encapsulate them
Posted by: Raymond Weitzman | August 30, 2011 at 04:59 PM