1.2 Linguistic variables
What is a linguistic variable?
- The classical view: Labov (1972: 7-8) explains that the linguistic variable should be salient on several levels:
- It occurs often enough that small data samples could already hint at its distributional idiosyncrasies.
- Previous research hints at the influence of extra-linguistic factors.
- On the level of the individual speaker, a limited awareness of the variable realisations is desirable, but should not be too high, lest it skew the nature of their linguistic output.
- The variable is deeply embedded in the language system.
A restriction: Meyerhoff (2009: 11) summarises: “In sum, a sociolinguistic variable can be defined as a linguistic variable that is constrained by social or non-linguistic factors […]”
A more open view: Kiesling (2011) argued, “Given the variability of what counts as a variable, we must define what counts as a variable more broadly than ‘two or more ways of saying the same thing’. We will simply say that a linguistic variable is a choice or option about speaking in a speech community… Note that this definition does not in any way require us to state that the meaning be the same, although there should be some kind of equivalence noted.”
Subtypes of variables
Linguistic perspective
- Phonetic/phonological
- Morphological
- Syntactic
- Pragmatic
Sociolinguistic perspective
Sociolinguistic variables also differ with regard to their salience in society.
- Stereotypes are strongly socially marked and part of popular discourse about language.
- h-dropping in Cockney
- Canadian eh at the end of sentences
- Australian dinkum: I was fair dinkum about my interest in their culture ‘authentic, genuine’
- Markers show both social and style stratification; all members of a society react similarly in taking care to avoid the pattern in formal registers.
(r)
(th)
- Indicators differentiate social groups. However, people are not aware of them and therefore do not avoid them in formal registers.
- Same vowel in God and Guard in New York City
Cf. Mesthrie (2011).
Many morphosyntactic variables in English
The table lists some of the most well-known (morpho-)syntactic alternations in English. For an up-to-date overview with various case studies, see Szmrescanyi & Grafmiller (2023).
| Variable | Example |
|---|---|
| Indefinite Pronouns | everybody vs. everyone |
| Case and order of coordinated pronouns | my husband and I vs. my husband and me vs. me and my husband |
| that vs. zero complementation | I don’t think that/Ø it’s a problem. |
| that vs. gerundial complementation | remember that vs. remember V-ing; try to vs. try and vs. try V-ing |
| Particle placement alternation | set the computer up vs. set up the computer |
| The dative alternation | give the book to John vs. give John the book |
| The genitive alternation | John’s house vs. the house of John |
| Relativization strategies | wh-word vs. that vs. Ø |
| Analytic vs. synthetic comparatives | warmer vs. more scary |
| Plural existentials | there are some places vs. there’s some places |
| Future temporal reference | will vs. going to vs. progressive etc. |
| Deontic modality | must vs. have to vs. need to vs. got to etc. |
| Stative possession | have vs. have got vs. got |
| Quotatives | say vs. be like vs. go etc. |
| not vs. no | not anybody vs. nobody; not anyone vs. no one; not anything vs. nothing |
| NOT vs. AUX contraction | that’s not vs. that isn’t etc. |
Cf. Gardner et al. (2021).
A statistical perspective
Sociolinguistic variables are fundamentally sets of discrete (i.e., clearly delimited) outcomes. This means each token in the data represents a choice among several possible, mutually exclusive variants. There may be two outcomes (binary: binomial) or more (multinomial), depending on the specific variable under study.
The distribution of outcomes for sociolinguistic variables is never truly continuous. Continuous variables like Age or vowel formant measurements such as F1 are treated separately – sociolinguistic variation typically focuses on the selection among categorical alternatives.
Quantitative analyses attempt to model the probabilities of these binomial or multinomial outcomes. Some common statistical questions involve:
Are all outcomes equally likely?
If we condition the sociolinguistic variable on another independent variable (such as
Gender,Variety,Social Class, …), does its probability distribution change?Are there any statistically meaningful patterns (hypothesis testing)?
Exercises
Exercise 1 Which of the following variables could be considered ‘good’ sociolinguistic variables, and which of them poor ones? Justify your answer.
- /fɔːθ flɔː/ vs. /fɔːrθ flɔːr/
- This enables him to preside over the process which I have described vs. This enables him to preside over the process that I have described vs. This enables him to preside over the process ∅ I have described.
- The pair found the briefcase on a bus station bench at Bath central bus station. vs. The briefcase was found on a bus station bench at Bath central bus station by the pair.
- Art is after all the subject of attention for both critic and historian, even though the functions and methods of the two sorts of writer have drawn apart. vs. Art histories often make an attempt to keep to chronology, although the difficulties include the crucial fact that in art there is no clear sequence of events. vs. Many of his readers approved his sensitive and appreciative understanding of paintings, though without sharing his political views.
- /pleɪɪŋ/ vs. /pleɪɪn/
- [tʰ] in /tɔp/ vs. [t] in stop.
Exercise 2 Two linguists aim to study the preference for passives among men and women. They extract all the passives from 500,000 words of male speech and all passives from 500,000 words of female speech and report the results. What is wrong with this approach?