Efficacy in deceptive vocal exaggeration of human body size

Pisanski, Katarzyna; Reby, David

doi:10.1038/s41467-021-21008-7

Published: 12 February 2021

Nature Communications volume 12, Article number: 968 (2021)

Abstract

How can deceptive communication signals exist in an evolutionarily stable signalling system? To resolve this age-old honest signalling paradox, researchers must first establish whether deception benefits deceivers. However, while vocal exaggeration is widespread in the animal kingdom and assumably adaptive, its effectiveness in biasing listeners has not been established. Here, we show that human listeners can detect deceptive vocal signals produced by vocalisers who volitionally shift their voice frequencies to exaggerate or attenuate their perceived size. Listeners can also judge the relative heights of cheaters, whose deceptive signals retain reliable acoustic cues to interindividual height. Importantly, although vocal deception biases listeners’ absolute height judgments, listeners recalibrate their height assessments for vocalisers they correctly and concurrently identify as deceptive, particularly men judging men. Thus, while size exaggeration can fool listeners, benefiting the deceiver, its detection can reduce bias and mitigate costs for listeners, underscoring an unremitting arms-race between signallers and receivers in animal communication.

Introduction

The honest signalling paradox has perplexed researchers in animal communication for five decades^{1,2,3,4,5,6,7,8,9}. Indeed, why should receivers pay attention to deceptive signals if these signals contain little to no reliable information? Conversely, if receivers detect and correct for deception, why should signallers continue to produce potentially costly deceptive displays?

At the crux of this paradox lies the inherent conflict of interest between signallers and receivers. While earlier theories saw animal communication as a cooperative exchange of information¹⁰, the production of animal signals during communication between unrelated individuals is now predominantly regarded as a selfish behaviour^2,10, whereby signallers attempt to manipulate the responses of receivers. Crucially, however, selection also operates on receivers to evade deception, for example by leading them to ignore deceptive signals or to recalibrate their responses. Indeed, both individuals in a dyadic exchange are expected to behave in ways that maximise their own fitness², giving rise to an evolutionary arms race that is most apparent when interests diverge (e.g., mate choice) or are entirely opposed (e.g., resource contests)¹¹. Yet, even in ostensibly cooperative contexts such as alarm calling¹², signallers may stand to gain substantial fitness benefits by exaggerating (or in some cases, attenuating¹³) signals, if such deceptive signals succeed in eliciting a beneficial response from receivers^6,7.

To remain ‘evolutionarily stable’¹⁴, signals must confer net fitness benefits to both senders and receivers², and thus should be generally reliable or ‘honest on average’^4,15. Even a putatively deceptive signal should be reliable enough to remain beneficial for receivers to attend to it. In other words, not only can deception and reliability coexist, deception depends on reliability, because without some element of truth a signalling system would collapse⁶. Signal reliability can be imposed by a number of mechanisms, including anatomical or physiological constraints (e.g., by-product information¹⁶ or honest indices¹⁵), developmental or metabolic costs^1,17, and reputation or retaliation costs^9,18. Constraints and costs, if high enough, can enforce signal honesty. Nevertheless, even a presumably reliable signal will be ‘incompletely honest’⁷ due either to deceptive processes that must operate within these constraints, including anatomical adaptations to the vocal apparatus in many mammals¹⁹, or nondeceptive processes, including developmental noise²⁰, communication errors, or environmental signal degradation⁷. While challenging⁶, it is imperative to dissociate deceptive from nondeceptive processes that can independently degrade signal reliability if we are to truly understand the evolution of a signalling system⁷.

Signal reliability is often measured as the strength of the correlation between the signal and the intended information^6,7. To gauge signal reliability we must establish (a) if the signal is reliable enough that a receiver will generally benefit by attending to it; (b) the constraints or costs that impose this degree of honesty; and perhaps most critically, (c) how receivers respond to the signal, including detecting and compensating, at least in part, for any deception⁶. Two major obstacles have limited the extent to which we can answer these questions using animal models: uncertainty in what a signal is actually intended to convey and uncertainty in what a receiver is attending to⁶.

We propose that studying deception in human communication signals offers a promising solution to these long-standing hurdles. Unlike other animals, humans can produce specific deceptive signals on demand^9,21,22,23, thus eliminating uncertainty in the signal’s intended function and allowing researchers to pin-point the contribution of deception in signal reliability. Moreover, researchers can directly measure the effects of this deception on human receivers using controlled psychoacoustic experiments, offering a full picture of the signaller–receiver communication chain (Fig. 1). Here, we apply this paradigm to study an ecologically relevant vocal signalling system observed across the animal kingdom—vocal communication of body size.

**Fig. 1: Human vocal communication of body size.**

Given the often substantial fitness benefits of a large body size, especially for males who gain access to resources and mates^11,24 (tall men included²⁵), it is unsurprising that many species of mammals, birds, fish, reptiles, amphibians, and arthropods have evolved anatomical or behavioural adaptations to exaggerate their apparent size^{6,19,26,27,28}. For example, the already-descended larynx of red deer stags²⁹ is lowered further still during roaring contests with rival males³⁰, extending the vocal tract even more to produce abnormally low formant frequency spacing (∆F, the overall spacing between any two consecutive formants in the frequency domain, see Fig. 1) given the animal’s true size^27,31.

Humans also possess a descended and sexually dimorphic larynx, with men boasting longer vocal tracts (reduced ∆F) and longer vocal folds (lower fundamental frequency or pitch, f_o) than women^28,32,33 (Fig. 1). Although ∆F scales allometrically with vocal tract length (VTL) and thus predicts body size, both between and within adult sexes, f_o is a poor predictor of human height at the intrasexual level^28,33,34 (see Fig. 1). Yet, despite strongly associating not only ∆F but also f_o with physical largeness^25,35,36, listeners can gauge relative body size from modal speech and nonverbal vocalisations^25,36,37. Critically, however, while we have recently shown that men and women can behaviourally lower their voice ∆F and f_o to further exaggerate their body size and strength^38,39, remarkably little is known about the role of such deception in size communication.

In this study, we combine acoustic analysis of vocal signals, produced by men and women attempting to sound physically larger or smaller, with a series of psychoacoustic playback experiments conducted on a representative sample of 200 human listeners. Their task was to judge the heights of these vocalisers, and to attempt to discriminate among honest, exaggerated, or attenuated vocal signals of body size. Using this innovative approach we address long-standing questions about the evolution of deceptive signals in animal communication: Does deceptive size signalling retain an element of honesty? Can listeners detect size deception and do they correct for it when judging height? If so, does it still benefit vocalisers to exaggerate their perceived size? Are there sex differences, as predicted by sexual selection^11,25, in the production and perception of exaggerated signals? Our results offer a unique lens into the conflict between deceivers and receivers, showing that while deceptive vocal signals can effectively bias listeners’ judgements of body size, such signals remain constrained and thus retain some reliable information. Specifically, we show that listeners often correctly discriminate between honest and deceptive vocal signals, and that when they do detect deception, they can recalibrate their height judgements accordingly. This research reveals that although listeners are not systematically fooled, it still pays off to deceive.

Results

Voice frequency shifts in size deception

As a first step toward answering our key research questions, we analysed speech signals (vowels /α i ɛ o u/) produced by men and women tasked with sounding physically larger and smaller than their true body size³⁸. Acoustic analyses performed in Praat⁴⁰ (see Methods) confirmed that both sexes volitionally lowered their voice fundamental frequency (f_o) and formant spacing (∆F) to deceptively exaggerate their apparent body size, and raised both frequency parameters to attenuate it, relative to their unmodulated (herein ‘honest’) vocal signals.

Men, by extending their apparent VTL more extremely than did women, lowered their formants more (Supplementary Tables 1 and 2), which is expected to simulate a larger body size (Fig. 1). In contrast, men did not raise their formants significantly more than did women to sound smaller, nor did they shift their voice pitch (f_o) more (LMM, Supplementary Table 2). The observed sex difference in formant shifts cannot be attributed to sexual dimorphism in starting VTL as controlling for baseline VTL or ∆F showed that the mean percentage change in men’s formant shifts was still three to four times greater than that of women (Supplementary Table 1). Even within sexes, taller men likewise shifted their formants during size deception more than did shorter men, wherein relative differences in men’s heights explained one-third of the variance in formant shift magnitude for size exaggeration. Again, this relationship was not observed among women, nor for f_o shifts in either sex (Supplementary Table 3). Taken together, these results support the hypothesis that the human male vocal tract may have evolved largely under selection pressure for size exaggeration^21,28, wherein men may perform vocal tract dynamics to maximise their perceived body size, and ultimately, their reproductive success²⁵, while simultaneously preserving phonetic range as illustrated by articulatory models⁴¹.

Formant measures (∆F and apparent VTL) taken from central vowel frequencies in honest vocal signals explained between 14% and 40% of the variance in actual height within sexes, whereas f_o explained virtually none (<3%; Supplementary Table 4). These findings corroborate studies on several other terrestrial mammals^26,27 including humans (see the meta-analysis in ref. ³⁴), where formants but not f_o are anatomically constrained and thus follow a degree of acoustic allometry (Fig. 1). Importantly, formant measures also predicted inter-individual differences in height from deceptive vocal signals, particularly among men exaggerating their size (∆F R² = 0.58) and women attenuating their size (∆F R² = 0.17; Supplementary Table 4). Here too, f_o did not significantly predict individual differences in actual height from deceptive signals (R² = 0.003–0.12; Supplementary Table 4). These acoustic analyses show that reliable formant-based information indicating inter-individual differences in body size is present in the human voice even during size deception, and thus, that listeners may be able to reliably gauge relative size from deceptive vocal signals.

Vocal size deception biases listeners

Do these deceptive signals fool listeners? To answer this imperative question, human adults (Experiment 1: n = 97, aged 18–63, 59 males) completed two psychoacoustic tasks involving (1) judging the absolute heights of vocalisers from their honest and deceptive vocal signals using a sliding metric/imperial scale, and (2) judging whether those same vocalisers were speaking naturally or instead deceptively exaggerating or attenuating their size (see Methods). To avoid cueing listeners to the possibility of vocal deception and thus to maintain ecological validity, listeners judged the height before assessing deception in a separate experimental block (but see Experiment 2).

Linear mixed models (LMMs) confirmed that listeners’ height judgements were indeed biased by size deception when judging both male (F_2,1938 = 299.2, p < 0.001) and female vocalisers (F_2,1938 = 174.3, p < 0.001), with no effects of listener sex (Supplementary Table 5). As illustrated in Fig. 2a, on average, listeners overestimated the height of size exaggerators (estimated marginal means, M 3.4 cm, 95% CI 2.8, 4.1 male voices; M 1.9 cm, 95% CI 1.23, 2.6 female voices) and underestimated the height of size attenuators by an average of 4 cm (M −4.3 cm, 95% CI −5.0, −3.7 male voices; M −4.0 cm, 95% CI −4.7, −3.3 female voices). Men attempting to exaggerate (but not attenuate) their size biased listeners’ judgements more effectively than did women (Fig. 2a; F_2,3872 = 9.9, p < 0.001; Supplementary Table 5), consistent with previous suggestions that the exaggeration of apparent body size is under stronger sexual selection in male than female vocal signals^11,25,26,35, especially in sexually dimorphic species in which males are larger than females¹⁹.

**Fig. 2: Vocal size deception biases judgements of body size (Experiment 1).**

The moderate positive relationship between actual and perceived height in men’s honest vocal signals (r = 0.44) was retained in their exaggerated (r = 0.48) but not attenuated signals (Fig. 2b). In women, honest (r = 0.54) and attenuated (r = 0.50) signals showed the strongest relationships (Fig. 2b). Thus, although listeners’ individual height judgements were systematically shifted up by size exaggeration and down by size attenuation, inter-individual differences in height were preserved in listeners’ judgements. This could be expected based on our acoustic analyses indicating reliable formant-based cues to actual size in deceptive vocal signals (Supplementary Table 4). As further predicted, listeners generally associated lower absolute voice frequencies with larger size, a well-established perceptual association^25,27,36,37. However, this association was stronger for exaggerated than honest or attenuated signals (Supplementary Table 6). Together these results suggest that lowering voice frequencies can function to effectively maximise perceived absolute body size, again to the potential fitness benefit of the size exaggerator^11,25,35.

Listeners can detect deception

In a second task, the same listeners judging the same vocal stimuli correctly detected the presence or absence of size deception above chance (Fig. 3a; chance = 33%, grand mean correct 51% ± 0.007 SEM, 95% CI 49–52%) and more reliably detected deception by men than deception by women during intended size exaggeration (51% ± 0.02 vs 45% ± 0.02) and attenuation (49% ± 0.02 vs 40% ± 0.02) with no differences between male and female listeners (LMM, Supplementary Table 7, Fig. 3a).
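The chance level of 33% follows from the three response options (honest, exaggerated, attenuated). As a rough illustration only, the sketch below compares one hypothetical listener's hit count with that chance level using an exact one-sided binomial tail. The trial counts are invented, and this simple test is not the analysis reported here, which relied on LMMs with random effects for vocalisers and listeners.

```python
from math import comb

def binomial_upper_tail(n_trials, n_correct, p_chance):
    """Return P(X >= n_correct) for X ~ Binomial(n_trials, p_chance)."""
    return sum(
        comb(n_trials, k) * p_chance**k * (1 - p_chance) ** (n_trials - k)
        for k in range(n_correct, n_trials + 1)
    )

# Three response options, so guessing is correct one time in three.
CHANCE = 1 / 3

# Hypothetical listener: 31 correct classifications out of 60 trials.
p_value = binomial_upper_tail(60, 31, CHANCE)
print(f"proportion correct = {31 / 60:.2f}, one-sided p = {p_value:.4f}")
```

Because trials within a listener and within a vocaliser are not independent, a per-listener binomial test like this one overstates certainty when pooled; it is shown only to make the meaning of "above chance" concrete.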

A crucial question then is whether listeners recalibrated their height judgements for voice signals that they were able to correctly detect as deceptive in a subsequent task. As illustrated in Fig. 3b (LMMs, Supplementary Tables 8 and 9), honest signals correctly identified as such were indeed associated with reliable height judgements overall (i.e., error near 0, M −0.035 cm, 95% CI −0.8, 0.78 male voices; M 0.009 cm, 95% CI −0.87, 0.89 female voices), whereas honest signals misidentified as deceptive were underestimated (M −2.6 cm, 95% CI −3.6, −1.6 male voices; M −1.7 cm, 95% CI −2.7, −0.7 female voices). Surprisingly, however, listeners’ judgements of height were maximally biased for deceptive signals they correctly identified as deceptive in a subsequent task (Fig. 3b, c).
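The comparison described above amounts to splitting height-judgement error (perceived minus actual height) by voice condition and by whether the listener classified the signal correctly. A minimal sketch of that grouping follows; the record layout and field names are hypothetical, and it returns raw group means rather than the LMM-estimated marginal means reported in the text.

```python
from collections import defaultdict
from statistics import mean

def mean_error_by_detection(trials):
    """Average height error (cm) per (condition, detected_correctly) group.

    Each trial is a dict with the hypothetical keys:
      condition    - true voice condition of the stimulus
      response     - condition the listener selected in the detection task
      perceived_cm - height the listener assigned
      actual_cm    - measured height of the vocaliser
    """
    groups = defaultdict(list)
    for t in trials:
        detected = t["response"] == t["condition"]
        groups[(t["condition"], detected)].append(t["perceived_cm"] - t["actual_cm"])
    return {key: mean(errors) for key, errors in groups.items()}

# Invented example records, for illustration only.
example = [
    {"condition": "exaggerated", "response": "exaggerated", "perceived_cm": 181, "actual_cm": 180},
    {"condition": "exaggerated", "response": "exaggerated", "perceived_cm": 183, "actual_cm": 180},
    {"condition": "exaggerated", "response": "honest", "perceived_cm": 185, "actual_cm": 180},
]
print(mean_error_by_detection(example))
```

In Experiment 1 the two responses come from separate blocks, so each record would be built by joining a listener's height judgement and detection response for the same stimulus.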

These results indicate that listeners remain partly ‘fooled’ by vocal deception, and were in fact most fooled by voices they could correctly identify as deceptive in a separate task. We examined the possibility that detectable deceivers shifted their voice frequencies more than those who remained undetected, which could increase detection while also maximally exploiting deep-seated sound-size correspondences between low frequencies and largeness^25,27,36,37. Although women who were detected as exaggerating did indeed shift their ∆F and f_o more than those who remained undetected (Supplementary Table 10), such acoustic differences were not observed among men nor in the context of size attenuation. Hence, we tested a second possibility that the listeners in this experiment, who were not ‘primed’ to contemplate deception when judging the heights of vocalisers, may have failed to detect deception at that stage, and thus to recalibrate their judgements.

Awareness of deception reduces bias

To test this possibility, we conducted a second psychoacoustic experiment on an independent sample of listeners (Experiment 2: n = 98, aged 18–71, 59 males). The experiment was identical to the first, except that listeners now completed both tasks concurrently, first indicating whether or not they perceived a vocaliser as deceptively altering their size, and then judging the height of that same vocaliser within the same trial (see Methods). The results confirmed our prediction (Fig. 4; LMMs, Supplementary Tables 11–14). Indeed, listeners primed to seek deceit were substantially less biased by vocal size deception, overestimating or underestimating the heights of vocal deceivers by about half as much as listeners in Experiment 1 (Fig. 4a vs Fig. 2a and Supplementary Table 11). Although listeners in both experiments detected size deception with similar accuracy (Fig. 4b vs Fig. 3a and Supplementary Table 12), critically, listeners in Experiment 2 more effectively recalibrated their height judgements for vocal signals they correctly identified as deceptive (Fig. 4c, d vs Fig. 3b, c and Supplementary Tables 13, 14). This effect was most pronounced for size exaggerators who, when correctly detected as cheating, failed to fool listeners into perceiving them as much larger than their true body size (Fig. 4c, d). Moreover, male listeners recalibrated their size judgements after correctly detecting deception more effectively than did female listeners, specifically when assessing the body size of other men (Fig. 4d left panel and LMM, Supplementary Table 14).

**Fig. 4: Awareness reduces bias: Listeners recalibrate height judgements for signals correctly and concurrently detected as deceptive (Experiment 2).**

Discussion

Our results provide rare insight into an evolutionary arms race between signallers and receivers, addressing several long-standing questions aimed at resolving the honest signalling paradox^{1,2,3,4,5,6,7,8,9}. First, is honesty retained in deceptive signals? Yes. Presumably due to anatomical constraints on VTL^26,28 limiting the extent of deception, the relationship between formant frequencies and true body size (acoustic allometry) is preserved, albeit shifted, during vocal size deception. The resulting reliable cues to inter-individual differences in height ostensibly allowed listeners in the present study to ‘rank’ the relative heights of cheating vocalisers. Second, can listeners detect deception? Yes. Listeners correctly identified vocal signals as honest or deceptive much more often than expected by chance, though they nevertheless failed to detect deception approximately half of the time. Third, do listeners recalibrate their height judgements when they detect deception? Yes, largely, and much more if they are primed to seek deception. Indeed, in Experiment 2, height judgements were significantly less biased for vocal signals correctly and concurrently detected as deceptive. Fourth, and crucially, does it still pay to deceive? Yes. We show that vocalisers who attempted to alter their apparent body size by shifting the frequency parameters of their voice effectively fooled listeners, who indeed perceived them as taller or shorter, by several centimetres on average. Although primed detection of deception reduced this bias, listeners’ height judgements remained biased overall, as deceit often went undetected.

Our results also show that males more effectively exaggerate their body size through voice modulation than do females. Despite being detected as cheating more often than women, we show that men, who stand to gain relatively greater fitness benefits by successfully exaggerating their size^11,25,35, shifted their formant frequencies more than did women. Men thus simulated a longer vocal tract and larger body size, and more effectively biased listeners to overestimate their height. From the receiver’s side, we also found that, when primed to the possibility of deception, male listeners were less susceptible than female listeners to deceptive signals produced by other men if they correctly detected them as exaggerating or attenuating their body size. The specificity of this effect to the second experiment suggests that male listeners are particularly attuned to the deceptive signals of other men when an explicit competitive context is induced. Together these findings support the prediction that pressure on listeners to counteract deception by recalibrating size judgements for deceptive signals may be maximised in the context of male–male competition. Finally, our finding that listeners can effectively gauge the relative heights of deceivers predicts an asymmetry in the impact of deception on male and female listeners⁶. Indeed, assuming all males exaggerate, females should retain the ability to rank relative male quality, which is crucial in mate choice. In contrast, males may overestimate the absolute size of exaggerating competitors when deception goes undetected. In male–male competition, the size and strength of a rival compared to oneself is critical³⁷, and thus males may overvalue the cost of continued conflict. This sexual asymmetry is again consistent with the dominant view that male–male competition is the primary mechanism of selection on men’s sexually dimorphic traits⁴², including voice pitch⁴³, and here, a probable key driver of size exaggeration.

Honesty has been identified in exaggerated vocal signals of other species, such as red deer^29,30,44 and koalas^45,46 who have both permanent and behavioural adaptations for size exaggeration and where physical constraints enforce reliability. Hypothetically this may lead receivers to adapt to deception by shifting their perceptual scale to exaggerated ranges. Here, we show that absolute deception remains effective, indicating that such perceptual shifts are only partial. We suggest that similar effectiveness may be predicted in nonhuman signalling systems where deception is behavioural and facultative, and especially where constraints enforce a degree of reliability in the signalling of interindividual differences.

Body size exaggeration is suspected in numerous species across a range of taxa;^6,8,19,26 however, to our knowledge, its effectiveness in biasing listeners had never been established. While researchers often generalise animal models to humans, our work shows that studying the human animal can answer key questions about animal behaviour that are otherwise difficult to tackle (see also ref. ⁴⁷). At the same time, humans are exceptionally complex. Beyond body size, human voice fundamental and formant frequencies predict a range of biologically and socially relevant traits, such as dominance, strength, and attractiveness^25,35,48, that can interact cross-modally with visual and olfactory cues to influence listeners’ perceptions of the signaller. Voice modulation can also communicate motivation (e.g., aggressive intent) rather than physical traits per se^49,50. Research into vocal deception of a wide range of traits⁵¹ and states⁵⁰, particularly in multimodal real-world contexts⁵², is needed to further elucidate the functions and tangible consequences of deceit in complex social environments. Humans also possess an unprecedented capacity for volitional vocal control²¹ and an advanced theory of mind. While intentionality is not necessary for deceptive signalling to evolve (i.e., functional deception⁵), this has nevertheless led many researchers to suggest that ‘human communication is permeated with deceit’⁶, perhaps more so than the communication systems of other animals. Indeed, our results suggest that priming listeners to deception can substantially decrease its effectiveness and thus reduce potential costs of being deceived for cognisant listeners. Further comparative work is needed to elucidate the cognition of deception⁸, in humans and nonhuman animals.

Methods

Vocal stimuli

Vocal stimuli were derived from 40 adult English speakers: 20 men (mean age 19.6 ± 2.4 sd) whose heights ranged from 161 cm to 187 cm (mean height 178.4 ± 7 cm sd) and 20 women (mean age 19.1 ± 1.6 sd) whose heights ranged from 147 cm to 185 cm (mean height 164.9 ± 7.9 cm sd), taken from a larger sample of vocal stimuli (see ref. ³⁸). The stimuli were selected to be representative of a broad range of heights: the height distributions of male and female vocalisers closely parallel those observed in large cross-cultural samples of adults (men 178 ± 6.58 cm, n = 1334; women 165.96 ± 6.64 cm, n = 871)³⁴. Vocalisers were recorded in an anechoic sound-controlled chamber with a Sennheiser MKH 800 cardioid condenser microphone at a distance of approximately 10 cm. Voice stimuli consisted of a series of monophthong vowels (/α/, /i/, /ɛ/, /o/, /u/). Each vocaliser produced the vowels in three conditions, beginning with a baseline ‘honest’ condition in which they spoke the vowels in their natural voice. They were then asked to reproduce the vowels while sounding physically large, and again while sounding physically small, in a counter-balanced order across participants³⁸. No further instructions were given. Recordings were digitally encoded with an M-Audio Fast Track interface at a sampling rate of 96 kHz and 24-bit amplitude quantisation, and stored onto a computer as PCM WAV files. This procedure resulted in 120 vocal stimuli (40 vocalisers × 3 voice conditions).

Acoustic analysis

Fundamental frequency (f_o) and the first four formant frequencies (F₁–F₄) were measured using the well-established autocorrelation algorithm (f_o range 30–500 Hz for men and 65–600 Hz for women) and Burg linear predictive coding algorithm (max formant 5000 Hz for men and 5500 Hz for women) in Praat acoustic software⁴⁰. Formant measures were taken from the mean centre frequencies of each vowel and averaged within vocalisers and voice conditions. From F₁ to F₄ we computed formant spacing (∆F), a measure of the distance among adjacent formants, and apparent VTL^31,34, an estimate of the length of the supralaryngeal vocal tract (Fig. 1), both of which explain the highest proportion of variance in height among humans³⁴ and many other mammals²⁷ within sex–age classes. Mean f_o measures were additionally transformed into equivalent rectangular bandwidth units (ERB, where E_i = 21.4 × log₁₀(0.00437 × f_i + 1))⁵³ and formant measures were transformed into Bark units (where Z_i = 26.81/(1 + 1960/f_i) − 0.53)⁵⁴. These quasi-logarithmic scales control for any difference between the physical and perceived properties of these frequencies. However, due to extremely strong collinearity between the two measures of mean f_o (Hz and ERB, r = 0.99) and among various formant measures (∆F and VTL in Hz and Bark, r = −0.99), we found virtually identical results regardless of which measure was used for each given vocal parameter, and thus, data are presented for ∆F and f_o in Hz only.
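The two frequency transforms given above can be written directly as functions. The sketch below also shows one common way to derive ∆F and apparent VTL from F₁–F₄, under assumptions that are not stated in this section: a uniform-tube model in which the i-th formant sits at (2i − 1)/2 × ∆F, a least-squares fit through the origin, and a speed of sound of 35,000 cm/s. The exact estimator and constant used in the cited methods may differ.

```python
import math

# Assumed speed of sound in the vocal tract (cm/s); not given in the text.
SPEED_OF_SOUND_CM_S = 35000.0

def hz_to_erb(f_hz):
    """ERB-rate transform from the text: E = 21.4 * log10(0.00437 * f + 1)."""
    return 21.4 * math.log10(0.00437 * f_hz + 1.0)

def hz_to_bark(f_hz):
    """Bark transform from the text: Z = 26.81 / (1 + 1960 / f) - 0.53."""
    return 26.81 / (1.0 + 1960.0 / f_hz) - 0.53

def formant_spacing(formants_hz):
    """Estimate delta-F (Hz) as the least-squares slope, through the origin,
    of F_i on (2i - 1) / 2. This estimator is an assumption of the sketch."""
    xs = [(2 * i - 1) / 2.0 for i in range(1, len(formants_hz) + 1)]
    return sum(x * f for x, f in zip(xs, formants_hz)) / sum(x * x for x in xs)

def apparent_vtl_cm(delta_f_hz):
    """Apparent vocal tract length (cm) for a uniform tube: c / (2 * delta-F)."""
    return SPEED_OF_SOUND_CM_S / (2.0 * delta_f_hz)

# Idealised 17.5 cm tube: formants fall at 500, 1500, 2500 and 3500 Hz.
delta_f = formant_spacing([500.0, 1500.0, 2500.0, 3500.0])
print(delta_f, apparent_vtl_cm(delta_f), hz_to_erb(120.0), hz_to_bark(500.0))
```

Because apparent VTL is inversely proportional to ∆F under this model, the two measures are almost perfectly negatively correlated, which is consistent with the r = −0.99 noted above.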

Listeners

Two-hundred adult listeners took part in one of two psychoacoustic experiments. Sample sizes for Experiment 1 were determined prior to testing to achieve an average of 50 height judgements per vocal stimulus for a statistical power of 80%, in order to detect a small-to-medium effect size in regressions between perceived and actual vocaliser height. While high inter-rater agreement (alphas > 0.80, ps < 0.001) among listeners is typically achieved with relatively small sample sizes (e.g., less than 15 listeners per sex for voice-based judgements of dominance or attractiveness⁴³), earlier studies on human vocal communication of body size have generally failed to find significant correlations between perceived and actual height in one or both sexes of vocalisers with samples of fewer than 25 listeners per vocal stimulus^55,56,57. In Experiment 2, tasks were conjoined, and thus participants rated all vocal stimuli (see Psychoacoustic playback experiments). English-speaking listeners were recruited via Amazon’s online recruitment platform, Mechanical Turk. The use of headphones was mandatory. In Experiment 1, three participants did not finish the study and were thus excluded from analyses. In the final sample (n = 97), 59 participants indicated male gender (aged 18–63, 9.2 sd), 36 indicated female gender (aged 18–63, 11.1 sd), and two indicated their gender as ‘other’. In Experiment 2, two participants provided random responses and were thus excluded from analyses. In the final sample (n = 98), 59 participants indicated male gender (aged 21–71, 9.7 sd) and 39 indicated female gender (aged 18–55, 9.4 sd). This research was approved by the Institutional Ethics Committee (C-REC; Certificates of approval: ER/KP292/11 and ER/REBY/12). All participants provided informed consent and were reimbursed monetarily at the recommended rate of $0.13 USD per minute⁵⁸.

Psychoacoustic playback experiments

Listeners completed a short demographic questionnaire and took part in one of the two psychoacoustic playback experiments, custom designed in Syntoolkit⁵⁹, each involving two tasks: judging vocaliser height and detecting vocal size deception. In Experiment 1, listeners performed these two tasks in separate, consecutive blocks. In each task listeners rated the voices of a random sample of 10 male and 10 female vocalisers, in each of three voice conditions (honest, exaggerated, and attenuated size), resulting in 60 trials per task, or a total of 120 trials per listener. Height judgements preceded the deception detection task so as not to prime nor bias listeners toward contemplating deception when judging height, thus reflecting a more ecologically valid experimental design. Importantly, the same voice stimuli were presented in both tasks within listeners to allow for meaningful comparisons. In Experiment 2, an independent sample of listeners performed the same tasks; however, the tasks were now performed concurrently for each vocal stimulus. Thus, listeners first indicated whether or not they perceived a vocaliser as deceptively altering their voice to sound larger or smaller, and then judged the height of that same vocaliser, within the same experimental trial. Listeners judged all 20 male and 20 female vocalisers, in each of three voice conditions (honest, exaggerated, and attenuated size), for a total of 120 trials per listener. In both Experiments, listeners were presented with a single vocal stimulus on each trial. Voice stimuli were blocked by the sex of the vocaliser, and block order and stimulus presentation within each block were randomized. Listeners were instructed to wear headphones and not to adjust their volume settings throughout the experiment; this was verified during debriefing.
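The Experiment 1 design described above (10 male and 10 female vocalisers sampled at random, three voice conditions each, stimuli blocked by vocaliser sex, with block order and within-block order randomised) can be sketched as a trial-list generator. This is an illustration only: the function name, identifiers and tuple layout are hypothetical and do not reflect how the experiment was actually implemented in Syntoolkit.

```python
import random

CONDITIONS = ("honest", "exaggerated", "attenuated")

def build_task_trials(male_ids, female_ids, n_per_sex=10, rng=None):
    """Return one task's trials as (sex, vocaliser_id, condition) tuples."""
    rng = rng or random.Random()
    blocks = []
    for sex, ids in (("male", male_ids), ("female", female_ids)):
        # Random sample of vocalisers of this sex for this listener.
        sampled = rng.sample(list(ids), n_per_sex)
        block = [(sex, vocaliser, cond) for vocaliser in sampled for cond in CONDITIONS]
        rng.shuffle(block)   # randomise stimulus order within the block
        blocks.append(block)
    rng.shuffle(blocks)      # randomise which sex block comes first
    return [trial for block in blocks for trial in block]

# 20 male and 20 female vocalisers, identified here by arbitrary integers.
trials = build_task_trials(range(20), range(20, 40), rng=random.Random(1))
print(len(trials), trials[:3])
```

To present the same stimuli in both tasks within a listener, as the design requires, the sampled vocalisers would be reused for the second task and only the ordering reshuffled. Passing `n_per_sex=20` gives the 120-trial list of Experiment 2.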

Task 1: Judging height

To indicate the perceived height of a vocaliser, listeners used a vertical sliding bar, which appeared only after a voice stimulus finished playing. As the participant moved the cursor along the vertical sliding bar, the selected height was indicated in both metric (cm) and imperial (feet and inches) units. Maximum and minimum heights were labelled on the top and bottom of the scale, respectively, based on sex-specific distributions of heights in our samples, which correspond closely with those observed in the general population³⁴. Thus, the centre of the sliding bar was set to 178 cm for men and 165 cm for women, with end points corresponding to three standard deviations above and below these means: a range of 156–198 cm for men and 144–186 cm for women.
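As a rough illustration of the response scale, the following Python sketch maps a slider position to a height and converts it to imperial units. The linear mapping, the 0-to-1 position convention, and the function names are assumptions, not details taken from the experiment software; only the end points come from the text. Note that with these end points the midpoint is 165 cm for women but 177 cm for men, so the exact layout of the original men's scale may have differed slightly from this sketch.

```python
# End points of the sliding bar as stated in the text (cm).
SLIDER_RANGE_CM = {"male": (156, 198), "female": (144, 186)}

def slider_to_cm(position, sex):
    """Map a slider position in [0, 1] (0 = bottom, 1 = top) to a height in cm.
    Assumes a linear scale, which the text does not explicitly state."""
    if not 0.0 <= position <= 1.0:
        raise ValueError("position must be between 0 and 1")
    low, high = SLIDER_RANGE_CM[sex]
    return low + position * (high - low)

def cm_to_feet_inches(cm):
    """Convert centimetres to (feet, inches), rounded to the nearest inch."""
    total_inches = round(cm / 2.54)
    return divmod(total_inches, 12)

height_cm = slider_to_cm(0.5, "female")
feet, inches = cm_to_feet_inches(height_cm)
print(f"{height_cm:.0f} cm = {feet} ft {inches} in")  # 165 cm = 5 ft 5 in
```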

Task 2: Detecting deception

To indicate perceived vocal size deception or its absence, listeners were instructed to indicate whether they believed the vocaliser was speaking in their natural voice or changing their voice to sound physically smaller or larger than they actually are. These three options, presented as radio buttons, appeared only after the voice stimulus finished playing.

Statistical analysis

A series of linear mixed models (LMMs) fit by restricted maximum likelihood estimation were used to examine listeners’ height judgements, correct detection of vocal size deception, and the influence of this detectable deception on height judgements. The key variable of interest (size deception: honest, attenuated, exaggerated) was entered in LMMs as a fixed variable. Vocaliser and listener IDs were included as random variables with random intercepts in all models, and vocaliser sex and listener sex were entered as fixed variables in omnibus models. Vocaliser sex consistently showed significant effects; therefore separate LMMs are reported for male and female vocalisers. Where there were no significant effects of listener sex (Experiment 1), listener data were pooled for all analyses; otherwise listener sex was retained in final models where applicable. Full model parameters of LMMs are detailed below each respective output table (see Supplementary Tables 2, 5, 7, 8, 9, 11, 12, 13 and 14). All means, confidence intervals (95% CI), and standard errors (SEM) reported in the paper derive from LMMs. Significant effects in LMMs were further examined using pairwise tests with Šidák correction for multiple comparisons. Multiple and simple linear regression models were employed to examine relationships among continuous variables. Where applicable, height judgements were averaged across listeners for each vocaliser and voice condition. Spearman’s rho (r_s) correlation coefficients were used where data were nonnormally distributed or potentially nonlinear, and these statistics are reported in the paper; Pearson’s r coefficients are given for comparison in Supplementary Tables 3, 4, and 6. Cook’s distances were calculated to identify influential outliers in simple regressions (where D_i > 0.20), and the statistics reported in the paper exclude outliers. Removing or retaining outliers did not affect the direction and general pattern of relationships. We used an alpha level of 0.05 for all statistical tests, and all tests were two-tailed with the exception of simple regressions, where we had a priori directional predictions. Error bars in all figures represent standard errors of the mean (±SEM), whereas 95% CI are given in text. Statistical analyses were performed in SPSS 24 (IBM). Full statistical models and data are available in the supplementary files of this article (see Supplementary Data 1 and Supplementary Information).
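Two of the procedures named above have simple closed forms, sketched here in plain Python for illustration. The analyses themselves were run in SPSS, so this is not the authors' code; the function names and the example data are invented. The Šidák adjustment for m comparisons is p_adj = 1 − (1 − p)^m, and Cook's distance for observation i in a simple regression is D_i = e_i² / (k · MSE) · h_i / (1 − h_i)², where e_i is the residual, h_i the leverage, and k = 2 the number of fitted coefficients.

```python
def sidak_adjust(p_values):
    """Šidák-adjusted p-values for a family of m pairwise comparisons."""
    m = len(p_values)
    return [1.0 - (1.0 - p) ** m for p in p_values]

def cooks_distances(x, y):
    """Cook's distance for each point in a simple linear regression of y on x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    slope = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y)) / sxx
    intercept = mean_y - slope * mean_x
    residuals = [yi - (intercept + slope * xi) for xi, yi in zip(x, y)]
    k = 2  # fitted coefficients: intercept and slope
    mse = sum(e ** 2 for e in residuals) / (n - k)
    distances = []
    for xi, e in zip(x, residuals):
        leverage = 1.0 / n + (xi - mean_x) ** 2 / sxx
        distances.append(e ** 2 / (k * mse) * leverage / (1.0 - leverage) ** 2)
    return distances

# Invented example data: actual vs. perceived height (cm), last point discrepant.
actual = [160, 165, 170, 175, 180, 185, 190]
perceived = [162, 166, 169, 174, 179, 184, 170]

distances = cooks_distances(actual, perceived)
# Flag influential points using the D_i > 0.20 criterion stated in the text.
flagged = [i for i, d in enumerate(distances) if d > 0.20]
adjusted = sidak_adjust([0.01, 0.02, 0.03])
```

In this toy example the final, discrepant observation has the largest Cook's distance and exceeds the 0.20 threshold, so it would be excluded before refitting the regression.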

Ethics statement

This research was approved by the University of Sussex’s Life Sciences & Psychology Cluster-based Research Ethics Committee (C-REC; Certificates of approval: ER/KP292/11 and ER/REBY/12) and complies with the American Psychological Association’s Ethical Principles of Psychologists and Code of Conduct, including obtaining informed consent from all human participants.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

All data generated or analysed during this study are included in this published article and its supplementary information files, including source data for all figures provided as a Source Data file. Datasets and voice stimuli (n = 120 audio WAV files) are also available on the Open Science Framework (https://osf.io/r7gzb/, https://doi.org/10.17605/OSF.IO/R7GZB). Source data are provided with this paper.

References

1. Zahavi, A. Mate selection—a selection for a handicap. J. Theor. Biol. 53, 205–214 (1975).
2. Dawkins, R. & Krebs, J. R. Animal signals: information or manipulation. Behav. Ecol. Evol. Approach 2, 282–309 (1978).
3. Maynard-Smith, J. & Parker, G. A. The logic of asymmetric contests. Anim. Behav. 24, 159–175 (1976).
4. Johnstone, R. A. & Grafen, A. Dishonesty and the handicap principle. Anim. Behav. 46, 759–764 (1993).
5. Hauser, M. D. The Evolution of Communication (MIT Press, 1996).
6. Searcy, W. A. & Nowicki, S. The Evolution of Animal Communication: Reliability and Deception in Signaling Systems (Princeton Univ. Press, 2005).
7. Carazo, P. & Font, E. ‘Communication breakdown’: the evolution of signal unreliability and deception. Anim. Behav. 87, 17–22 (2014).
8. Mitchell, R. W. & Thompson, N. S. (eds.). Deception: Perspectives on Human and Nonhuman Deceit (SUNY Press, 1986).
9. Reid, S. A., Zhang, J., Anderson, G. L. & Keblusek, L. Costly signaling in human communication. In The Handbook of Communication Science and Biology (eds. Floyd, K. & Weber, R.) (Routledge, 2020).
10. Rendall, D., Owren, M. J. & Ryan, M. J. What do animal signals mean? Anim. Behav. 78, 233–240 (2009).
11. Andersson, M. B. Sexual Selection (Princeton Univ. Press, 1994).
12. Flower, T. P., Gribble, M. & Ridley, A. R. Deception by flexible alarm mimicry in an African bird. Science 344, 513–516 (2014).
13. Hurd, P. L. Is signalling of fighting ability costlier for weaker individuals? J. Theor. Biol. 184, 83–88 (1997).
14. Maynard-Smith, J. & Price, G. R. The logic of animal conflict. Nature 246, 15–18 (1973).
15. Maynard-Smith, J. & Harper, D. Animal Signals (Oxford Univ. Press, 2003).
16. Schaefer, M. & Ruxton, G. By‐product information can stabilize the reliability of communication. J. Evol. Biol. 25, 2412–2421 (2012).
17. Folstad, I. & Karter, A. J. Parasites, bright males, and the immunocompetence handicap. Am. Nat. 139, 603–622 (1992).
18. Silk, J. B., Kaldor, E. & Boyd, R. Cheap talk when interests conflict. Anim. Behav. 59, 423–432 (2000).
19. Charlton, B. D. & Reby, D. The evolution of acoustic size exaggeration in terrestrial mammals. Nat. Commun. 7, 12739 (2016).
20. Arnott, G. & Elwood, R. W. Signal residuals and hermit crab displays: flaunt it if you have it! Anim. Behav. 79, 137–143 (2010).
21. Pisanski, K., Cartei, V., McGettigan, C., Raine, J. & Reby, D. Voice modulation: a window into the origins of human vocal control? Trends Cogn. Sci. 20, 304–318 (2016).
22. Ackermann, H., Hage, S. R. & Ziegler, W. Brain mechanisms of acoustic communication in humans and nonhuman primates: an evolutionary perspective. Behav. Brain Sci. 37, 529–546 (2014).
23. Belyk, M. & Brown, S. The origins of the vocal brain in humans. Neurosci. Biobehav. Rev. 77, 177–193 (2017).
24. Grafen, A. Biological signals as handicaps. J. Theor. Biol. 144, 517–546 (1990).
25. Pisanski, K. & Bryant, G. A. The evolution of voice perception. In The Oxford Handbook of Voice Studies (eds. Eidsheim, N. S. & Meizel, K. L.) (Oxford Univ. Press, 2019).
26. Fitch, W. T. & Hauser, M. D. Unpacking “honesty”: vertebrate vocal production and the evolution of acoustic signals. In Acoustic Communication (eds. Simmons, A. M., Fay, R. R. & Popper, A. N.) 65–137 (Springer, 2003).
27. Charlton, B. D., Pisanski, K., Raine, J. & Reby, D. Coding of static information in terrestrial mammal vocal signals. In Animal Signals and Communication (eds. Aubin, T. & Mathevon, N.) 115–136 (Springer Nature, 2020).
28. Fitch, W. T. The evolution of speech: a comparative review. Trends Cogn. Sci. 4, 258–267 (2000).
29. Fitch, W. T. & Reby, D. The descended larynx is not uniquely human. Proc. Biol. Sci. 268, 1669–1675 (2001).
30. Reby, D. et al. Red deer stags use formants as assessment cues during intrasexual agonistic interactions. Proc. Biol. Sci. 272, 941–947 (2005).
31. Reby, D. & McComb, K. Anatomical constraints generate honesty: acoustic cues to age and weight in the roars of red deer stags. Anim. Behav. 65, 519–530 (2003).
32. Titze, I. R. Principles of Vocal Production (Prentice-Hall, 1994).
33. Fitch, W. T. & Giedd, J. Morphology and development of the human vocal tract: a study using magnetic resonance imaging. J. Acoust. Soc. Am. 106, 1511–1522 (1999).
34. Pisanski, K. et al. Vocal indicators of body size in men and women: a meta-analysis. Anim. Behav. 95, 89–99 (2014).
35. Puts, D. Sexual selection on male vocal fundamental frequency in humans and other anthropoids. Proc. Biol. Sci. 283, 20152830 (2016).
36. Pisanski, K. & Rendall, D. The prioritization of voice fundamental frequency or formants in listeners’ assessments of speaker size, masculinity, and attractiveness. J. Acoust. Soc. Am. 129, 2201 (2011).
37. Raine, J., Pisanski, K., Oleszkiewicz, A., Simner, J. & Reby, D. Human listeners can accurately judge strength and height relative to self from aggressive roars and speech. iScience 4, 273–280 (2018).
38. Pisanski, K. et al. Volitional exaggeration of body size through fundamental and formant frequency modulation in humans. Sci. Rep. 6, 34389 (2016).
39. Raine, J., Pisanski, K., Bond, R., Simner, J. & Reby, D. Human roars communicate upper-body strength more effectively than do screams or aggressive and distressed speech. PLoS One 14, e0213034 (2019).
40. Boersma, P. & Weenink, D. Praat: Doing phonetics by computer v 6.1.21. https://www.fon.hum.uva.nl/praat/ (2020).
41. De Boer, B. Investigating the acoustic effect of the descended larynx with articulatory models. J. Phon. 38, 679–686 (2010).
42. Puts, D. A. Beauty and the beast: mechanisms of sexual selection in humans. Evol. Hum. Behav. 31, 157–175 (2010).
43. Kordsmeyer, T. L., Hunt, J., Puts, D. A., Ostner, J. & Penke, L. The relative importance of intra-and intersexual selection on human male sexually dimorphic traits. Evol. Hum. Behav. 39, 424–436 (2018).
44. Charlton, B. D., Reby, D. & McComb, K. Effect of combined source (F0) and filter (formant) variation on red deer hind responses to male roars. J. Acoust. Soc. Am. 123, 2936 (2008).
45. Charlton, B. D. et al. Koalas use a novel vocal organ to produce unusually low-pitched mating calls. Curr. Biol. 23, R1035–R1036 (2013).
46. Charlton, B. D. et al. Cues to body size in the formant spacing of male koala (Phascolarctos cinereus) bellows: honesty in an exaggerated trait. J. Exp. Biol. 214, 3414–3422 (2011).
47. Wilson, M. L., Miller, C. M. & Crouse, K. N. Humans as a model species for sexual selection research. Proc. Biol. Sci. 284, 20171320 (2017).
48. Schild, C. et al. Linking human male vocal parameters to perceptions, body morphology, strength and hormonal profiles in contexts of sexual selection. Sci. Rep. 10, 1–16 (2020).
49. Morton, E. S. On the occurrence and significance of motivation-structural rules in some bird and mammal sounds. Am. Nat. 111, 855–869 (1977).
50. Zhang, J., Hodges-Simeon, C., Gaulin, S. J. & Reid, S. A. Pitch lowering enhances men’s perceived aggressive intent, not fighting ability. Evol. Hum. Behav. 42, 51–60 (2020).
51. Hughes, S. M., Mogilski, J. K. & Harrison, M. A. The perception and parameters of intentional voice manipulation. J. Nonverbal Behav. 38, 107–127 (2014).
52. Pisanski, K., Oleszkiewicz, A., Plachetka, J., Gmiterek, M. & Reby, D. Voice pitch modulation in human mate choice. Proc. Biol. Sci. 285, 20181634 (2018).
53. Glasberg, B. R. & Moore, B. C. Derivation of auditory filter shapes from notched-noise data. Hear. Res. 47, 103–138 (1990).
54. Traunmüller, H. Auditory scales of frequency representation. (1997).
55. van Dommelen, W. A. & Moxness, B. H. Acoustic parameters in speaker height and weight identification: sex-specific behaviour. Lang. Speech 38, 267–287 (1995).
56. Collins, S. A. Men’s voices and women’s choices. Anim. Behav. 60, 773–780 (2000).
57. Bruckert, L., Lienard, J.-S., Lacroix, A., Kreutzer, M. & Leboucher, G. Women use voice parameters to assess men’s characteristics. Proc. R. Soc. B Biol. Sci. 273, 83–89 (2006).
58. Chandler, J. & Shapiro, D. Conducting clinical research using crowdsourced convenience samples. Annu. Rev. Clin. Psychol. 12, 53–81 (2016).
59. Hughes, J. E., Gruffydd, E., Simner, J. & Ward, J. Synaesthetes show advantages in savant skill acquisition: training calendar calculation in sequence-space synaesthesia. Cortex 113, 67–82 (2019).

Acknowledgements

This research and both authors K.P. and D.R. were funded by the University of Lyon IDEXLyon ‘Programme Investissements d’Avenir’ (ANR-16-IDEX-0005) to D.R. and supported by the Labex CeLyA. Author K.P. was also supported by the European Union’s Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie grant agreement no. 655859. Publication of this article in open access was financially supported by the Excellence Initiative - Research University (IDUB) programme for the University of Wroclaw. The authors thank Profs M. Greenfield, M. Hauber and N. Mathevon for their constructive feedback on earlier versions of this paper.

Author information

Authors and Affiliations

Equipe de Neuro-Ethologie Sensorielle (ENES), Centre de Recherche en Neurosciences de Lyon (CRNL), CNRS, INSERM, University of Lyon/Saint-Étienne, Saint-Étienne, France
Katarzyna Pisanski & David Reby
Institute of Psychology, University of Wrocław, Wrocław, Poland
Katarzyna Pisanski

Contributions

K.P. and D.R. conceived and designed the study; K.P. collected all data, conducted analyses, and created all figures and tables; K.P. and D.R. interpreted the results and wrote the manuscript.

Corresponding author

Correspondence to Katarzyna Pisanski.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks W. Tecumseh Fitch, Nate Pipitone and the other, anonymous, reviewer for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Pisanski, K., Reby, D. Efficacy in deceptive vocal exaggeration of human body size. Nat Commun 12, 968 (2021). https://doi.org/10.1038/s41467-021-21008-7

Received: 14 September 2020
Accepted: 05 January 2021
Published: 12 February 2021
DOI: https://doi.org/10.1038/s41467-021-21008-7
