Skip to content(if available)orjump to list(if available)

Human speech may have a universal transmission rate (2019)

golem14

Maybe more interesting: what’s the average reading speed per language, and what’s the variation? I know that reading speed varies a lot ( also depends on topic, a math textbook reads slower than an Ian Fleming book).

Are simultaneous translator’s brains different? They need to process two languages at once, and I never could do that even though I’m fluent in more than one language.

CMay

The paper does not seem to support that human speech has a universal transmission rate or that every spoken language has a universal information rate. They showed that information rate varied by individual and by language, just that it varied less than the syllable rate.

This was also bounded by a reading task, so the performance shown per language could be influenced by the average reading skills of people who speak those languages. They also asked them to pronounce differently than they normally would.

If you take the 17 languages they tested and get the average between them, you get 39bits/s. For English and French, the information rate they recorded was higher, with an average closer to ~45bits/s (just eyeballing their chart). Their results also showed Thai at ~35bits/s. A 10bits/s swing from median to median is pretty huge.

From the paper:

"We collected recordings of 170 native adult speakers of the aforementioned 17 languages, each reading at their normal rate a standardized set of 15 semantically similar texts across the languages (for a total amount of approximately 240,000 syllables). Speakers became familiar with the texts, by reading them several times before being recorded, so that they understand the described situation and minimize reading errors"

"Together, our findings show that while there is wide interspeaker variation in speech and IRs (information rates), this variation is also structured by language. This means that an individual’s speech behavior is not entirely due to individual characteristics but is further constrained by the language being spoken."

"However, languages seem to stably inhabit an optimal range of IRs, away from the extremes that can still be available to individual speakers. Languages achieve this balance through a trade-off between ID (information density) and SR (syllable rate), resulting in a narrower distribution of IRs compared to SRs. In the introduction, we rhetorically asked whether too low or too high an IR would impede communicative and/or cognitive efficiency. Our results here suggest that the answer to both questions is positive and that human communication seems to avoid two extreme sociolinguistic profiles: on the one hand, high ID languages spoken fast by their speakers (“high-fast”), and, on the other, low ID languages spoken slowly by their speakers (“low-slow”)."

https://www.science.org/doi/10.1126/sciadv.aaw2594

Yenrabbit

I've always found this interesting. Think+transmit seems more likely to be the bottleneck vs receive, given that we can easily parse most podcasts etc at 3-4X speed. If being understood by everyone wasn't required, I wonder if one could learn to boost both send and receive rates?

Buttons840

This suggest that maybe human minds are able to form ideas at a certain speed, and language has evolved to convey these ideas and not be a bottleneck, but there is no reason to improve language beyond this.

0cf8612b2e1e

3-4x! Are you the flash? I usually run things about 1.5x when I am commuting. 3x would require laser focus without distractions. Or you mean more a “can technically absorb the language being spoken” sense?

pbh101

Not OP but listen to podcasts at highly accelerated settings:

The information density of ‘two dudes talking’ or any unscripted format is very low, so it time-compresses well. Specific podcasts, typically scripted monologues with technical content, such as Causality [0] (recommended!), I need to listen to much slower. Ditto if it is in an accent which isn’t mine, which slows my comprehension. I also slow the speed if I’m driving. So, yes, it takes mental overhead, but is doable. Go one click at a time and it will feel natural.

[0]: https://engineered.network/causality/

0cf8612b2e1e

I suppose the format is a huge differentiator. I exclusively listen to highly produced content which has essentially no dead time. The content is already a compressed transmission of information.

joshdavham

This definitely fits with my experience as a language learner studying French and Japanese.

Japanese is definitely a faster spoken language than French, but French words tend to be a lot more verbose and packed with meaning. For Japanese speakers to communicate as much meaning as French speakers, they would need to speak faster.

imglorp

Maybe not just spoken language. I would suggest sign language has a similar bit rate as spoken. The evidence is that a sign interpreter conveys about the same information in a conversation in the same time.

readthenotes1

The bits are defined in another paper by the authors using "syntagmatic density of information ratio (SDIR)" which seems to be related to the number of syllables required to convey the same information in different languages, using Vietnamese as the baseline.

I cannot wait for independent replication!

https://www.researchgate.net/publication/235971274_A_cross-L...

ars

Universal between languages maybe, but certainly not universal between individual speakers.

Joel_Mckay

There is also a lot of nonverbal data. Imagine the horror of discovering your conversational ML build could hold a plausible verbal conversation only guessing 58% of spoken words accurately... then realizing humans likely fair much worse. =3

"Prisencolinensinainciusol" (Adriano Celentano)

https://www.youtube.com/watch?v=Ax9c8f82QZA

CamperBob2

39 bits per second, about twice the speed of Morse code

Guessing there's something very fundamental that the author misunderstands about Morse code.

kevingadd

I would expect you can only transmit so fast in morse code given the need for the dots and dashes to be clearly distinguished from each other and identifiable by the recipient.

Of course, if you know both ends are computers you can just transmit in some other encoding at a much higher rate.

vvoid

As Morse speeds up, you stop relying on individual dots and dashes and begin recognizing common combinations of letters. Faster still and you are mainly hearing word stems and suffixes.

The faster the information comes at you, the less important any particular bit is, because you have more context with which to autocorrect.

selcuka

> the dots and dashes to be clearly distinguished from each other

Yes, spaces are part of the morse code spec. It looks like a binary encoding but in fact it's ternary.

We can invent a 5-bit (or 6-bit, to include numbers and punctuation) morse-like code to avoid needing spaces.

CamperBob2

The point is, Morse code is as many "bits per second" as you want. You could send Morse code at 10 gigabits per second if you wanted. It is not meaningful to say that Morse code implies a particular data rate.

Historically the metric for Morse code is words per minute. Morse is similar to a Huffman code where common letters are allocated fewer elements, so it's not very meaningful to talk about "bits per second" with respect to Morse even if you do specify the number of words per minute. The number of "bits" will vary based on the letters being transmitted.

Nevermark

Yes, computers can transmit any language at any speed (if we include parallelism).

That wasn't contested.

Tadpole9181

Oh, come on, this is just being coy for no reason. Given the context, it is abundantly clear they mean "normal, human operated morse code".

A skilled operator is around 30 WPM. The average English word is 5 (rounded up) characters. Add one character for the space. That's 180 characters per minute, or 3 character per second. With 37 characters available in morse code, that's log_2(37) or 5.2 bits per charater.

So 15.6 bits per second. Just under half of the 39 bits they got for speech, like they said.

aaron695

[dead]