Wolfgang von Kempelen's Speaking Machine

Wolfgang von Kempelen's Speaking Machine is a manually operated speech synthesizer whose development was begun in 1769 by the Austro-Hungarian author and inventor Wolfgang von Kempelen. In that same year he completed his far more infamous contribution to history: The Turk, a chess-playing automaton later revealed to be an elaborate hoax, its moves made by a human chess player concealed inside it. [4] But while the Turk’s construction was completed in six months, Kempelen’s Speaking Machine occupied the next twenty years of his life. [2] After two conceptual “dead ends” over the first five years of research, Kempelen’s third direction ultimately led him to the design he felt comfortable deeming “final”: a functional model of the human vocal tract. [3]

First Design

Kempelen’s first experiment with speech synthesis involved only the most rudimentary elements of the vocal tract necessary to produce speech-like sounds. A kitchen bellows, of the kind used to stoke fires in wood-burning stoves, served as a set of lungs to supply the airflow. A reed taken from a common bagpipe served as the glottis, the source of the raw fundamental sound in the vocal tract. The bell of a clarinet made for a sufficient mouth, despite its rigid form. This basic model could produce only simple vowel sounds, though some additional articulation was possible by positioning a hand at the bell opening to obstruct the airflow. It lacked the physical hardware needed to produce the nasals, plosives and fricatives that most consonants require. Kempelen, like many other early pioneers of phonetics, attributed the perceived “higher frequencies” of certain sounds to the glottis rather than to the formants (resonances) of the entire vocal tract, so he abandoned his single-reed design for a multiple-reed approach. [2] [3]
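
The distinction between the sound source and the resonances of the tract can be illustrated with a small source-filter sketch. The Python below is not a model of Kempelen’s apparatus: it assumes an impulse train standing in for the reed, two cascaded two-pole resonators standing in for the tract’s formants, and rough, illustrative formant values for an “ah”-like and an “ee”-like vowel. The sample rate, the bandwidths and all function names are likewise assumptions made for the example.

```python
import math

SAMPLE_RATE = 16000  # Hz; assumed value chosen for the example


def reed_source(f0, duration, sample_rate=SAMPLE_RATE):
    """Impulse train standing in for the vibrating reed (the 'glottis')."""
    n = int(duration * sample_rate)
    period = int(round(sample_rate / f0))
    return [1.0 if i % period == 0 else 0.0 for i in range(n)]


def resonator(signal, freq, bandwidth, sample_rate=SAMPLE_RATE):
    """Two-pole resonator standing in for one formant of the tract."""
    r = math.exp(-math.pi * bandwidth / sample_rate)
    theta = 2.0 * math.pi * freq / sample_rate
    a1 = 2.0 * r * math.cos(theta)
    a2 = -r * r
    out = []
    y1 = y2 = 0.0
    for x in signal:
        y = x + a1 * y1 + a2 * y2
        out.append(y)
        y2, y1 = y1, y
    return out


def synthesize(f0, formants, duration=0.2):
    """Pass the reed signal through one resonator per (freq, bandwidth) pair."""
    signal = reed_source(f0, duration)
    for freq, bandwidth in formants:
        signal = resonator(signal, freq, bandwidth)
    return signal


def harmonic_magnitude(signal, freq, sample_rate=SAMPLE_RATE):
    """Magnitude of the signal at a single frequency (one-bin DFT)."""
    w = 2.0 * math.pi * freq / sample_rate
    re = sum(s * math.cos(w * i) for i, s in enumerate(signal))
    im = sum(s * math.sin(w * i) for i, s in enumerate(signal))
    return math.hypot(re, im)


def strongest_harmonic(signal, f0, max_freq=3500):
    """Frequency (Hz) of the strongest harmonic of f0 up to max_freq."""
    harmonics = range(1, int(max_freq // f0) + 1)
    best = max(harmonics, key=lambda k: harmonic_magnitude(signal, k * f0))
    return best * f0


# Illustrative (assumed) formant settings: (frequency Hz, bandwidth Hz).
AH_LIKE = [(700, 100), (1200, 100)]
EE_LIKE = [(300, 100), (2300, 100)]

if __name__ == "__main__":
    pitch = 100  # same reed pitch for both vowels
    for name, formants in (("ah-like", AH_LIKE), ("ee-like", EE_LIKE)):
        peak = strongest_harmonic(synthesize(pitch, formants), pitch)
        print(name, "strongest harmonic at", peak, "Hz")
```

With the reed pitch held fixed, the strongest harmonic moves as the resonator settings change, which is the sense in which vowel colour depends on the formants of the tract rather than on the source.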

Second Design

The second design involved a console, similar to that of a musical organ of the period, at which the operator worked a set of keys, one for each letter. The sounds were produced by a common bellows that fed air through various pipes, each with the shape and obstructions needed to produce its letter. Through experimentation, Kempelen found that a reed’s resonant length was not crucial to creating the high-frequency components of certain vowels and fricatives, so he tuned all the reeds to the same pitch for the sake of consistency between letters. While not all letters were represented at this point, Kempelen had developed the technology required to produce most vowels and several consonants, including the plosive “p” and the nasal “m”, and was thus in a position to begin forming syllables and short words. This immediately exposed the primary flaw of the second design: the parallel arrangement of the reeds allowed more than one letter to be sounded at a time. In building syllables and words, the resulting sonic “overlap” (now referred to as co-articulation) produced sounds very uncharacteristic of human speech, undermining the intention of the design altogether. Kempelen comments:
“In order to continue my experiments it was necessary, above all, that I should have a perfect knowledge of what I wanted to imitate. I had to make a formal study of speech and continually consult nature as I conducted my experiments. In this way my talking machine and my theory concerning speech made equal progress, the one serving as guide to the other.” [3]
“It was possible, following the methods I’d been using, to invent separate letters, but never to combine them to form syllables, and that it was absolutely necessary to follow nature which has only one glottis and one mouth, through which every sound emerges and which gives a unity to them.” [2] [3]
Thus, Kempelen began work on his third and ultimately final design, which was in many ways a “close-as-possible” representation of the physiology of the vocal tract.

Third Design

The third approach returned to the layout of the very first, which was conceptually closer to the natural design of the human vocal tract than the second. As before, it consisted of a bellows, a reed and a simulated mouth (this time made of India rubber, so that vowel sounds could be shaped more easily by hand). It also included a “throat” to which a “nasal cavity” was attached (complete with two “nostrils” for making the “n” and “m” sounds), several levers and tubes dedicated to the “s” and “sh” sounds, a rod that interfered with the reed’s vibration to make a rolling “r” sound, and a separate, smaller bellows that allowed air to pass the reed while the mouth was completely closed (a feature required for the “b” sound). At one point a special valve intended to simulate the “f” fricative was included, but it was removed when Kempelen found that the same sound could be achieved by simply closing all of the orifices of the machine and allowing air to leak from the cracks.

Similarly, at one stage the design had an alternate “mouth” assembly consisting of a wooden box with a pair of hinged shutters that acted as lips. Inside the box was a hinged, wooden, string-operated flap that acted as a tongue. The assembly was meant to mimic the mouth and tongue in forming plosives such as “b” and “d”, but it was later removed when Kempelen recognized that without a proper tongue the machine would never be able to produce the “t”, “k”, “d” and “g” sounds. He worked around the problem by replacing the “t” and “k” sounds with the “p” sound, and the “d” and “g” sounds with the “b” sound (itself only a slight variation of the “p” sound). In the context of a familiar word, listeners often ignored the mispronunciation altogether (a phenomenon later explored by researchers in the field of cognitive science). Kempelen believed that people were more forgiving of his machine’s errors because the reed frequency and vocal tract resonant length he had chosen gave it a voice much more like that of a young child than that of an adult. [2] [3]

This third design, unlike those before it, was capable of speaking complete phrases in French, Italian and English (German was possible, but demanded greater skill from the operator because of the more frequent use of consonants in that language). Its greatest limitation was the bellows, which, although it had six times the capacity of human lungs, ran out of air much faster than its human counterpart. Because the design was based on a single reed as the glottal sound source, it had none of the co-articulation problems inherent in the second design. But that single reed also meant that the Speaking Machine “spoke” in a monotone. [4] Kempelen spent some time trying to introduce prosodic pitch-variation mechanisms into the reed assembly, but to no avail, and he decided to leave the design to be improved upon by the next generation of experimenters. All of the important additions in the third design came from Kempelen’s two decades of intensive research into the vocal tract in relation to spoken languages, in which the behavior of each crucial physiological element of speech production was scrutinized and replicated acoustically, mechanically, or both. [3]

A Significant Contribution

Von Kempelen died in 1804, after the completion and exhibition of his Speaking Machine, though not before publishing an extremely comprehensive account of his twenty years of research in phonetics. The 456-page book, published in 1791 and titled "Mechanismus der menschlichen Sprache nebst Beschreibung einer sprechenden Maschine" (which translates to "The Mechanism of Human Speech, with a Description of a Speaking Machine"), [2] [4] covered every technical aspect of both Kempelen’s construction of the Speaking Machine (including the preliminary designs) and his studies of the human vocal tract. [3]

In 1837, Sir Charles Wheatstone resurrected the work of Wolfgang von Kempelen, creating an improved replica of his Speaking Machine. [3] [4] Using new technology developed over the previous 50 years, Wheatstone was able to further analyze and synthesize components of acoustic speech, giving rise to a second wave of scientific interest in phonetics. After viewing Wheatstone’s improved replica of the Speaking Machine at an exposition, a young Alexander Graham Bell set out to construct his own speaking machine with the help and encouragement of his father. [4] [5] Bell’s experiments and research ultimately led to his invention of the telephone in 1876 [4], which revolutionized global communication.


[1] Von Kempelen, Wolfgang. Mechanismus der menschlichen Sprache nebst Beschreibung einer sprechenden Maschine. Stuttgart-Bad Cannstatt, 1970.

[2] Dudley, Homer & Tarnoczy, T.H. "The Speaking Machine of Wolfgang von Kempelen." The Journal of the Acoustical Society of America, Volume 22, Number 2, March 1950: pp. 151-166.

[3] Linggard, R. Electronic Synthesis of Speech. Cambridge: Cambridge University Press, 1985: pp. 4-9.

[4] Standage, Tom. The Turk: The Life and Times of the Famous Eighteenth-Century Chess-Playing Machine. New York: Walker & Company, 2002: pp. 76-81.

[5] Rossing, Thomas, et al. The Science of Sound. San Francisco: Addison Wesley, 2002: p. 365.

Wikimedia Foundation. 2010.
