Audio synchronizer

Audio synchronizer

An audio synchronizer is a variable audio delay utilized to correct or maintain audio video sync or timing [http://broadcastengineering.com/audio/broadcasting_managing_lip_sync/index.html] also known as lip sync error [http://en.wikipedia.org/wiki/Lip_sync_error] . See for example the specification given for audio to video timing given in ATSC Document IS-191 [ATSC Document IS-191 ( [http://www.atsc.org/standards/is_191.pdf] )] . Modern television systems utilize large amounts of video signal processing such as MPEG preprocessing, encoding and decoding, video synchronization and resolution conversion in pixelated displays. This video processing can cause delays in the video signal ranging from a few microseconds to tens of seconds. If the television program is displayed to the viewer with this video delay the audio video synchronization is wrong, and the video will appear to the viewer after the sound is heard. This effect is commonly referred to as A/V sync or lip sync error and can cause serious problems related to the viewer's enjoyment of the program.

In order to correct audio video sync problems the video processing circuitry outputs a DDO (digital delay output) signal which carries information about the amount of delay the video signal experiences due to the processing. The audio synchronizer receives the DDO and in response delays the audio by an equivalent amount, thereby maintaining proper audio video sync.

Modern audio synchronizers operate by digitizing and writing the audio signal into a ring memory, which is most commonly a RAM-based memory having independent read and write ability. At the appropriate delay time(as conveyed by the DDO) after an audio sample (or group of samples) are written into the memory the previously stored audio sample is read from the ring memory. The storage and reading of the audio samples takes place continuously in response to respective memory write and read addresses which are incremented by 1 count for every write or read operation. For example an audio sample would be written at address 1, a different sample read from (previously written) address 5, another sample written at address 2, yet another read from 6, write at 3, read from 7 and so on. The delay between writing and reading a particular sample is 4 addresses which when multiplied by the amount of time it takes to change from one address to the next gives the total audio delay. For examples of modern audio synchronizers, search "audio synchronizer" or "audio video sync" on the United States Patent Office web site at [http://www.uspto.gov/patft/index.html] .

Unfortunately, the video delays frequently make quick and large changes, for example a jump in delay time from 2 seconds to 6 seconds is possible. In order to maintain proper audio video sync, the audio delay needs to track these video delay changes. Changing the audio delay requires that the difference between the write address and the read address must be changed. This change can be accomplished by causing either the write or read address to jump forward or backward, however this jump causes some audio samples to be repeated or lost resulting in an unwanted and annoying pop, click, gap, distortion and/or noise in the audio signal. Some audio synchronizers operate by making repeated, very small jumps which causes unwanted (but less annoying) distortion and noise in the audio signal, rather than pops, gaps and clicks. Other audio synchronizers change delay by changing the speed of the reading of audio from the ring memory. If audio samples are read out of the memory more slowly than they are written, the delay increases. If audio samples are read out faster than they are written the delay decreases. Using variable speed reading prevents pops, clicks, gaps, distortion and noise from being introduced into the audio, but does create unwanted and annoying pitch errors. For example reading faster than writing causes the audio pitch to increase and reading slower than writing causes the pitch to decrease.

Audio synchronizers which utilize variable speed reading are generally preferred in professional applications. The control of audio delay is generally more accurate and more easily accomplished. Pitch errors in lower performance devices are uncompensated and are kept to a level that is generally not perceived by the average viewer, by limiting the amount of change of reading speed. Typically the change limit is in the order of 0.2%. Unfortunately this limits the rate of delay change and when large video delay changes occur the slow tracking rate of these uncompensated synchronizers can cause the audio video sync to be off for several seconds or minutes until the audio delay catches up with the video delay. Additionally, critical listeners such as people who have musical training can hear even the small pitch error which is allowed.

In higher performance audio synchronizers, the rate of delay change is allowed to be much faster, generally in the order of 25%, and the resulting pitch error is corrected with a pitch correction circuit. The pitch correction circuitry is frequently a proprietary design due to the difficulty in performing correction which is imperceptible to the critical listener. These higher performance audio synchronizers allow the audio delay to track even large and quick video delay changes without generating any artifacts which are perceptible to even critical listeners for most audio program material.

Recent development in video processing devices permit those devices to sense when a large video delay change will need to be made beforehand and allow that change information to be communicated to the audio synchronizer. The "advanced notice" from the video processing device allows the audio synchronizer to anticipate and take advantage of particular audio material (e.g., periods of relative silence or periods without music) to facilitate making corresponding large audio delay changes which do not risk generating noticeable audio artifacts. Further developments permit handshaking between the video processing device and the audio synchronizer to control when the video delay change is made to optimize the timing of the tracking audio delay change thereby further reducing the risk of generating noticeable audio artifacts and at the same time reducing the risk of mis-synchronization due to rapid video delay changes.

References


Wikimedia Foundation. 2010.

Игры ⚽ Поможем написать курсовую

Look at other dictionaries:

  • Audio to video synchronization — (also known as Audio video sync, Audio/video sync, AV sync, lip sync ndash; or lack of it: lip sync error, lip flap) refers to the relative timing of audio (sound) and video (image) portions during creation, post production (mixing), transmission …   Wikipedia

  • Impulse noise (audio) — Impulse noise is a category of (acoustic) noise which includes unwanted, almost instantaneous (thus impulse like) sharp sounds (like clicks and pops). Noises of the kind are usually caused by electromagnetic interference, scratches on the… …   Wikipedia

  • Film synchronizer — A film synchronizer is a device used in the editing phase of filmmaking.Film synchronizers generally have 1 to 8 gang(s) , or slots through which film can be threaded. Each gang consists of a group of large diameter sprockets on a common shaft. A …   Wikipedia

  • M-Audio — Type Subsidiary of Avid Technology Founded 1998 Headquarters Irwindale, CA, USA USA …   Wikipedia

  • MuEv — is an acronym for Mutual Events. MuEv is used in conjunction with the timing of sound and images, especially in television systems, to denote events which create temporally coincident sounds and images. These temporally coincident sound and image …   Wikipedia

  • Lip sync — or Lip synch (short for lip synchronization) is a technical term for matching lip movements with voice. The term can refer to: a technique often used for performances in the production of film, video and television programs; the science of… …   Wikipedia

  • Timebase correction — Time base correction is a technique to reduce or eliminate errors caused by mechanical instability present in analog recordings on mechanical media. Without time base correction, a signal from a videotape recorder or videocassette recorder cannot …   Wikipedia

  • Olivia MFSK — Spectrogram (waterfall display) of an Olivia 16/500 signal centered on 7073.25KHz Olivia MFSK is an amateur radioteletype protocol designed to work in difficult (low signal to noise ratio plus multipath propagation) conditions on shortwave bands …   Wikipedia

  • Pro Tools — Infobox Software name = Pro Tools logo = caption = Pro Tools LE 7.3 screenshot on Mac OS X developer = Digidesign latest release version = Pro Tools HD/LE 7.4 latest release date = November 7, 2007 operating system = Mac OS X Windows XP genre =… …   Wikipedia

  • midi — mi|di 〈Adj.; adv. u. präd.; Mode〉 halblang, wadenlang ● sie trägt midi * * * mị|di <indekl. Adj.> [wahrsch. Fantasiebildung zu engl. middle = Mitte, geb. nach ↑ mini] (Mode): (von Mänteln, Kleidern, Röcken der 70er Jahre) bis zur Mitte der …   Universal-Lexikon

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”