Practical issues in recording for the Web

Alan Anderson, School of Arts and Social Sciences, School of Commerce and Management, Southern Cross University, Lismore, 2480. Email: alan.anderson@scu.edu.au

 

Introduction

Expanding on issues raised in Anderson (2007b)[HREF 1], this poster aims to demonstrate how a fundamental knowledge of professional audio production techniques can help educators to produce high quality audio content for the Web. The quality of equipment used is one determining factor. Equally important is learning how to best use the technology in various recording and postproduction situations. Beginning with sound capture, for instance, it is important understand that microphones have a polar pattern (pickup pattern). Specific applications are discussed in Anderson (2007b) whilst the following diagrams of microphone pick-up patterns (figures 1, 2 & 3) serve as a point of reference for this poster presentation.

Microphone pickup patterns

Three commonly used patterns are shown in Figures 1, 2 and 3 (diagrams adapted from Microphone Madness, 2007)[HREF 2].

Working too close to a microphone will cause significant changes in both level and tone as the singer/speaker moves his or her head (White, 2002, p. 35). Other issues such as using a pop shield and signal processing technology to reduce sibilance and enhance a speaker’s voice are mentioned in Anderson (2007b) [HREF 1] and will be demonstrated or discussed further in this presentation.

Preamps - the next essential link in the signal path

A knowledge of signal path and gain structure issues helps to correctly configure audio software and hardware for various applications. It also helps to troubleshoot if components in the signal path (from recording to playback) fail or need reconfiguring from time to time for different recording applications. After sound is picked-up by a microphone, the next link in the signal chain is the preamp. The “preamplifier (preamp) precedes another amplifier, and its function is to prepare an electronic signal for further amplification or processing” [HREF 3]. In order to amplify the output signal from most professional microphones, it is necessary to use either a suitable soundcard, an external preamp interface or a mixing console with its own built-in preamps, attenuators and phantom power for condenser (capacitor) microphones. In these contexts, the “function of a preamp is to amplify a low level signal (possibly at high impedance) to line-level”. Examples of portable external preamp interfaces include M-AUDIO’s™ Firewire 410 and Digidesign’s MBox II (USB) bundled with ProTools LE software [HREF 3]. If a number of musical instruments and vocals must be recorded simultaneously, the preamp interface ought to have enough input channels to allocate one or more to each sound source (i.e. line in from an electric instrument, vocal microphone and/or close-miked instrument). Consequently, a preamp such as a Mackie™ Onyx 800R 8-Channel Microphone Preamp or a larger mixing console with more inputs and outputs (i/o) would be necessary.

Virtual instruments and virtual signal routing

Creating multiple tracks for virtual instruments entirely in the software domain is simple by comparison. Garage Band, Logic Pro, CuBase (to name but a few programs) enable one to create music by loop-based composition and/or record live acoustic instruments and vocals on multiple tracks. Logic Pro™is touted by Apple as “the industry-leading application for music creation and audio production,” whilst Garage Band is essentially an entry-level program that incorporates special features for creating Podcasts [HREF 4]. Having created a Podcast with background music, GarageBand enables one to put the talk above the tunes with a dynamic “ducking” effect that automatically reduces music volume while the announcer (DJ) is speaking. Radio stations have long been configuring their mixing consoles and outboard hardware for ducking so this is not a new concept. In the modern software domain, using programs such as Protools or LogicPro, dynamic ducking can be achieved by virtual signal routing of a compressor plugin on the music track with side chain input from the speech track. The advantage that Garage Band brings to the new user is that ducking is engaged automatically by use of a simple drop down menu – thus negating the need to do any virtual or physical re-routing of the signal path. A Speech Enhancer can be engaged just as easily, to reduce distracting background noise and simulate a professional microphone using compression techniques to improve the user’s voice [HREF 4].


Not everyone aiming to record audio content for the Web will be interested in creating complex audio content involving music and voice. Therefore, with the various needs of users in mind, Figure 4 outlines options ranging from a simple record-by-phone service to the use of professional studios equipped for production of commercial CD release quality music and voice. Options are ranked from least expensive (1) to most expensive (6).

Option ranked from least expensive (1) to most expensive (6) Pros Cons
1. Record-by-phone Convenient for travelers.
Adequate for voice recording.
Service provider does all the post-production and uploading.
Lo-fi audio, extraneous noise can be a problem if using a mobile phone [HREF 5].
Cost of the service.
Cost of phone calls to the service.
Possible time lost in the call waiting cue when seeking help.
Less control of your material.
2. Record and playback using built-in microphone and speakers on desktop computer or laptop computer.
Use free editing software such as Audacity™ or ProTools™ Free.
Basic multi-track recording, sequencing and mixing facilities.
Can overdub record and or comp together multiple audio tracks.
Can import previously recorded audio files, edit, compress and convert audio file for distribution via Web or on CD, DVD etc.
Can add common effects such as reverb, delay and perform basic mixing functions such as panning instruments to left or right, adjusting individual tracks levels and so on.
Ambient noise picked up by built-in microphone. Potential for feedback (sound from speakers spills back into microphone creating a loop) especially if attempting simultaneous playback and overdub recording using multi-track software. Also problematic for two-way voice or videoconferencing over Internet unless headphones or earphones are used.
3. Record and playback using built-in audio, external microphone and headphones. Consumer grade sequencing software such as Apple’s Garage Band™. USB music keyboard interface.
USB or Firewire interface for video camera.
Compared to Audacity, for instance, Garage Band provides more features for multi-track recording, MIDI sequencing, loop-based composition and mixing of music.
Facilities to record, edit, mix and compress audio files for distribution. Automatic presets. Enables user to import video clips and compose music to accurately sync to video using SMPTE time code.
More time to learn but considered user-friendly and intuitive to learn – especially if the user has some prior experience in music or audio production.
4. Using a laptop or desktop computer,
external preamp-interface with 1 or 2 inputs. Software such as Logic Pro™, Garage Band or similar programs.
Prosumer/professional level. Compared to Garage Band and comparable programs, Logic Pro, ProTools LE and similar level programs and hardware enable more advanced multi-track recording, sequencing and mixing of music. Advanced automation features, advanced signal routing and dynamics processing capabilities along with a large array of effects and software instruments. Can import video and sequence music to video for distribution in a range of formats including video podcast. Capable of producing commercial-release-quality recordings. More time to learn. Not as user-friendly and intuitive to learn as programs like Garage Band. Challenging initially for the user with no prior experience as a performer, composer or sequencer of music.
5. Digital Audio Workstation with 8 or more inputs/outputs and automated control surface, e.g. Digi02/ProTools LE. More i/o (in/outs) channels (8) than MBox Mini (1) or Mbox II (2) and sends to record and monitor with.
Preamps built-in.
Commonly used in semi-professional projects and smaller professional recording studios.
Less portable
Requires more audio training or time spent learning about music and voice recording.
6. Record in a professional recording and/or radio broadcast studio. Generally commercial-release quality (or at least high quality demo) recording equipment and rooms. Likely to be equipped with ProTool HD or equivalent and large automated recording and mixing console. Generally high costs involved: per hour for use of studio, any additional talent (session musicians, vocalists or voice-over artists) plus cost of employing one of the studio’s resident or approved casual audio engineers. The cost per hour and possibly unfamiliar professional environment can be off-putting to the inexperienced. Performance anxiety can result in a less-relaxed performance and a sonically perfect recording of a less-than-perfect performance.

Fig 4. Audio recording for Webcast options ranked from least expensive (1) to most expensive (6)

Fix it in the mix

Awareness of what can or cannot be corrected or smoothed over by audio signal processing technology is useful for deciding whether a particular recorded performance or ‘take’ is of adequate quality to keep before moving on to record the next track. Roe (2006) notes it is important not to become complacent or careless adopting a ‘fix it in the mix’ mentality because contrary to popular belief, audio processing cannot always remove all extraneous noises on the fly or in postproduction [HREF 6]. Unwanted sounds that occupy the same frequency range as other wanted sounds can be particularly troublesome - even with the benefit of powerful multi-band compressors and equalization (EQ) tools that can isolate and filter out selected frequencies. The general consensus among audio engineers is “if you can get it right at the recording stage life becomes so much easier at mixdown” (Roe, 2006) [HREF 6]. In a live (real-time) VoIP situation, there is no opportunity to stop and do another ‘take,’ so care must be taken to do and say things right the first time.usWeb papers contain two types of references.

 

Audio compression (data)

It is common practice in professional audio circles to aim for the highest possible audio quality when recording. But recording at a higher quality level inevitably creates a large file as the analogue data is recorded into uncompressed digital format such as WAV or AIFF. The digital source file must consequently be encoded to compress the file into the appropriate sampling rate and bit depth to distribute audio over the Internet (Steward, 2007) [HREF 7] (Gerth, 2001) [HREF 8]. Gerth notes that an "encoding tool is used to select a codec based on the bandwidth target (connection speed) and the audio content (music or voice) to compress the original sound file and create a new audio file." If a large dynamic range of sound is required,16 bit resolution is recommended, whereas, if all sounds are about the same volume, 8 bit may be suitable (ibid). Sampling rate refers to the number of samples of sound taken per second whereas 'bit depth' defines the dynamic resolution of the audio. The more bits used to represent the sampled value, the more accurate the sample will be with respect to the original analog sound wave (ibid). For the purpose of comparison, it is worth noting that CD quality audio is 44,100 (44.10 kHz) samples per second, 16 bit, 2 channel (stereo), whereas telephone quality audio is 1 channel (mono) 8 bit at 11,025 samples per second (11.025 kHz) [HREF 7] [HREF 8]. One minute of CD quality sound requires around 10.5 MB of disk space whereas one minute of telephone audio quality would require as little as 0.6MB. Smaller file sizes facilitate faster download times from Web sites but the most economical options for keeping file sizes to a minimum invariably involve some trade-off in sound quality [HREF 8] (Beggs & Thede, 2007)[HREF 9]. Classical music, for instance, is characteristically wide in its range of timbres and dynamic range, and more likely to be adversely affected by lower bit resolution than speech (see Figures 5 & 6) [HREF 8] [HREF 10]. Likewise, full-bodied vocal requires 16 bit resolution [HREF 8].

Fig 5. Storage requirements for one minute of audio (Gerth, 2001) [HREF 8]

Fig 6. Streaming audio (PC Magazine, 1999 in Gerth, 2001) [HREF 8]

Clearly, the appropriateness of particular formats and compression settings must therefore be weighed carefully in relation to the nature of the audio content and the purpose of including it in the Web site or Webcast.

Recommendations for good quality audio production

 

Conclusion

Expanding on issues raised in Anderson (2007b) [HREF 1], this poster presentation is designed to demonstrate how a fundamental knowledge of professional audio production techniques can help educators to produce high quality audio content for the Web. These skills are directly applicable to Voice over Internet Protocol (VoIP) applications and hybrid VoIP and Web-based learning environments such as Elluminate.

 

References

White, P. 2002, Recording and Production Techniques, Sanctuary Publishing Limited, London, UK.

Hypertext References

HREF 1
http://ausweb.scu.edu.au/aw07/papers/refereed/anderson3/paper.html

HREF 2
http://www.microphonemadness.com/faq.htm#whatelse

HREF 3]
http://www.tweakheadz.com/microphone_preamps.htm

HREF 4
http://www.apple.com/ilife/garageband/

HREF 5
http://freesound.iua.upf.edu/tagsViewSingle.php?id=36_6_117_27

HREF 6
http://www.mixbuss.com/article.php?id=32

HREF 7
http://ezinearticles.com/?Adding-Sound-To-Your-Web-Site---The-Good,-The-Bad-And-The-Ugly&id=22528

HREF 8
http://64.233.179.104/scholar?hl=en&lr=&safe=off&q=cache:1MloFpiKq8kJ:www.coe.uh.edu/courses/cuin7330/class7/audio-on-the-web.pdf+poor+audio+flash+audio+on+classical+music

HREF 9
http://www.intranetjournal.com/articles/200102/psm1a.html

HREF 10
http://musicalassumptions.blogspot.com/2007/05/high-fidelity-death-of-classical-music.html

Copyright

Alan Anderson© 2007. The author assigns to Southern Cross University and other educational and non-profit institutions a non-exclusive licence to use this document for personal use and in courses of instruction provided that the article is used in full and this copyright statement is reproduced. The authors also grant a non-exclusive licence to Southern Cross University to publish this document in full on the World Wide Web and on CD-ROM and in printed form with the conference papers and for the document to be published on mirrors on the World Wide Web.