Direct YouTube link: https://youtu.be/u8uqWfz7-98
Sound Metrics: Speech Interference Level
All sound metrics are used to help quantify various aspects of a sound or a noise. As the name suggests, speech interference level (SIL) was first created to give an estimate of how much a given noise spectrum will disrupt, or interfere with, effective speech communication. One common area of confusion related to speech interference level arises from the various forms it can take, and how they are different. This article will briefly discuss the history and origination of SIL, describe the different variants of SIL that exist in use today, illustrate how to calculate the different forms of SIL in Simcenter Testlab software, and make note of some important considerations when using SIL.
This Speech Interference Level article will cover:
1 – A brief history of Speech Interference Level
Much of the current understanding of speech intelligibility comes from research performed around the time of World War II. Scientists at Bell Telephone Laboratory (N.R. French and J.C. Steinberg), along with Leo L. Beranek at Harvard University (who was working on improving speech communications for aircraft pilots as part of the war effort) devised several methods to quantify the effectiveness of human speech communication for a given background noise level. The main result of their work was the metric for Articulation Index. However, articulation index and other similar methods required rather complex calculations, and as a result were impractical for most applications at the time. There existed a need for a simple, single number metric that could be used to quickly give an estimate of the level of noise in the frequency bands critical to speech communication. This number could then be used to infer how difficult voice communication might be given a noise spectrum. Thus, the Speech Interference Level was born.
The original formulation of SIL came from studies on the effectiveness of conversation in aircraft during flight. Beranek showed the number of words heard correctly between a talker and a listener was highly correlated with the average of the noise levels (in decibels) of three octave-frequency bands: 600-1200 Hz, 1200-2400 Hz, 2400-4800 Hz. This method provided a straight-forward calculation that related the level of noise in the aircraft to the effectiveness of speech communication. Think of this as the original “SIL”. It is not standardized, nor currently in use in its original form.
Several years later in 1967, the Acoustical Society of America standardized the octave-frequency bands used for noise analysis. The new standardized octave-frequency bands were to have center frequencies based on the sequence of “Preferred Numbers”, which resulted in center frequencies at multiples and sub-multiples of 1000 Hz. Thus, the octave bands (and 1/3rd Octave bands) we know today are a result of this standardization. Using the new “preferred number octave bands”, the SIL became the “PSIL”, or “preferred-octave speech interference level”. It is the octave band naming that is “preferred”, not the speech interference level metric calculation. PSIL is not described in any international standard but is commonly used across industries. PSIL is the average of the 500, 1000, & 2000 Hz octave bands.
In 1977 American National Standards Institute standardized the speech interference level, and included 4 octave bands: 500, 1000, 2000, 4000 Hz. This is “ANSI SIL”, “4-band SIL”, or simply “SIL”, as it is the only standardized form of speech interference level.
Some industries, most notably the aviation industry, instead choose to use the 1000, 2000, 4000 Hz bands in the SIL calculation for various reasons. This calculation method is not standardized, and is known as “SIL3” in Siemens Simcenter Testlab software to differentiate it from PSIL and ANSI SIL.
2 – Various formulations of SIL
All the different forms of speech interference level are calculated by taking the arithmetic mean of un-weighted, full-octave band sound pressure levels, as expressed in decibels (dB). The only difference between the various forms of SIL is the octave bands included in the calculation. The various forms are listed below:
2.1 – Example calculations
The spectrum below is used to calculate each of the formulations of SIL (see Figure 1). Decibel amplitudes for each of the octave bands are shown in the legend of the plot. Note the X-axis shows full octave bands (1/1 as opposed to 1/3rd) and the Y-axis is un-weighted decibels (not dB(A)).
Often the SIL in decibels is used in conjunction with a chart like the one shown in Figure 2 below. This diagram provides a method of relating the speech interference level and distance between speaker and listener to a level of effort required for reliable communication. Ideally the environment is designed so the intersection of the lines corresponding to the measured SIL and the distance between the speaker and the listener occurs within the blue shaded region (or below, in terms of SIL). This allows for highly reliable speech communication without added fatigue or information loss.
For example (Figure 3 below): Given the ANSI SIL calculated earlier of 85.00 dB, the chart in Figure 2 can be used to determine how effective speech communication will be between a pair of communicators. If the communicators are 1 foot apart, the chart shows the intersection is within the “Communicating Voice” envelope, and verbal communication should be easy and effective. However, if the distance between the communicators is increased to 8 feet, the background noise (SIL = 85.0 dB) will make verbal communication impossible. If the noise level cannot be lowered, and the speaker and listener cannot be moved closer together, some other type of amplification or communication system must be employed.
3 – Calculating SIL in Simcenter Testlab
Due to the fact that speech interference level is calculated on an octave-frequency band basis, it should be noted that there are multiple ways to calculate and display octave-based frequency information in Simcenter Testlab (formerly known as LMS Test.Lab). These different methods can be broken down into two main categories: FFT-based octaves, and Filter-based octaves.
3.1 – FFT octaves: This method is simply a display technique and will not be exact according to any standard that uses or references octave bands. FFT octaves are fixed-sampled, narrow-band frequency spectra lumped into amplitude groups corresponding to octave band frequency limits. For example, if fixed-sampled narrow band frequency data is added to an “Octave” display in Testlab, the software groups the data into the appropriate frequency bands and takes an RMS of the narrow-band amplitudes. This is shown in Figure 4 below.
The source data is the same between the top and bottom displays. By definition* the 1000 Hz 1/1 octave band has a minimum frequency of 707.95 Hz and a max frequency of 1412.54 Hz. Placing a double-x cursor on the upper narrow band plot and calculating the RMS of this band shows the same amplitude as the octave plot. FFT octaves are a convenient way of viewing narrow-band frequency data in octave and 1/3rd octave formats without requiring the usage of time-domain filters during acquisition or post-processing.
*By default, Testlab uses “Ideal Base 10” frequency definitions for FFT-based octaves as described in ANSI S1.11-2004.
3.2 – Filter-based Octaves: The standardized method for calculating octave band amplitudes is to employ a series of time-domain band-pass filters on the incoming transducer signals. These filters have themselves been standardized by the Acoustical Society of America, in conjunction with the American National Standards Institute and International Electrotechnical Commission (IEC). The specifications for the octave band filters are set forth in ANSI S1.11-2004: “Specification for Octave-Band and Fractional-Octave-Band Analog and Digital Filters” and IEC61260:1995: “Electroacoustics – Octave-Band and Fractional-Octave-Band Filters”.
Unlike the FFT-based octaves, which place a razor-sharp edge to the frequencies at the limits of the band (for instance the 1kHz band is all frequencies from 707.95 Hz – 1412.54 Hz, inclusive) time-domain filters have a filter shape, pass band, and roll-off associated with the filter. This means the frequencies (particularly at the edges) of the octave band are treated differently than with FFT-based octaves. See Figure 5.
Time domain filters are not capable of knife-edge corners at the edges of the pass-band like the FFT-based method, they must decrease the amplitude more gradually, or “roll off”. This roll-off means that frequencies in the region between center-frequencies of neighboring bands will participate in more than one octave band. This is the case for the shaded region of Figure 4 – these frequencies will contribute to the overall level of both the yellow colored 1kHz octave band as well as the green 2kHz octave band (albeit at reduced amplitudes). This effect highlights the importance of utilizing octave filters that adhere to a widely accepted standard, particularly when comparing data across different organizations, regions or industries. If all data is acquired/processed using standardized time-domain filters, the effects of the filter will be the same for every case.
To use the ANSI-IEC standardized octave filters in Testlab, the user must first turn on the corresponding Add-in. Add-ins can be turned on and off by clicking: Tools > Add-ins in Testlab and checking on the box for “ANSI-IEC Octave Filtering” as shown in Figure 6 below. This Add-in uses 23 Tokens and will add an “RTO” tab to certain processing areas of Testlab. RTO stands for “Real Time Octaves”.
3 – Calculating SIL in Simcenter Testlab (Continued)
The various forms of SIL can be added to the legend of any frequency spectrum (regardless of whether that spectrum was created using fixed-sampling or real time octaves) by right-clicking on the border of the curve legend, then clicking “Options…” as shown in Figure 7 below.
The Curve Legend Options dialogue box will appear. Click on the “Calculated Content” tab along the top of the window. In the list of available functions will be the three forms of SIL as previously described. Highlight the desired metric in the left window and click the “Add to selection” arrow in the center as shown in Figure 8.
When adding multiple metrics to the curve legend it is sometimes helpful to include a descriptor using the “Prefix” box as shown in Figure 9. Text added here will be added to the legend before each selected metric, and will help the clarity of the information in the legend, particularly when multiple metrics are added at the same time. See Figure 10.
3.3 – SIL vs Time
The various forms of speech interference level can also be plotted versus time (or RPM, or any other tracking parameter) to see how the SIL values change over the course of a test (see Figure 11 below). This functionality requires the Sound Quality Metrics Add-In.
The various formulations of SIL can be found on the “Psychoacoustic Metrics” tab, which is in the “Sections” portion of the Time Data Processing worksheet. To specify the use of the standardized ANSI-IEC time-domain filters, select the “Psychoacoustic Metrics RTO” tab as shown below in Figure 12.
4 – Important considerations for using SIL
Questions? Contact macdonald@siemens.com
Related Acoustics Links: