How human perception and digital systems shape forensic audio analysis
Understanding the human auditory system’s capabilities and limitations
Definition: One sound (masker) raises the threshold of hearing for another sound (probe/maskee), potentially making it inaudible
Three types critical for forensics:
Pre-masking (backward masking):
Post-masking (forward masking):
Not energetic overlap—higher-level perceptual interference
When analyzing audio evidence, ask:

| Scale | Basis | Primary Forensic Use |
|---|---|---|
| Bark Scale | 24 critical bands; 1 Bark = 1 critical band | Audio compression (MP3 standard); loudness modeling |
| Mel Scale | Pitch perception; 1000 mels = 1000 Hz at 40 dB SPL | Speech recognition; speaker identification (MFCCs) |
| ERB Scale | Equivalent Rectangular Bandwidth; smoother auditory filter refinement | Noise reduction algorithms; high-resolution psychoacoustic research |

Gain normalization & playback
A-weighting
Codec artifacts
Loudness integration window: ~200 ms
Temporal fusion and pitch: ~30 ms
Transient detection thresholds
Butt splice
Cross-fade
Background forensics
Interaural Time Difference (ITD)
Interaural Level Difference (ILD/IAD)
Cone of confusion
Law of the first wavefront
Time windows: 1–30 ms
Forensic implication
Talker discontinuity
Reverberant mismatches
Binaural unmasking artifacts
AGC pumping
Task: Listen to provided audio example with simultaneous masker and probe tones
Questions:
How sampling, quantization, and perceptual coding affect forensic evidence
Nyquist-Shannon Sampling Theorem
Standard rates
Aliasing
6 dB per bit rule
Low bit-depth consequences
How MP3, AAC, and Opus work:

| Technique | Function | Forensic Impact |
|---|---|---|
| Spectral Band Replication (SBR) | Removes high frequencies during encoding, reconstructs by transposing low frequencies | Gunshots or high-pitched speech may sound “natural” even if HF data was never recorded |
| Perceptual Noise Substitution (PNS) | Replaces noise-like bands with random noise parameters | Loss of subtle background “fingerprints” used to identify recording location |
| Joint Stereo (M/S) | Converts L/R to Sum (M) and Difference (S) to save bits | Can create stereo artifacts; intensity stereo replaces correlated HF with envelope + directional cues |
Pre-echo
Spectral holes (“birdies”)
Aliasing
Re-encoding buildup
Uncertainty Principle: Impossible to achieve arbitrarily high resolution in both time and frequency simultaneously
STFT (uniform)
Wavelets (multi-resolution)
Auditory filterbanks (non-uniform)
The pitfall
Visual-only risks
Best practice
Task: Compare original uncompressed audio with MP3 versions at different bitrates
Analysis:
Forensic question: If this were evidence, how would you explain these artifacts to a jury? Would you recommend enhancement or caution against it?
When does perception matter? When does signal evidence dominate?
| Feature | Perceptual Audibility | Signal Evidence |
|---|---|---|
| Primary utility | Evaluating earwitness testimony; detecting codec artifacts | Geometric reconstruction (multilateration); calculating speed (Doppler effect) |
| Limitations | Ear integrates over ~200 ms; very brief sounds seem quieter than they are | High-amplitude sounds can be clipped or distorted by recorders |
| Forensic conflict | “Quality” (how nice it sounds) ≠ “intelligibility” (understandability) | Processed audio may sound “cleaner” but have lower speech intelligibility |
Key principle: Use perceptual analysis for testimony evaluation and enhancement assessment; use signal analysis for geometric and physical reconstruction
ENF (Electrical Network Frequency)
Multilateration (TDOA)
Ballistic shock waves
Spectral signatures
“CSI Effect”
Standards
Expert responsibilities
Laboratory setup
Iterative audition
Cognitive bias mitigation
Playback calibration
Enhancement caution
Hash verification
Format standards
Scenario: You receive a 128 kbps MP3 recording of an alleged confession.
Defense: “I kept watching her,” Prosecution transcript: “I killed Winchester.”
Your tasks
Psychoacoustics & Perception
Digital Audio & Codecs
Forensic Audio
Standards