|
Neural Encoding Model (NEM)
BrainMedia has pioneered the translation of the most recent and leading edge neurophysiology into computational models that can be programmed into codecs (a field called “computational neurophysiology”). Because of this pioneering position, BrainMedia was able to build a blocking patent portfolio related to these breakthroughs. The result is the new BrainMedia Neural Encoding Model (NEM™). This new codec identifies the 1-3% of audio signals that is actually represented in the brain (cortex). After music passes through the inner ear (where traditional codecs work), it is further filtered before it reaches the human cortex. It is this highly processed and filtered signal that the brain actually hears
After music passes through the inner ear (where traditional codecs work), it is further filtered before it reaches the human cortex. It is this highly processed and filtered signal that the brain actually hears.
Music Encoding Breakthrough
New performance gains in audio codec quality will only come from breakthroughs understandings of how the brain does and does not process sound. All standardized and de facto standard codecs are based on a primitive understanding of human sound perception (called “psychoacoustics”) pre-dating the recent revolution in auditory neurosciences. With advent of new and powerful experimental tools, academic researchers at leading institutes are unraveling for the first time the details of what the human brain can actually hear, and why.
Not Neural Network-Based
NEM is based on the actual neurophysiology of the human brain and is distinctly different than, for example, the field of “neural networks” which does not benefit from any insight into how the brain actually processes sound. The core of BrainMedia’s audio encoder is our patented Neural Encoding Model that reveals which abstract nuances of any audio signal are processed by the brain and represented in the cortex. By comparing the neural representation of the highly compressed audio signal to the representation of the original audio, an advanced statistical analysis calculates the “perceptual distance” between the compressed and original audio.
How it Works
The NEM codec minimizes the perceptual distance of the compressed audio from the original audio; a small perceptual distance implies that as far as the cortical representation is concerned, the person is truly enjoying the perception of the original signal. This results in codec audio performance since the NEM model guides the precise placement of the tiny fraction of “bits” that are necessary to represent the audio with near zero perceptual distance from the bit-rich original signal.
NEM and NEMx
The BrainMedia Advantage
The introduction of the eAAC+ and AMR-WB+ codecs demonstrated to the market that the performance of an audio codec can be augmented at low bitrates by using techniques based on “source models”. Source model techniques such as “spectral extension” and “parametric stereo” can be used to approximately recreate portions of the audio signal that were not actually encoded by the “baseband codec”.
For example, in eAAC+, the AAC codec is used to encode only a portion of the lower frequencies (baseband audio), and spectral extension(the “+” part) and parametric stereo (the “e” part) are used to reconstruct the omitted audio. A low bit rate is achieved by using the baseband as a source of “hints” on how to approximate the missing upper bands, as well providing additional compact hints, in lieu of sending the high bit rate AAC upper bands.
The result is a superior user experience based on substantial advances in quality and lower bitrates over either eAAC+ or AMR-WB+