Accurate and efficient speech signal processing method for artificial intelligence

This technology is a pitch-synchronous signal processing method which can capture relevant speech information for artificial intelligence.

Unmet Need: Reliable processing method of raw speech signals

Many artificial intelligence systems rely on inputs in the form of raw speech signals which need to be converted into text for processing by the neural network. Traditional signal processing methods only crudely represent some speech features, losing valuable and important properties along the way. Newer approaches feed raw signal waveforms into large neural networks, but these methods can be computationally prohibitive and prone to noise and irregularity.

The Technology: Pitch-synchronized method for accurate and efficient speech processing

This technology is a pitch-synchronous speech processing method for artificial intelligence, separating waveforms into pitch periods and timbre vectors which more closely mimic human hearing patterns. These serve as more complete and accurate representations of speech compared to traditional methods, while being more concise and less noisy than using raw speech signals.

Applications:

Artificial intelligence
Speaker voice recognition
Educational tool for language learning
Accessibility tool for hearing impaired

Advantages:

Mimics human hearing and speech processing
Complete and concise representation of speech
Less noisy
More efficient and improved accuracy

Lead Inventor:

Julian Chengjun Chen, Ph.D.

Related Publications:

Chen CJ, Miller DA. “Pitch-synchronous analysis of human voice” J Voice. 2020 Jul; 34(4): 494-502.

Tech Ventures Reference:

IR CU22076
Licensing Contact: Dovina Qu