Columbia Technology Ventures

Accurate and efficient speech signal processing method for artificial intelligence

This technology is a pitch-synchronous signal processing method which can capture relevant speech information for artificial intelligence.

Unmet Need: Reliable processing method of raw speech signals

Many artificial intelligence systems rely on inputs in the form of raw speech signals which need to be converted into text for processing by the neural network. Traditional signal processing methods only crudely represent some speech features, losing valuable and important properties along the way. Newer approaches feed raw signal waveforms into large neural networks, but these methods can be computationally prohibitive and prone to noise and irregularity.

The Technology: Pitch-synchronized method for accurate and efficient speech processing

This technology is a pitch-synchronous speech processing method for artificial intelligence, separating waveforms into pitch periods and timbre vectors which more closely mimic human hearing patterns. These serve as more complete and accurate representations of speech compared to traditional methods, while being more concise and less noisy than using raw speech signals.

Applications:

  • Artificial intelligence
  • Speaker voice recognition
  • Educational tool for language learning
  • Accessibility tool for hearing impaired

Advantages:

  • Mimics human hearing and speech processing
  • Complete and concise representation of speech
  • Less noisy
  • More efficient and improved accuracy

Lead Inventor:

Julian Chengjun Chen, Ph.D.

Related Publications:

Tech Ventures Reference: