Hardware


Speech Recognition Hardware Solution - Hardware Architecture

NSC's unique and innovative hardware product offering provides a server-based, multi-engine, speech product that is deployed on a dedicated PCI blade.

The NSC SpeechBlade™ platform supports the required memory and processing resources for the speech recognition task. It enables full scalability from incorporation of speech recognition into an existing application server (“all-in-one-box” solution) up to addition of multiple servers allowing for deployments of a large number of speech recognition channels.

Key Features in NSC's HW Product offering :

  • High Performance
  • Compact Implementation
  • Unique Architecture
  • No CPU resources needed for speech processing - In small deployments, this allows for an "all-in-one-box" architecture.
  • High density – Cost savings, simplified architecture
  • Multi-application Capability – Allows the user to "farm" resources either in a centralized or distributed mode.
  • Flexible allocation of speech recognition resources per blade – Allows for local/distributed system architectures

The Concept - Speech Server on a Blade

NSC provides multiple speech processing engines operating on the NSC SpeechBlade™ product line - PCI blades comprising a unique platform that enables the entire speech processing operation to be done on hardware. This unique approach by NSC allows a "CPU-free", high density product with simple system architecture.

NSC’s hardware-based products implement NSC's unique technology through three high-level components:

The Engines
Includes all the speech engine software components. The engine consists of firmware, server software, application development tools, and APIs.

The Platform – SpeechBlade™
Where the NSC engines reside and operate. SpeechBlade™ is a product line of PCI blades developed by NSC.

Language Packages
Packages that support specific languages/dialects and include linguistic components.

NSC’s products run on Windows (XP, server 2003 R2-32Bit and Win7 ) and Linux and support multiple local and remote applications.

NSC Hardware-Based Speech Recognition Engines

The speech recognition engine part of the hardware-based product includes all the product software components. The speech engine consists of firmware that operates on the NSC SpeechBlade™ , NSC’s PCI blade, as well as server software, application development tools, and APIs.

The NSC SpeechBlade™ Server includes two main speech engines for the hardware platform that implement the company's unique technology and operate on the NSC SpeechBlade™ product line of PCI blades.

These two speech engines deliver different speech recognition functionality on NSC’s hardware platform and target the needs of different market segments.

NSC Spotter
Keyword spotting engine for NSC’s hardware platform designed for locating pre-defined words in phone calls in real time and off-line from call recordings targeting the Security and Intelligence, Call Center and Online Media markets.

NSC Speecher
Speech recognition engine for NSC’s hardware platform designed for speech enabling services in the Call Center, Telco, and Enterprise markets. Its main focus is voice enabling IVR (Interactive Voice Response) applications.

NSC Hardware-based Speech Recognition Engine Components:

  • NSCServer - Runs on the host computer and performs resource management activities as well as managing the communication between an application running on the host machine and the firmware running on the NSC SpeechBlade™.
  • NSC Firmware - Runs on the NSC SpeechBlade™ and implements NSC's proprietary speech technology on standard DSPs. NSC develops and implements several firmware modules designed to support two speech engines: NSC Speecher and NSC Spotter.
  • API Application - Programming Interface between the speech-driven application and the speech recognition engine.
  • Development Tools - Provided for the development of speech-driven applications on various target platforms.

NSC SpeechBlade

The Platform – The NSC SpeechBlade

The NSC SpeechBlade™ is NSC's high-density, cost-effective PCI blade product line comprising a unique platform that supports running multiple telephony speech processing engines. The blades are composed of basic modules, each including a DSP, memory and peripherals. Each module is autonomous and can be independently configured to a different speech engine, language, and resource type. Several speech processing resources can run on a single module depending on the speech processing task complexity. The entire speech processing operation is done on the PCI blade – thus creating a robust, CPU-free platform for the operation of multiple speech processing engines.

The product line of blades allows for full scalability from very small capacity (e.g., 2 speech recognition channels) to very high capacity (e.g., 90’s of speech recognition channels in a single server):

NSC SpeechBlade™ - 2
(Half size PCIU)

Supports up to 2 NSC Speecher/Spotter resources
NSC SpeechBlade™ - 4
(Half size PCIU)

Supports up to 4 NSC Speecher/Spotter resources
NSC SpeechBlade™ - 12
(Half size PCIU)

Supports up to 12 NSC Speecher/Spotter resources
NSC SpeechBlade™ - 30
(Half size PCIU)

Supports up to 30 NSC Speecher/Spotter resources

The NSC SpeechBlade™ has a PCI interface for communication with the NSCServer and an H.100 bus interface for acquisition of speech samples directly from a Computer Telephony Integration (CTI) board.

The Speech Acquisition Process

Each speech resource acquires speech samples independently. There are three alternative interfaces for acquisition of the speech samples:

  • Through the H.100 bus interface (when the CTI board resides on the same machine).
  • Through the PCI interface (when the application uses API commands remotely for streaming the speech samples).
  • Through RTP

Through the PCI Interface, API commands can be issued and language packages and firmware can be downloaded.

NSC SpeechBlades™ offer Optimum Flexibility
Each module can be independently configured to run a different language package, different speech engine, and different grammars. This allows supporting multiple applications and multiple speech engines.

NSC Spotter

NSC Spotter - A Robust Real-Time Word Spotting Engine for NSC’s hardware platform driving Audio Surveillance, Speech Analytics, and data mining applications

NSC Spotter is the Keyword Spotting (KWS) engine for NSC’s hardware platform designed for locating words out of a pre-defined list in spoken conversations. It operates in real time or on recorded calls, and is targeted for the Security and Intelligence and Call Center markets. NSC Spotter is used for speech analytics and call surveillance in the Security and Intelligence market as well as for mining business information and monitoring agent performance in the Call Center market.

In addition, NSC Spotter can be used in any market segment, service or application where there is a need for analysis of a large number of speech calls or recordings.

NSC Spotter implements a unique keyword spotting algorithm - a one-stage approach. The approach searches for keywords in a one-stage operation. The recognition is phoneme-based - NSC Spotter looks for the complete word in the speech stream based on a target sequence of phonemes that represent the keyword. The unique approach by NSC that operates directly on the incoming speech in one stage rather than the usual two-stage system that translates the speech into text (phonemes or words) and then performs a text search for the keyword.

NSC Spotter has the following main features:

  • Compact implementation - Delivers up to 90 real-time, simultaneous speech recognition resources on a single PC - the DSP-based NSC SpeechBlade™.
  • Noise robustness - Leverages noise robustness algorithms to deliver highly accurate keyword spotting in noisy conditions and for corrupted speech.
  • Real-time keyword spotting - Supports real-time recognition of keywords in an ongoing, spontaneous conversation. NSC’s solution can support real-time keyword spotting applications in large scale systems, as well as off-line analysis of large recording databases for Speech Analytics.
  • Multi-language support - Supports keyword spotting in multiple languages simultaneously on a single NSC SpeechBlade™.
  • High accuracy on a variety of audio channels - NSC has field-proven capabilities and knowledge in recognizing speech from radio channels of different types, as well as landline and cellular telephony.

Read the KWS for Speech Analytics White Paper

NSC Speecher

NSC Speecher - A High-Accuracy Speech Recognition Engine for NSC’s hardware platform targeted at Telephony Speech-Driven Services

NSC Speecher is the telephony speech recognition engine for NSC’s hardware platform designed for speech enabling services in the Call Center, Telco, and Enterprise markets. Its main focus is IVR (Interactive Voice Response) applications. NaturalASR speech enables services such as: Call center automation, information access, automated attendant, value-added services, call routing, and voice controlled voicemail.

NSC Speecher has the following main features:

  • High Accuracy - Leverages advanced algorithms that enable speech recognition with high accuracy.
  • Compact Implementation - Delivers up to 120 real-time, simultaneous speech recognition resources on a single PCI add-in card – the DSP-based NSC SpeechBlade™.
  • Natural Language Understanding - Supports Natural Language Understanding (NLU) allowing users to speak naturally and not be forced into fixed speech templates. Additionally, NSC Speecher supports recognition of complex grammars allowing thousands of possible utterances.
  • Multi-Language Support - Supports speech recognition in multiple languages simultaneously on a single NSC SpeechBlade™. Allows for handling of calls simultaneously in multiple languages.
  • Barge-In Support - Many speech-driven services must be able to handle interruption of the prompt by the caller (“barge-in”). Since the speech recognition engine is active while prompts are played, the result is an echo added to the speech to be recognized. NSC has implemented echo cancellation on the SpeechBlade™ to cancel this echo during barge-in resulting in just speech that needs to be recognized.
  • Dynamic Grammar Support - For each recognition request, a different grammar can be used.

© 2010 Natural Speech Communications Ltd. All Rights Reserved.