SDK Features
All audio filters are real-time and language-independent.
Noise Cancellation (NC)
Noise Cancellation algorithm is designed to remove background noise during real-time
communication. Krisp SDK includes technologies for both Outbound (Microphone) and Inbound (Speaker) Noise Cancellation.
De-Reverberation
When the noise cancellation algorithm runs, it also automatically performs de-reverberation removing room echo from the audio.
The technical specs and more details about the algorithms can be found here.
Background Voice Cancellation (BVC)
Background Voice Cancellation (BVC) technology is developed to cancel all background voices. It also removes all background noises and reverberation. The technology does not require user voice enrollment or training on user voice data. Krisp has deployed this technology in its Desktop applications, fixing the problem of cross-talk in call centers and offices.
BVC technology is designed to work with any headset and earbud. It works best with wired USB headsets with a boom microphone and is also compatible with most Bluetooth headsets, including AirPods.
Read more for the specs and details about the algorithm and supported devices.
Real-Time Noise and Voice Statistics
This real-time algorithm retrieves per-frame statistics about the levels of processed voice and removed noise. These statistics are represented as values within the range of 0 to 100, indicating the amount of voice and removed noise in each frame.
In addition to per-frame statistics, the algorithm includes an end-of-stream feature that enables users to retrieve information on the amount of removed noise classified into four categories: no noise, low, medium, and high. This feature also provides information on the total talk time accumulated from the start of the processing until the point at which the statistics are retrieved.
Accent Localization (AL)
Accent Localization (AL) is an AI-powered real-time voice conversion system designed to enhance communication clarity by neutralizing accents in call center environments. AL processes the audio securely on the device and dynamically changes the agentโs accent into the customer's natively understood accent in real-time.
Check out more details on AL technology data governance and architecture in the following doc.
Updated about 1 month ago