📝 What is Krisp's AI Voice SDK?

AI Voice SDK is a collection of real-time AI-powered technologies that improve speech clarity in real-time communication applications such as

Voice AI Agents
Online video and audio conferencing
Online collaboration
Streaming, broadcasting or podcasting
Mobile calls

Currently, AI Voice SDK has been deployed on more than 200M+ devices and is clearing 75B+ mins of speech every month. It is integrated inside apps such as Discord, RingCentral, and Twilio Video SDK.

AI Voice SDK includes an impressive collection of AI-powered technologies that can be integrated into your application:

Voice Isolation SDK for Voice AI Agents

Deployed on server to remove background voices and noise in real-time improving the input for Voice AI Agents. Designed to be telco robust.

Noise Cancellation SDK for Calls

In real-time, separates background noises from a human voice, both inbound and outbound.

Voice Cancellation SDK for Calls

In real-time, separates background voices and noises from the main speaker's voice.

Accent Conversion SDK for Calls

In real-time, converts speakers accent while maintaining their voice.

AI Noise and Voice Statistics

In real-time, evaluates the level of noise and voice in the audio stream.

Session AI Noise and Voice Statistics

At the end of the session provides noise statistics based on the noise intensity levels and talk time

How Does it Work?

Runs on-device, on cloud or API

Integrated in client apps (native and browser) and on server (Voice AI Agents)
Integrates into the local application’s audio path (e.g. WebRTC)

Minimal latency

Not noticeable for real-time communications
Measured latency dependent on frame size and platform

Multiple models for different needs

Supports NB, WB and Full-Band audio
Supports different CPU speeds

Removed noise types

Other voices, TV, Music, Office, Street, Chatter, Baby, Animals, Keyboard, Fan, etc.