AI Voice SDK

Use AI Voice SDK with the help of our comprehensive guide. Easily integrate AI-powered noise cancellation and audio solutions into your applications.

📝 What is Krisp's AI Voice SDK?

AI Voice SDK is a collection of real-time AI-powered technologies that improve speech clarity in real-time communication applications such as

  • Voice AI Agents
  • Online video and audio conferencing
  • Online collaboration
  • Streaming, broadcasting or podcasting
  • Mobile calls

Currently, AI Voice SDK has been deployed on more than 200M+ devices and is clearing 75B+ mins of speech every month. It is integrated inside apps such as Discord, RingCentral, and Twilio Video SDK.

AI Voice SDK includes an impressive collection of AI-powered technologies that can be integrated into your application:

Voice Isolation SDK for Voice AI Agents

Deployed on server to remove background voices and noise in real-time improving the input for Voice AI Agents. Designed to be telco robust.

Noise Cancellation SDK for Calls

In real-time, separates background noises from a human voice, both inbound and outbound.

Voice Cancellation SDK for Calls

In real-time, separates background voices and noises from the main speaker's voice.

Accent Conversion SDK for Calls

In real-time, converts speakers accent while maintaining their voice.

AI Noise and Voice Statistics

In real-time, evaluates the level of noise and voice in the audio stream.

Session AI Noise and Voice Statistics

At the end of the session provides noise statistics based on the noise intensity levels and talk time

How Does it Work?

Runs on-device, on cloud or API

  • Integrated in client apps (native and browser) and on server (Voice AI Agents)
  • Integrates into the local application’s audio path (e.g. WebRTC)

Minimal latency

  • Not noticeable for real-time communications
  • Measured latency dependent on frame size and platform

Multiple models for different needs

  • Supports NB, WB and Full-Band audio
  • Supports different CPU speeds

Removed noise types

  • Other voices, TV, Music, Office, Street, Chatter, Baby, Animals, Keyboard, Fan, etc.