Scene-based AI noise reduction refers to the real-time automatic recognition of different scenes and intelligent adjustment of AI noise reduction strategies to provide the best noise reduction and audio quality effects. Currently, two common noise reduction scenes are supported:
This feature is suitable for 1v1 or multi-person audio and video call scenes such as voice chat rooms, meetings, and voice gaming, as well as live streaming or online KTV scenes with sound cards, singing, and near-field music.
To enable music scene recognition, please turn on the music detection switch and contact ZEGOCLOUD technical support to configure the music detection function.
Developers can use this feature to eliminate the following noises:
Scene | Typical Noises |
---|---|
Meeting Room |
|
Office |
|
Transportation |
|
Internet Cafe |
|
Coffee Shop |
|
原始音频包含:鼠标点击声、键盘声、鼓掌声、摩擦声、办公室嘈杂声、空调声等。
AI 降噪后:
原始音频包含:雨声、电车声、炒菜声、汽车呼啸声等。
AI 降噪后:
Before implementing the AI denoising feature, please make sure:
Developers can follow the following steps to configure AI noise reduction:
Please contact ZEGOCLOUD technical support to enable the music detection feature. If it is already enabled, please ignore this step.
For the specific process of initialization and logging into the room, please refer to the implementation guide in the video call documentation for "Create engine" and "Join room".
Call the enableANS interface to enable noise suppression. This feature can make the human voice clearer after it is enabled.
After enabling noise suppression, developers can call the setANSMode interface to set the ANS mode and enable the AI denoising feature. The following are some AI denoising modes, for more modes please refer to ZegoANSMode.
AI Denoising Mode | Applicable Scenarios |
---|---|
ZegoANSMode.AI |
Lightweight mode with low power consumption and package size, still providing good denoising effects. Suitable for indoor noise environments and relatively comfortable regions in China. |
ZegoANSMode.AI_BALANCED |
Balanced mode that completely eliminates noise while preserving the human voice without loss. Slightly increased power consumption. Suitable for complex communication environments such as outdoor markets, transportation, and regions with severe noise interference. |
ZegoANSMode.AI_LOW_LATENCY |
Low latency mode that maintains pure denoising effects and high-fidelity voice quality even with a 10ms delay. Suitable for latency-sensitive scenarios such as game voice chat, game team communication, and real-time singing. |
// Enable ANS
engine.enableANS(true);
// Set AI noise suppression mode according to requirements. Note: After setting ANS mode to ZegoANSMode, ZEGO Express SDK will forcibly disable transient noise suppression [enableTransientANS]
engine.setANSMode(ZegoANSMode.AI);