midasheng-audio-denoise

Voice enhancement and noise reduction service. Accepts a noisy audio file and returns a clean, denoised version. Use when user needs to remove background noise from audio files, clean up recordings, or preprocess audio for speech recognition.

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Install skill "midasheng-audio-denoise" with this command: npx skills add jimbozhang/midasheng-audio-denoise

midasheng-audio-denoise

Voice enhancement and noise reduction service. Denoising runs on a remote backend: the audio file is uploaded to the API and the cleaned audio is returned.

1. Trigger

Use this skill when the user wants to:

  • Remove background noise from an audio file
  • Clean up a noisy recording
  • Enhance voice clarity
  • Preprocess audio for speech recognition

2. API Details

Endpoint: POST https://llmplus.ai.xiaomi.com/dasheng/audio/denoise (multipart/form-data)

Parameters:

  • file: The audio file to denoise

Response: Binary audio stream (WAV format)

3. Usage

Basic denoising

curl -X POST "https://llmplus.ai.xiaomi.com/dasheng/audio/denoise" \
  -F "file=@noisy_recording.mp3" \
  -o clean_recording.wav
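
The same request can be sketched in Python using only the standard library. This is a minimal sketch, not the contents of scripts/denoise.py: the helper names `build_multipart` and `denoise` and the 120-second timeout are assumptions chosen for illustration. Only the endpoint URL and the `file` form field come from the API details above.

```python
import mimetypes
import urllib.request
import uuid
from pathlib import Path

DENOISE_URL = "https://llmplus.ai.xiaomi.com/dasheng/audio/denoise"


def build_multipart(field, filename, data, boundary):
    """Encode one file as a multipart/form-data body.

    Returns (body_bytes, content_type_header_value).
    """
    mime = mimetypes.guess_type(filename)[0] or "application/octet-stream"
    head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="{field}"; filename="{filename}"\r\n'
        f"Content-Type: {mime}\r\n\r\n"
    ).encode("utf-8")
    tail = f"\r\n--{boundary}--\r\n".encode("utf-8")
    return head + data + tail, f"multipart/form-data; boundary={boundary}"


def denoise(src, dst, timeout=120.0):
    """Upload src to the denoise endpoint and write the returned audio to dst.

    The timeout value is an assumption, not a documented limit.
    Returns the number of bytes written.
    """
    path = Path(src)
    body, content_type = build_multipart(
        "file", path.name, path.read_bytes(), uuid.uuid4().hex
    )
    req = urllib.request.Request(
        DENOISE_URL,
        data=body,
        method="POST",
        headers={"Content-Type": content_type},
    )
    with urllib.request.urlopen(req, timeout=timeout) as resp:
        audio = resp.read()
    Path(dst).write_bytes(audio)
    return len(audio)
```

Calling `denoise("noisy_recording.mp3", "clean_recording.wav")` would mirror the curl command. A non-2xx response raises `urllib.error.HTTPError`, so callers may want to catch it.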

Script usage

python3 scripts/denoise.py noisy_audio.mp3 -o clean_audio.wav
python3 scripts/denoise.py --queue   # Check queue status

4. Queue Status

Query commands

python3 scripts/denoise.py --queue
# Or call the API directly:
curl -X POST "https://llmplus.ai.xiaomi.com/metrics?path=/dasheng/audio/denoise"

Response fields

  • active: Number of currently active requests
  • avg_latency_ms: Average processing time (milliseconds)
  • Estimated wait (milliseconds) = active × avg_latency_ms
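
The estimated-wait formula can be sketched as a small Python helper. It assumes the metrics endpoint returns a JSON object with `active` and `avg_latency_ms` as top-level keys; the exact response shape is not documented here. The function name and the sample values are hypothetical.

```python
import json


def estimated_wait_ms(metrics_json):
    """Return active × avg_latency_ms from a metrics response body.

    Assumes a JSON object with top-level "active" and "avg_latency_ms" keys.
    """
    metrics = json.loads(metrics_json)
    return metrics["active"] * metrics["avg_latency_ms"]


# Hypothetical sample: 3 active requests at 4000 ms each.
sample = '{"active": 3, "avg_latency_ms": 4000}'
print(estimated_wait_ms(sample) / 1000, "seconds")
```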

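When to call

  1. When the IM session is about to time out but the denoise service has not yet returned a result: check the queue status, tell the user, and ask them to check back later.
  2. When the user later asks about task progress but the service still has not returned a result: check the latest queue status and report it to the user.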
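
One way to implement both cases is to run the denoise call in a background thread and fall back to the queue metrics when a wait deadline passes. The sketch below assumes two caller-supplied callables, `run_denoise` and `fetch_queue`. Both are hypothetical, as are the function names and the returned dictionary shape.

```python
from concurrent.futures import ThreadPoolExecutor, TimeoutError as FutureTimeout

# A single worker is enough for this sketch; adjust for concurrent jobs.
_pool = ThreadPoolExecutor(max_workers=1)


def start_denoise(run_denoise):
    """Start the (hypothetical) run_denoise callable in the background."""
    return _pool.submit(run_denoise)


def check_progress(job, fetch_queue, wait_s):
    """Wait up to wait_s seconds for the job to finish.

    On timeout, return the current queue metrics instead so the caller
    can tell the user to check back later. The job keeps running.
    """
    try:
        return {"status": "done", "result": job.result(timeout=wait_s)}
    except FutureTimeout:
        return {"status": "pending", "queue": fetch_queue()}
```

For case 1, call `check_progress` with a `wait_s` slightly shorter than the IM timeout. For case 2, call it again on the same job with a short `wait_s` when the user asks about progress.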

Status levels

  • 🟢 active=0 or estimated wait <5s → service idle
  • 🟡 estimated wait 5-30s → light queueing
  • 🔴 estimated wait >30s → long queue; suggest retrying later
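
The three tiers can be sketched as a single Python function. The function name and the labels `"idle"`, `"light"`, and `"long"` are hypothetical. The sketch assumes the boundary values of exactly 5s and 30s both belong to the middle tier.

```python
def queue_status(active, avg_latency_ms):
    """Map queue metrics to one of three tiers: "idle", "light", or "long"."""
    wait_s = active * avg_latency_ms / 1000  # estimated wait in seconds
    if active == 0 or wait_s < 5:
        return "idle"   # green: service is free
    if wait_s <= 30:
        return "light"  # yellow: light queueing
    return "long"       # red: suggest retrying later
```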

5. Supported Audio Formats

Input: mp3, wav, flac, ogg, m4a. Output: WAV.
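
A client-side pre-check on the file extension can catch unsupported inputs before uploading. This is a sketch only: the helper name is hypothetical, and the service may identify formats by content rather than by extension.

```python
from pathlib import Path

# Input formats listed above; matching by extension is an assumption.
SUPPORTED_INPUTS = {".mp3", ".wav", ".flac", ".ogg", ".m4a"}


def is_supported_input(filename):
    """Return True if the filename has one of the listed input extensions."""
    return Path(filename).suffix.lower() in SUPPORTED_INPUTS
```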

6. Troubleshooting

  • Empty output: Input file may be too short
  • API request failed: Verify network connectivity
  • Poor results: Works best on speech with background noise
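
To detect the empty-output case programmatically, the returned bytes can be inspected with Python's standard `wave` module. This is a minimal sketch with a hypothetical helper name. It assumes the response is PCM WAV that `wave` can parse, which the API description does not confirm.

```python
import io
import wave


def wav_duration_s(data):
    """Return the duration in seconds of WAV bytes.

    Returns 0.0 for empty data, and None if the bytes cannot be
    parsed by the wave module (for example, non-PCM or truncated data).
    """
    if not data:
        return 0.0
    try:
        with wave.open(io.BytesIO(data), "rb") as wav:
            return wav.getnframes() / wav.getframerate()
    except (wave.Error, EOFError):
        return None
```

A result of 0.0 suggests the empty-output case listed above, while None suggests the response was not the expected WAV stream.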
