feat(social): energy+ZCR voice activity detection node (Issue #242) #247
Loading…
x
Reference in New Issue
Block a user
No description provided.
Delete Branch "sl-jetson/issue-242-vad"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Summary
New
vad_nodeinsaltybot_social:/social/speech/audio_raw(UInt8MultiArrayPCM-16 LE)energy_db >= rms_threshold_db AND zcr_min <= zcr <= zcr_maxVadStateMachineprovides onset/offset hysteresis to prevent chattering/social/speech/is_speaking(Bool) and/social/speech/energy(Float32linear RMS)Parameters
rms_threshold_db-35.0zcr_min0.01zcr_max0.40onset_frames2is_speaking=trueoffset_frames8is_speaking=falseaudio_topic/social/speech/audio_rawZCR bands (16 kHz)
Test plan
test_vad_node.py)pcm16_bytes_to_float32: roundtrip, edge casesrms_linear/rms_db: silence, full-scale, sine RMS = A/√2zero_crossing_rate: alternating=1.0, sine in range, higher freq → higher ZCRVadStateMachine: onset/offset hysteresis, resetCloses #242
🤖 Generated with Claude Code