Voice Activity Detection for JavaScript

npm packages: vad-web, vad-node, vad-react

🎉 New collaborators, Discord channel, and more 🚀

I am thrilled to share that new collaborators from Pleap are joining the development work on this project. We are also starting a Discord server for users and contributors alike. We would love to have additional collaborators. If you are interested, please message us on Discord!

Overview

This package aims to provide an accurate, user-friendly voice activity detector (VAD) that runs in the browser. It also has limited support for Node.js. Currently, it runs Silero VAD [1] using ONNX Runtime Web in the browser and ONNX Runtime Node.js in Node.js.

For documentation and a demo, visit vad.ricky0123.com.

Quick Start

To use the VAD directly in the browser, add the following script tags to your page:

<script src="https://cdn.jsdelivr.net/npm/onnxruntime-web/dist/ort.js"></script>
<script src="https://cdn.jsdelivr.net/npm/@ricky0123/[email protected]/dist/bundle.min.js"></script>
<script>
  async function main() {
    const myvad = await vad.MicVAD.new({
      onSpeechStart: () => {
        console.log("Speech start detected")
      },
      onSpeechEnd: (audio) => {
        // do something with `audio` (Float32Array of audio samples at sample rate 16000)...
      }
    })
    myvad.start()
  }
  main()
</script>
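
The `onSpeechEnd` callback above receives the detected speech as a `Float32Array` sampled at 16000 Hz. One common next step is to turn those samples into a WAV file for playback or upload. The following is a minimal sketch of that conversion, not part of this package's documented API: the helper name `float32ToWav` is hypothetical, and it assumes mono audio with samples in the range [-1, 1].

```javascript
// Hypothetical helper (an assumption for illustration, not a library function):
// encodes mono Float32 samples as a 16-bit PCM WAV file in an ArrayBuffer.
function float32ToWav(samples, sampleRate = 16000) {
  const bytesPerSample = 2
  const dataSize = samples.length * bytesPerSample
  const buffer = new ArrayBuffer(44 + dataSize) // 44-byte header + PCM data
  const view = new DataView(buffer)

  // Write an ASCII tag such as "RIFF" at the given byte offset.
  const writeString = (offset, text) => {
    for (let i = 0; i < text.length; i++) {
      view.setUint8(offset + i, text.charCodeAt(i))
    }
  }

  writeString(0, "RIFF")
  view.setUint32(4, 36 + dataSize, true) // remaining file size
  writeString(8, "WAVE")
  writeString(12, "fmt ")
  view.setUint32(16, 16, true) // size of the fmt chunk
  view.setUint16(20, 1, true) // audio format 1 = PCM
  view.setUint16(22, 1, true) // channel count: mono
  view.setUint32(24, sampleRate, true)
  view.setUint32(28, sampleRate * bytesPerSample, true) // byte rate
  view.setUint16(32, bytesPerSample, true) // block align
  view.setUint16(34, 16, true) // bits per sample
  writeString(36, "data")
  view.setUint32(40, dataSize, true)

  // Clamp each sample to [-1, 1] and scale it to a signed 16-bit integer.
  for (let i = 0; i < samples.length; i++) {
    const s = Math.max(-1, Math.min(1, samples[i]))
    const scaled = Math.round(s < 0 ? s * 0x8000 : s * 0x7fff)
    view.setInt16(44 + i * bytesPerSample, scaled, true)
  }
  return buffer
}
```

Inside `onSpeechEnd`, the result could be wrapped as `new Blob([float32ToWav(audio)], { type: "audio/wav" })` and passed to `URL.createObjectURL` for playback in an `<audio>` element.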

For instructions on bundling the voice activity detector for the browser, or on using it in Node.js or React projects, see vad.ricky0123.com.

References

[1] Silero Team. (2021). Silero VAD: pre-trained enterprise-grade Voice Activity Detector (VAD), Number Detector and Language Classifier. GitHub repository, https://github.com/snakers4/silero-vad.
