wenet-kws

Production First and Production Ready End-to-End Keyword Spotting Toolkit.

The goal of this toolkit it to...

Small footprint keyword spotting (KWS), or specifically wake-up word (WuW) detection is a typical and important module in internet of things (IoT) devices. It provides a way for users to control IoT devices with a hands-free experience. A WuW detection system usually runs locally and persistently on IoT devices, which requires low consumptional power, less model parameters, low computational comlexity and to detect predefined keyword in a streaming way, i.e., requires low latency.

Typical Scenario

We are going to support the following typical applications of wakeup word:

Single wake-up word
Multiple wake-up words
Customizable wake-up word
Personalized wake-up word, i.e. combination of wake-up word detection and voiceprint

Dataset

We plan to support a variaty of open source wake-up word datasets, include but not limited to:

All the well-trained models on these dataset will be made public avaliable.

Runtime

We plan to support a variaty of hardwares and platforms, including:

Web browser
x86
Android
Raspberry Pi

Production First and Production Ready End-to-End Keyword Spotting Toolkit

Related tags

Overview

wenet-kws

Typical Scenario

Dataset

Runtime

Owner

中文問句產生器；使用台達電閱讀理解資料集(DRCD)

Bidirectional Variational Inference for Non-Autoregressive Text-to-Speech (BVAE-TTS)

Python interface for converting Penn Treebank trees to Stanford Dependencies and Universal Depenencies

Faster, modernized fork of the language identification tool langid.py

Code for the paper "Are Sixteen Heads Really Better than One?"

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

Code for text augmentation method leveraging large-scale language models

A full spaCy pipeline and models for scientific/biomedical documents.

Dé op-de-vlucht Pieton vertaler. Wereldwijd gebruikt door meer dan 1.000+ succesvolle bedrijven!

Unet-TTS: Improving Unseen Speaker and Style Transfer in One-shot Voice Cloning

Deduplication is the task to combine different representations of the same real world entity.

Utilities for preprocessing text for deep learning with Keras

RIDE automatically creates the package and boilerplate OOP Python node scripts as per your needs

🦅 Pretrained BigBird Model for Korean (up to 4096 tokens)

自然言語で書かれた時間情報表現を抽出/規格化するルールベースの解析器

VADER Sentiment Analysis. VADER (Valence Aware Dictionary and sEntiment Reasoner) is a lexicon and rule-based sentiment analysis tool that is specifically attuned to sentiments expressed in social media, and works well on texts from other domains.

this repository has datasets containing information of Uber pickups in NYC from April 2014 to September 2014 and January to June 2015. data Analysis , virtualization and some insights are gathered here

Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further languages

A simple tool to update bib entries with their official information (e.g., DBLP or the ACL anthology).

A PyTorch Implementation of End-to-End Models for Speech-to-Text