Recognition of 38 speech commands in Russian. Based on Yandex Cup 2021 ML Challenge: ASR

Last update: May 05, 2022

Overview

Speech_38_ru_commands

The program recognizes 38 Russian keywords spoken into a microphone. The supported keywords are:

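дальше (further), вперед (forward), назад (back), вверх (up), вниз (down), выше (higher), ниже (lower), домой (home), громче (louder), тише (quieter), лайк (like), дизлайк (dislike), следующий (next), предыдущий (previous), сначала (from the start), перемотай (rewind), выключи (turn off), стоп (stop), хватит (enough), замолчи (be quiet), заткнись (shut up), останови (stop it), пауза (pause), включи (turn on), смотреть (watch), продолжи (continue), играй (play), запусти (launch), ноль (zero), один (one), два (two), три (three), четыре (four), пять (five), шесть (six), семь (seven), восемь (eight), девять (nine).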

The model was prepared for the Yandex Cup 2021 ML Challenge: ASR competition, where it took 3rd place out of 54 participants with an accuracy score of 92.01.

Download the model from https://disk.yandex.ru/d/L053qF-0OPKlog

Example of running the program:

python speech_38_ru_commands.py --porog 1.2

Here 1.2 is the confidence threshold for accepting a command. It can be set anywhere in the range 0.0 to 7.9999.
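To illustrate how a threshold like `--porog` could gate recognized commands, here is a minimal sketch. Only the flag name and the 0.0 to 7.9999 range come from the README. The helper names (`porog_type`, `build_parser`, `accept_command`), the default value, the shape of the score dictionary, and the "score must be at least the threshold" rule are assumptions made for illustration; the actual script may compute and compare confidence differently.

```python
import argparse

# Range stated in the README for the --porog option.
POROG_MIN, POROG_MAX = 0.0, 7.9999


def porog_type(text):
    """Parse the threshold and reject values outside the documented range."""
    value = float(text)
    if not (POROG_MIN <= value <= POROG_MAX):
        raise argparse.ArgumentTypeError(
            f"--porog must be between {POROG_MIN} and {POROG_MAX}, got {value}"
        )
    return value


def build_parser():
    """Hypothetical parser; the default of 1.2 is an assumption."""
    parser = argparse.ArgumentParser(description="Threshold sketch")
    parser.add_argument("--porog", type=porog_type, default=1.2,
                        help="confidence threshold for accepting a command")
    return parser


def accept_command(scores, porog):
    """Return the best-scoring command if it reaches the threshold, else None.

    `scores` maps each command word to a confidence value (assumed format).
    """
    best = max(scores, key=scores.get)
    return best if scores[best] >= porog else None


args = build_parser().parse_args(["--porog", "1.2"])
print(accept_command({"стоп": 2.5, "пауза": 0.3}, args.porog))  # confident: accepted
print(accept_command({"стоп": 0.9, "пауза": 0.3}, args.porog))  # below threshold: None
```

Under these assumptions, a higher threshold means fewer false triggers but more ignored commands, and a lower threshold means the opposite.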

Owner

Andrey
