ConferencingSpeech2022; Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge

Last update: Dec 02, 2022

Related tags

Text Data & NLP ConferencingSpeech2022

Overview

ConferencingSpeech 2022 challenge

This repository contains the datasets list and scripts required for the ConferencingSpeech 2022 challenge. For more details about the challenge, please see our website.

Details

baseline, this folder contains baseline system include inference model exported by inference scripts;
eval, this folder contains evaluation scripts to calculate PLCC, RMSE and SRCC;
data-sets, this folder contains training and development test data-sets provied to the participant;
- Tencent Corpus, this dataset includes about 14,000 speech chinese speech clips with simulated (e.g. codecs, packet-loss, background noise) and live conditions.
- NISQA Corpus, the NISQA Corpus includes more than 14,000 speech samples with simulated (e.g. codecs, packet-loss, background noise) and live (e.g. mobile phone, Zoom, Skype, WhatsApp) conditions.
- IU Bloomington Corpus, there are 10,000 speech signals extracted from COSINE and VOiCESdatasets, each truncated between 3 to 6 seconds long.
- PSTN Corpus, there are about 80,000 speech clips through classic public switched telephone networks, each truncated 10 seconds long.

Requirements

To install requirements install Anaconda and then use:

conda env create -f envs.yml

This will create a new environment with the name "conferencingSpeech". Activate this environment to go on:

conda activate conferencingSpeech

Code license

Apache 2.0

ConferencingSpeech2022; Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge

Related tags

Overview

ConferencingSpeech 2022 challenge

Details

Requirements

Code license

Owner

A Chinese to English Neural Model Translation Project

Contains analysis of trends from Fitbit Dataset (source: Kaggle) to see how the trends can be applied to Bellabeat customers and Bellabeat products

LeBenchmark: a reproducible framework for assessing SSL from speech

This is the offline-training-pipeline for our project.

Negative sampling for solving the unlabeled entity problem in NER. ICLR-2021 paper: Empirical Analysis of Unlabeled Entity Problem in Named Entity Recognition.

BROS: A Pre-trained Language Model Focusing on Text and Layout for Better Key Information Extraction from Documents

Correctly generate plurals, ordinals, indefinite articles; convert numbers to words

GPT-Code-Clippy (GPT-CC) is an open source version of GitHub Copilot, a language model

Chatbot for the Chatango messaging platform

DeepSpeech - Easy-to-use Speech Toolkit including SOTA ASR pipeline, influential TTS with text frontend and End-to-End Speech Simultaneous Translation.

Princeton NLP's pre-training library based on fairseq with DeepSpeed kernel integration 🚃

New Modeling The Background CodeBase

Gathers machine learning and Tensorflow deep learning models for NLP problems, 1.13 < Tensorflow < 2.0

Tools, wrappers, etc... for data science with a concentration on text processing

AI and Machine Learning workflows on Anthos Bare Metal.

Beyond Accuracy: Behavioral Testing of NLP models with CheckList

Train 🤗-transformers model with Poutyne.

Twitter bot that uses NLP models to summarize news articles referenced in a user's twitter timeline

Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax.

Super Tickets in Pre-Trained Language Models: From Model Compression to Improving Generalization (ACL 2021)