CorNet Correlation Networks for Extreme Multi-label Text Classification

Easy Data Augmentation Implementation This repository contains my Implementation for the paper EDA: Easy Data Augmentation Techniques for Boosting Per

9 Oct 31, 2022

Fake news detector filters - a smart filter project that classifies the quality of information and web pages

fake-news-detector-1.0 Lists, lists and more lists... Spam filter list, quality keyword list, stoplist list, top-domains urls list, news agencies webs

1 Jan 04, 2022

CVSS: A Massively Multilingual Speech-to-Speech Translation Corpus

CVSS: A Massively Multilingual Speech-to-Speech Translation Corpus CVSS is a massively multilingual-to-English speech-to-speech translation corpus, co

118 Jan 06, 2023

The first online catalogue for Arabic NLP datasets.

Masader The first online catalogue for Arabic NLP datasets. This catalogue contains 200 datasets with more than 25 metadata annotations for each datas

94 Dec 26, 2022

auto_code_complete is an auto word-completion program that you can customize to your needs

auto_code_complete v1.3 purpose and usage auto_code_complete is an auto word-completion program that you can customize to your needs. the m

2 Feb 22, 2022

A multi-modal Chinese spell checker released at ACL 2021.

ReaLiSe ReaLiSe is a multi-modal Chinese spell checking model. This is the official code for the paper Read, Listen, and See: Leveraging Multimodal Informa

106 Dec 29, 2022

Full Spectrum Bioinformatics - a free online text designed to introduce key topics in Bioinformatics using the Python

Full Spectrum Bioinformatics is a free online text designed to introduce key topics in Bioinformatics using the Python programming language. The text is written in interactive Jupyter Notebooks, whic

33 Dec 28, 2022

Indobenchmark is a collection of Natural Language Understanding (IndoNLU) and Natural Language Generation (IndoNLG)

Indobenchmark Toolkit Indobenchmark is a collection of Natural Language Understanding (IndoNLU) and Natural Language Generation (IndoNLG) resources fo

11 Aug 26, 2022

Conditional Transformer Language Model for Controllable Generation

CTRL - A Conditional Transformer Language Model for Controllable Generation Authors: Nitish Shirish Keskar, Bryan McCann, Lav Varshney, Caiming Xiong,

1.7k Dec 28, 2022

Voice Assistant inspired by Google Assistant, Cortana, Alexa, Siri, ...

author: @shival_gupta VoiceAI This program is an example of a simple virtual assistant. It will listen to you and act accordingly. It will begin with wish

1 Jan 06, 2022

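NLP subtasks self-studied during a master's degree, provided for learning and reference

NLP_Chinese_down_stream_task Self-studied NLP subtasks, provided for learning and reference. Task 1: short text classification (1). Dataset: THUCNews Chinese text dataset (10 classes) (2). Model: BERT+FC/LSTM, implemented in PyTorch (3). Usage: the pretrained model is Chinese BERT-WWM, download addr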

12 May 31, 2022

An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition

CRNN paper: An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition 1. create your ow

3 Apr 02, 2022

Overview

CorNet

Prerequisites

Datasets

Run

Baselines

Owner

Guangxu Xun

This project is part of EleutherAI's quest to create a massive repository of high-quality text data for training language models.

My implementation of the paper EDA: Easy Data Augmentation Techniques for Boosting Performance on Text Classification Tasks using TensorFlow

NLP command-line assistant powered by OpenAI

Yet another Python binding for fastText

Minimal GUI for accessing the Watson Text to Speech service.

Chinese segmentation library

hashily is a Python module that provides a variety of text decoding and encoding operations.

Text Normalization (文本正则化)

Predict an emoji that is associated with a text