Text Classification in Turkish Texts with Bert

Last update: Dec 31, 2022

Overview

You can watch the details of the project on my youtube channel

Project Interface

Project Second Interface

Goal= Correctly guessing the classification of texts and audios

BERT_Text_Classification

It is a text classification task implementation transformers (by HuggingFace) with BERT. It contains several parts:

--Data pre-processing

--BERT tokenization and input formating

--Train with BERT

--Evaluation

--Save and load saved model

Text-classification-transformers

Text classification tasks are most easily encountered in the area of natural language processing and can be used in various ways.

However, the given data needs to be preprocessed and the model's data pipeline must be created according to the preprocessing.

The purpose of this Repository is to allow text classification to be easily performed with Transformers (BERT)-like models if text classification data has been preprocessed into a specific structure.

Implemented based on Huggingfcae transformers for quick and convenient implementation.

Text Classification in Turkish Texts with Bert

Related tags

Overview

You can watch the details of the project on my youtube channel

Project Interface

Project Second Interface

BERT_Text_Classification

Text-classification-transformers

📝 read_dataset

Unique Categories

☄️ Available models

🏴‍☠️ Model Performance

Predictions Vs Actuals

🃏 predictor

97.22 📈

Owner

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

Knowledge Graph,Question Answering System，基于知识图谱和向量检索的医疗诊断问答系统

Transformers4Rec is a flexible and efficient library for sequential and session-based recommendation, available for both PyTorch and Tensorflow.

An open source library for deep learning end-to-end dialog systems and chatbots.

2021 2학기 데이터크롤링 기말프로젝트

Rootski - Full codebase for rootski.io (without the data)

Text preprocessing, representation and visualization from zero to hero.

Longformer: The Long-Document Transformer

A curated list of efficient attention modules

[AAAI 21] Curriculum Labeling: Revisiting Pseudo-Labeling for Semi-Supervised Learning

Contract Understanding Atticus Dataset

Easy to use, state-of-the-art Neural Machine Translation for 100+ languages

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Gpt2-WebAPI - The objective of this API is to provide the 3 best possible responses to sentences that the user would input via http GET request as a parameter

Unofficial PyTorch implementation of Google AI's VoiceFilter system

CJK computer science terms comparison / 中日韓電腦科學術語對照 / 日中韓のコンピュータ科学の用語対照 / 한·중·일 전산학 용어 대조

Model parallel transformers in JAX and Haiku

Deduplication is the task to combine different representations of the same real world entity.

Treemap visualisation of Maya scene files

Deploying a Text Summarization NLP use case on Docker Container Utilizing Nvidia GPU