Kochat

챗봇 빌더는 성에 안차고, 자신만의 딥러닝 챗봇 애플리케이션을 만드시고 싶으신가요?
Kochat을 이용하면 손쉽게 자신만의 딥러닝 챗봇 애플리케이션을 빌드할 수 있습니다.

# 1. 데이터셋 객체 생성
dataset = Dataset(ood=True)

# 2. 임베딩 프로세서 생성
emb = GensimEmbedder(model=embed.FastText())

# 3. 의도(Intent) 분류기 생성
clf = DistanceClassifier(
    model=intent.CNN(dataset.intent_dict),                  
    loss=CenterLoss(dataset.intent_dict)                    
)

# 4. 개체명(Named Entity) 인식기 생성                                                     
rcn = EntityRecognizer(
    model=entity.LSTM(dataset.entity_dict),
    loss=CRFLoss(dataset.entity_dict)
)

# 5. 딥러닝 챗봇 RESTful API 학습 & 빌드
kochat = KochatApi(
    dataset=dataset, 
    embed_processor=(emb, True), 
    intent_classifier=(clf, True),
    entity_recognizer=(rcn, True), 
    scenarios=[
        weather, dust, travel, restaurant
    ]
)

# 6. View 소스파일과 연결                                                                                                        
@kochat.app.route('/')
def index():
    return render_template("index.html")

# 7. 챗봇 애플리케이션 서버 가동                                                          
if __name__ == '__main__':
    kochat.app.template_folder = kochat.root_dir + 'templates'
    kochat.app.static_folder = kochat.root_dir + 'static'
    kochat.app.run(port=8080, host='0.0.0.0')

Why Kochat?

한국어를 지원하는 최초의 오픈소스 딥러닝 챗봇 프레임워크입니다. (빌더와는 다릅니다.)
다양한 Pre built-in 모델과 Loss함수를 지원합니다. NLP를 잘 몰라도 챗봇을 만들 수 있습니다.
자신만의 커스텀 모델, Loss함수를 적용할 수 있습니다. NLP 전문가에겐 더욱 유용합니다.
챗봇에 필요한 데이터 전처리, 모델, 학습 파이프라인, RESTful API까지 모든 부분을 제공합니다.
가격 등을 신경쓸 필요 없으며, 앞으로도 쭉 오픈소스 프로젝트로 제공할 예정입니다.
아래와 같은 다양한 성능 평가 메트릭과 강력한 시각화 기능을 제공합니다.

Documentation

Reference

License

Copyright 2020 Hyunwoong Ko.

Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.

kochat

Related tags

Overview

Kochat

Why Kochat?

Documentation

Reference

License

Owner

A fast and easy implementation of Transformer with PyTorch.

QVHighlights: Detecting Moments and Highlights in Videos via Natural Language Queries

This repository implements a brute-force spellchecker utilizing the Damerau-Levenshtein edit distance.

Simple python code to fix your combo list by removing any text after a separator or removing duplicate combos

Sentiment Classification using WSD, Maximum Entropy & Naive Bayes Classifiers

pysentimiento: A Python toolkit for Sentiment Analysis and Social NLP tasks

UniSpeech - Large Scale Self-Supervised Learning for Speech

Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow

BiQE: Code and dataset for the BiQE paper

ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab

Code for papers "Generation-Augmented Retrieval for Open-Domain Question Answering" and "Reader-Guided Passage Reranking for Open-Domain Question Answering", ACL 2021

Code for the paper "BERT Loses Patience: Fast and Robust Inference with Early Exit".

The simple project to separate mixed voice (2 clean voices) to 2 separate voices.

Search msDS-AllowedToActOnBehalfOfOtherIdentity

Twitter-Sentiment-Analysis - Analysis of twitter posts' positive and negative score.

A large-scale (194k), Multiple-Choice Question Answering (MCQA) dataset designed to address realworld medical entrance exam questions.

ASCEND Chinese-English code-switching dataset

An extension for asreview implements a version of the tf-idf feature extractor that saves the matrix and the vocabulary.

🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy

Search-Engine - 📖 AI based search engine