Fake news detection

Implements a fake news detection program using classifiers for Data Mining course at UoA.

Description

The project is the categorization of text data by news articles and specifically the detection of fake news. The data contains 2 files in csv format (Fake.csv, True.csv)

Data Preprocessing

Removed punctuation and made all letters uniform after dropped every null row

Feature Extraction

To analyse the preprocessed data it has to be represented in a numeric format by using:

Bag of Words - one of the simplest word embedding approaches
TF-IDF is a bag words that applies a regularization algorithm.
Word vectors from Word2Vec model to create a vector representation for a sentence.

Classifiers

For every of the following classifiers there is a detailed analysis in the pytorch file

Logistic Regression
Naive Bayes
Support Vector Machine
Random Forests
Voting Classifier

Metrics

We evaluate performance of each method in test data using the following evaluation metrics:

Accuracy score
F1 score which is the weighted average of precision and recall and thus it is used especially for uneven class distribution problems.

Contributors

Apostolos Karvelas

Ioannis Papadimitriou

Implements a fake news detection program using classifiers.

Related tags

Overview

Fake news detection

Description

Data Preprocessing

Feature Extraction

Classifiers

Metrics

Contributors

Owner

Apostolos Karvelas

(CVPR 2022 Oral) Official implementation for "Surface Representation for Point Clouds"

This repository contains code used to audit the stability of personality predictions made by two algorithmic hiring systems

Large-Scale Unsupervised Object Discovery

Py-FEAT: Python Facial Expression Analysis Toolbox

Unofficial & improved implementation of NeRF--: Neural Radiance Fields Without Known Camera Parameters

Pre-trained models for a Cascaded-FCN in caffe and tensorflow that segments

A Simple Key-Value Data-store written in Python

Code and data to accompany the camera-ready version of "Cross-Attention is All You Need: Adapting Pretrained Transformers for Machine Translation" in EMNLP 2021

[Nature Machine Intelligence' 21] "Advancing COVID-19 Diagnosis with Privacy-Preserving Collaboration in Artificial Intelligence"

Code for Domain Adaptive Video Segmentation via Temporal Consistency Regularization in ICCV 2021

A transformer which can randomly augment VOC format dataset (both image and bbox) online.

PyTorch wrappers for using your model in audacity!

This repository includes the official project for the paper: TransMix: Attend to Mix for Vision Transformers.

Pytorch Lightning code guideline for conferences

Codes for ACL-IJCNLP 2021 Paper "Zero-shot Fact Verification by Claim Generation"

A project studying the influence of communication in multi-objective normal-form games

This is the dataset for testing the robustness of various VO/VIO methods

iris - Open Source Photos Platform Powered by PyTorch

Official repository for the CVPR 2021 paper "Learning Feature Aggregation for Deep 3D Morphable Models"

Code for the ACL2021 paper "Lexicon Enhanced Chinese Sequence Labelling Using BERT Adapter"