Fake news detector filters - Smart filter project allow to classify the quality of information and web pages

Last update: Jan 04, 2022

Related tags

Overview

fake-news-detector-1.0

Lists, lists and more lists...

Spam filter list, quality keyword list, stoplist list, top-domains urls list, news agencies websites list, university websites list, business websites lists and government organizations lists.

This gives us an initial score for the authority presenting the information.

If we can verify the source we are on the right track for building a fake news detector.

SPAM FILTER

The spam filter also gives us clues on the quality of the source.

TESTING, TESTING and TESTING

Next step is running more tests to see if the concept works.

Then we need to find more lists and maybe other tools like an API that can give us clues im discovering fake news.

I don't have all the answers but I am willing to code. It is a complicated problem and we may be limited on what can be done.

API may be a solution

I found two API that can make the project work.

URL Reputation API https://www.apivoid.com/api/url-reputation/

With this URL Reputation API you can detect potentially phishing and malicious URLs. We deeply analyze the URL (including the URL content, URL pattern, domain name, HTTP headers, domain TLD, etc) It not free so I will abandonne the API for the moment.

I found another API that could help the project in a more complicated way.

Search API worldwide news https://newsapi.org/?ref=apilist.fun

We could cross reference news events with this API. We could us it to validate if the story is fake or is trending. But this could get complicated.

Memo Sim @ Fake news detector filters project

AFTER TESTING : THE LIST CONCEPT WORKS VERY WELL

The lists work very well together and the system is able to detect bad and good sites. I am very happy with this module. We are also able to get nice quality indicators and statistics for web page quality source evaluation.

Fake news detector filters - Smart filter project allow to classify the quality of information and web pages

Related tags

Overview

Owner

Memo Sim

Pytorch NLP library based on FastAI

Use the power of GPT3 to execute any function inside your programs just by giving some doctests

A python script that will use hydra to get user and password to login to ssh, ftp, and telnet

A large-scale (194k), Multiple-Choice Question Answering (MCQA) dataset designed to address realworld medical entrance exam questions.

Sentello is python script that simulates the anti-evasion and anti-analysis techniques used by malware.

Code for "Semantic Role Labeling as Dependency Parsing: Exploring Latent Tree Structures Inside Arguments".

LSTM model - IMDB review sentiment analysis

Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.

Neural network sequence labeling model

novel deep learning research works with PaddlePaddle

:mag: Transformers at scale for question answering & neural search. Using NLP via a modular Retriever-Reader-Pipeline. Supporting DPR, Elasticsearch, HuggingFace's Modelhub...

This repository structures data in title, summary, tags, sentiment given a fragment of a conversation

Faster, modernized fork of the language identification tool langid.py

IMDB film review sentiment classification based on BERT's supervised learning model.

A program that uses real statistics to choose the best times to bet on BloxFlip's crash gamemode

Search-Engine - 📖 AI based search engine

CDLA: A Chinese document layout analysis (CDLA) dataset

Hierarchical unsupervised and semi-supervised topic models for sparse count data with CorEx

The official implementation of VAENAR-TTS, a VAE based non-autoregressive TTS model.

Malware-Related Sentence Classification