Quantifiers-and-Negations-in-RE-Documents

This project was part of my work for a seminar at the Technical University of Munich (TUM) during my bachelor studies in 2019. The python project can be used to find quantifiers and negations in documents. It searches for problematic findings. Problematic findings are i.e. sentences that use specific combinations of quantifiers and negations that are ambiguous. This means there are multiple valid interpretations of the sentence. It can extract those and report them.

Motivation:

You want to avoid ambiguous sentences as they can cause problems that are hard to find and possibly hard to fix. This is especially the case for technical specifications and similar use cases. In this project we compare two different approaches to finding ambiguous sentences:

String based search
NLP based search

We want to find out if the computational overhead of using NLP gives better results than standard string based search methods.

Features:

Detect quantifiers and negations in .xml or .txt documents
Search either by a string based search or by NLP based search (using Stanfords CoreNLP library [1])
Extract possibly ambiguous sentences
Compare string search results with NLP search results

Prerequisites:

Java 8 or higher
Python 3.6 or higher as project interpreter
Stanford Corenlp library: https://stanfordnlp.github.io/CoreNLP/download.html
Environment variable "CORENLP_HOME" set to where the CoreNLP library is stored

References:

[1] Christopher D.Manning, MihaiSurdeanu, JohnBauer, JennyFinkel, StevenJ.Bethard, and David McClosky. The Stanford CoreNLP natural language processing toolkit. In Association for Computational Linguistics (ACL) System Demonstrations, pages 55–60, 2014.

Quantifiers and Negations in RE Documents

Related tags

Overview

Quantifiers-and-Negations-in-RE-Documents

Owner

Nicolas Ruscher

vits chinese, tts chinese, tts mandarin

Pervasive Attention: 2D Convolutional Networks for Sequence-to-Sequence Prediction

A Python package implementing a new model for text classification with visualization tools for Explainable AI :octocat:

Open source code for AlphaFold.

fastNLP: A Modularized and Extensible NLP Framework. Currently still in incubation.

Script to download some free japanese lessons in portuguse from NHK

Code for our ACL 2021 (Findings) Paper - Fingerprinting Fine-tuned Language Models in the wild .

Generate a cool README/About me page for your Github Profile

Include MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.

✨Rubrix is a production-ready Python framework for exploring, annotating, and managing data in NLP projects.

SimCTG - A Contrastive Framework for Neural Text Generation

This is an incredibly powerful calculator that is capable of many useful day-to-day functions.

Full Spectrum Bioinformatics - a free online text designed to introduce key topics in Bioinformatics using the Python

Training code of Spatial Time Memory Network. Semi-supervised video object segmentation.

A framework for implementing federated learning

Utility for Google Text-To-Speech batch audio files generator. Ideal for prompt files creation with Google voices for application in offline IVRs

Code for paper Multitask-Finetuning of Zero-shot Vision-Language Models

Translation for Trilium Notes. Trilium Notes 中文版.

:id: A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.

Binary LSTM model for text classification