Count the frequency of letters or words in a text file and show a graph.

Last update: Apr 09, 2022

Overview

Word Counter

By EBUS Coding Club

Count the frequency of letters or words in a text file and show a graph.

Requirements

Python 3.9 or higher
matplotlib

Usage

Download the source code and unzip the downloaded file. Run pip install -r requirements.txt in the source code directory to install the required packages. Create a text file in the same directory as main.py named input.txt and fill it with text you want to analyze. Run the script in an IDE of your choice or with python main.py.

Objective

Given a text file, count the frequency (number of occurrences) of either letters or words, and show a bar graph to visualize the results. Do not include whitespace or punctuation in the results, with the exception of apostrophes that are inside words.

Next Steps

Add command line arguments for input file path and other options
Add timers for significant steps to diagnose performance
Optimize speed and memory usage
Anything else you can think of to improve the script

License

MIT License

Count the frequency of letters or words in a text file and show a graph.

Related tags

Overview

Word Counter

Requirements

Usage

Objective

Next Steps

License

Owner

EBUS Coding Club

Python library for parsing resumes using natural language processing and machine learning

Question answering app is used to answer for a user given question from user given text.

Python Implementation of ``Modeling the Influence of Verb Aspect on the Activation of Typical Event Locations with BERT'' (Findings of ACL: ACL 2021)

Deep Learning Topics with Computer Vision & NLP

GNES enables large-scale index and semantic search for text-to-text, image-to-image, video-to-video and any-to-any content form

Which Apple Keeps Which Doctor Away? Colorful Word Representations with Visual Oracles

Performance-Efficiency Trade-offs in Unsupervised Pre-training for Speech Recognition

Tools to download and cleanup Common Crawl data

TaCL: Improve BERT Pre-training with Token-aware Contrastive Learning

An implementation of WaveNet with fast generation

CVSS: A Massively Multilingual Speech-to-Speech Translation Corpus

Python powered crossword generator with database with 20k+ polish words

NL. The natural language programming language.

Use AutoModelForSeq2SeqLM in Huggingface Transformers to train COMET

Pytorch code for ICRA'21 paper: "Hierarchical Cross-Modal Agent for Robotics Vision-and-Language Navigation"

A python project made to generate code using either OpenAI's codex or GPT-J (Although not as good as codex)

profile tools for pytorch nn models

Full Spectrum Bioinformatics - a free online text designed to introduce key topics in Bioinformatics using the Python

Japanese Long-Unit-Word Tokenizer with RemBertTokenizerFast of Transformers

TweebankNLP - Pre-trained Tweet NLP Pipeline (NER, tokenization, lemmatization, POS tagging, dependency parsing) + Models + Tweebank-NER