Creating an Audiobook (mp3 file) using a Ebook (epub) using BeautifulSoup and Google Text to Speech

Last update: Aug 25, 2022

Related tags

Text Data & NLP epub2audiobook

Overview

epub2audiobook

Creating an Audiobook (mp3 file) using a Ebook (epub) using BeautifulSoup and Google Text to Speech

Input examples

qual a pasta do seu arquivo ebook?

C:\Users\yourname\Documents...\newfolder\
it has to have a last forward-slash (" \ ")

qual o nome do seu arquivo ebook (sem o .epub)?

Input-book

About

I wrote this in 2020. This is one of my first python projects. I've made improvements on it, but it isn't my cleanest code. Please be kind in the comments, I look forward to hearing some feedbacks :)

I was inspired by this youtube video https://youtu.be/q-nvbuc59Po The Google Cloud Text 2 Speech library is not completely free, you have to create an account and give them your credit card number. But there is a free tier. If you process less characters than the tier's minimun, you will not be charged.

I "created" a free ebook for you to try this code. It is a combination of samples of public domain books that I enjoy.

Owner

GitHub Repository

Labelling platform for text using distant supervision

With DataQA, you can label unstructured text documents using rule-based distant supervision.

245 Aug 05, 2022

ReCoin - Restoring our environment and businesses in parallel

Shashank Ojha, Sabrina Button, Abdellah Ghassel, Joshua Gonzales "Reduce Reuse R

1 Mar 14, 2022

Beautiful visualizations of how language differs among document types.

Scattertext 0.1.0.0 A tool for finding distinguishing terms in corpora and displaying them in an interactive HTML scatter plot. Points corresponding t

2k Dec 27, 2022

SummerTime - Text Summarization Toolkit for Non-experts

A library to help users choose appropriate summarization tools based on their specific tasks or needs. Includes models, evaluation metrics, and datasets.

213 Jan 04, 2023

A python package for deep multilingual punctuation prediction.

This python library predicts the punctuation of English, Italian, French and German texts. We developed it to restore the punctuation of transcribed spoken language.

27 Dec 22, 2022

LSTM model - IMDB review sentiment analysis

NLP - Movie review sentiment analysis The colab notebook contains the code for building a LSTM Recurrent Neural Network that gives 87-88% accuracy on

1 Jan 29, 2022

Uncomplete archive of files from the European Nopsled Team

European Nopsled CTF Archive This is an archive of collected material from various Capture the Flag competitions that the European Nopsled team played

4 Nov 24, 2021

pyMorfologik MorfologikpyMorfologik - Python binding for Morfologik.

Python binding for Morfologik Morfologik is Polish morphological analyzer. For more information see http://github.com/morfologik/morfologik-stemming/

18 Dec 29, 2021

Data and code to support "Applied Natural Language Processing" (INFO 256, Fall 2021, UC Berkeley)

anlp21 Course materials for "Applied Natural Language Processing" (INFO 256, Fall 2021, UC Berkeley) Syllabus: http://people.ischool.berkeley.edu/~dba

48 Dec 06, 2022

ACL'2021: Learning Dense Representations of Phrases at Scale

DensePhrases DensePhrases is an extractive phrase search tool based on your natural language inputs. From 5 million Wikipedia articles, it can search

540 Dec 30, 2022

This project is part of Eleuther AI's quest to create a massive repository of high quality text data for training language models.

42 Dec 13, 2022

Mednlp - Medical natural language parsing and utility library

Medical natural language parsing and utility library A natural language medical

3 Aug 24, 2022

🐍💯pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection that works out-of-the-box.

pySBD: Python Sentence Boundary Disambiguation (SBD) pySBD - python Sentence Boundary Disambiguation (SBD) - is a rule-based sentence boundary detecti

549 Jan 06, 2023

LV-BERT: Exploiting Layer Variety for BERT (Findings of ACL 2021)

LV-BERT Introduction In this repo, we introduce LV-BERT by exploiting layer variety for BERT. For detailed description and experimental results, pleas

14 Aug 24, 2022

Product-Review-Summarizer - Created a product review summarizer which clustered thousands of product reviews and summarized them into a maximum of 500 characters, saving precious time of customers and helping them make a wise buying decision.

Product-Review-Summarizer - Created a product review summarizer which clustered thousands of product reviews and summarized them into a maximum of 500 characters, saving precious time of customers an

1 Jan 01, 2022

Creating an Audiobook (mp3 file) using a Ebook (epub) using BeautifulSoup and Google Text to Speech

Related tags

Overview

epub2audiobook

Input examples

qual a pasta do seu arquivo ebook?

qual o nome do seu arquivo ebook (sem o .epub)?

About

Owner

Labelling platform for text using distant supervision

ReCoin - Restoring our environment and businesses in parallel

Beautiful visualizations of how language differs among document types.

SummerTime - Text Summarization Toolkit for Non-experts

A python package for deep multilingual punctuation prediction.

LSTM model - IMDB review sentiment analysis

Uncomplete archive of files from the European Nopsled Team

pyMorfologik MorfologikpyMorfologik - Python binding for Morfologik.

Data and code to support "Applied Natural Language Processing" (INFO 256, Fall 2021, UC Berkeley)

ACL'2021: Learning Dense Representations of Phrases at Scale

This project is part of Eleuther AI's quest to create a massive repository of high quality text data for training language models.

Mednlp - Medical natural language parsing and utility library

🐍💯pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection that works out-of-the-box.

LV-BERT: Exploiting Layer Variety for BERT (Findings of ACL 2021)

Product-Review-Summarizer - Created a product review summarizer which clustered thousands of product reviews and summarized them into a maximum of 500 characters, saving precious time of customers and helping them make a wise buying decision.

Longformer: The Long-Document Transformer

A pytorch implementation of the ACL2019 paper "Simple and Effective Text Matching with Richer Alignment Features".

Towards Nonlinear Disentanglement in Natural Data with Temporal Sparse Coding

Associated Repository for "Translation between Molecules and Natural Language"

Unsupervised text tokenizer focused on computational efficiency