Audio media crawler for lbry.

Last update: Dec 03, 2022

Related tags

Overview

Audio media crawler for lbry.

Requirements

Python 3.8
Poetry 1.1.7
Elasticsearch 7.14.0
Lbry-sdk 0.99.0

Development

This project uses poetry as a dependency management tool.

Install dependencies

Installs all defined dependencies of the project. For more information please read the poetry documentation.

poetry install

Tasks

Update hooks

Setup and update pre-commit hooks. You should run this the first time after poetry install.

poetry run task update-hooks

Format code

For more information please read the black documentation

poetry run task format

Commands

Basic usage

For more information please read the poetry documentation.

poetry run podcatcher <command>

Sync

Scan all audio streams to find music and podcasts episodes, keeping elasticsearch in sync.

poetry run podcatcher sync

Retry sync

Retry failed sync from last checkpoint. If no previous failed sync occured it will just run a normal sync.

poetry run podcatcher retry-sync

Cache sync

Skip scan and sync existent cache data to elasticsearch.

poetry run podcatcher cache-sync

Clear cache

Remove all files on the cache directory.

poetry run podcatcher clear-cache

Drop

Remove all indices from elasticsearch and all files from the cache directory.

poetry run podcatcher drop

Audio media crawler for lbry.

Related tags

Overview

Audio media crawler for lbry.

Requirements

Development

Install dependencies

Tasks

Update hooks

Format code

Commands

Basic usage

Sync

Retry sync

Cache sync

Clear cache

Drop

Owner

Hound.fm

A Powerful Spider(Web Crawler) System in Python.

让中国用户使用git从github下载的速度提高1000倍!

Binance Smart Chain Contract Scraper + Contract Evaluator

LSpider 一个为被动扫描器定制的前端爬虫

Twitter Scraper

Web Content Retrieval for Humans™

OSTA web scraper, for checking the status of school buses in Ottawa

A web scraper that exports your entire WhatsApp chat history.

Web Scraping COVID 19 Meta Portal with Python

Explore scraping with BeautifulSoup!

Scrap the 42 Intranet's elearning videos in a single click

Binance Smart Chain Contract Scraper + Contract Evaluator

Demonstration on how to use async python to control multiple playwright browsers for web-scraping

This is a simple website crawler which asks for a website link from the user to crawl and find specific data from the given website address.

京东抢茅台，秒杀成功很多次讨论，天猫抢购，赚钱交流等。

A Happy and lightweight Python Package that searches Google News RSS Feed and returns a usable JSON response and scrap complete article - No need to write scrappers for articles fetching anymore

对于有验证码的站点爆破，用于安全合法测试

Jobinja.ir jobs scraper.

SearchifyX, predecessor to Searchify, is a fast Quizlet, Quizizz, and Brainly webscraper with various stealth features.

A webdriver-based script for reserving Tsinghua badminton courts.