Haphazard scripts for scraping bitcoin/bitcoin data from GitHub

Last update: Oct 12, 2022

Related tags

Web Crawling bitcoin-github-scrape

Overview

This is a quick-and-dirty tool used to scrape bitcoin/bitcoin pull request and commentary data.

Each output/<pr number> folder contains:

comments.json: an aggregated list of both issue comments and review comments, in GitHub's original format
commits.json: a list of the commit objects belonging to the PR, in GitHub's original format
pr.json: the pull request object, in GitHub's original format
comments_abbrev.csv: an abbreviated representation of each comment, in CSV format
pr_abbrev.csv: an abbreviated representation of the PR, in CSV format
done: a sentinel file recording the datetime at which the PR data was retrieved
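
As a rough illustration of how one of these folders might be consumed, here is a minimal sketch in Python. The file names come from the list above; the function name load_pr_folder is hypothetical and not part of the tool, and the sketch assumes the JSON files are UTF-8 encoded and that done holds plain text. The CSV files are returned as raw rows because their column layout is not documented here.

```python
import csv
import json
from pathlib import Path


def load_pr_folder(folder):
    """Load the files of one output/<pr number> folder into a dict.

    Hypothetical helper: not part of the scraper itself.
    """
    folder = Path(folder)

    def read_json(name):
        with open(folder / name, encoding="utf-8") as f:
            return json.load(f)

    def read_csv_rows(name):
        # Rows are returned as plain lists; no header row is assumed.
        with open(folder / name, newline="", encoding="utf-8") as f:
            return list(csv.reader(f))

    done_path = folder / "done"
    return {
        "pr": read_json("pr.json"),
        "comments": read_json("comments.json"),
        "commits": read_json("commits.json"),
        "comments_abbrev": read_csv_rows("comments_abbrev.csv"),
        "pr_abbrev": read_csv_rows("pr_abbrev.csv"),
        # Raw sentinel text, or None if the sentinel was never written.
        "done": (
            done_path.read_text(encoding="utf-8").strip()
            if done_path.exists()
            else None
        ),
    }
```

A missing done file is reported as None rather than raised as an error, so a caller can tell a finished PR folder from a partially written one.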

Limitations

The tool does not currently handle open PRs (or any PR that is expected to be updated) properly: once the done sentinel is created, the data for that PR is never refreshed. One possible fix is to compare the PR's timestamps (for example, its last-updated time) against the datetime stored in the done sentinel, and to overwrite the folder when the stored data is stale.
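
The timestamp comparison suggested above could look roughly like the following Python sketch. It is not how the tool currently behaves. It assumes the done file contains an ISO 8601 datetime (the actual format is not documented here) and compares it with the updated_at field that GitHub includes in pull request objects; the names parse_timestamp and needs_refresh are hypothetical.

```python
from datetime import datetime, timezone


def parse_timestamp(text):
    """Parse an ISO 8601 timestamp, treating naive values as UTC."""
    # datetime.fromisoformat() rejects a trailing "Z" before Python 3.11,
    # so rewrite it as an explicit UTC offset first.
    parsed = datetime.fromisoformat(text.strip().replace("Z", "+00:00"))
    if parsed.tzinfo is None:
        # Assumption: a sentinel written without a timezone is in UTC.
        parsed = parsed.replace(tzinfo=timezone.utc)
    return parsed


def needs_refresh(pr_updated_at, done_text):
    """Return True if the PR changed after the done sentinel was written."""
    return parse_timestamp(pr_updated_at) > parse_timestamp(done_text)
```

For example, needs_refresh(pr["updated_at"], done_text) would signal that the folder should be scraped again and overwritten. If the sentinel is actually written in local time rather than UTC, the naive-timestamp assumption would need adjusting.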


Owner

James O'Beirne

See also

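Xuexi Qiangguo (学习强国) automation: 100% correct, instant quiz answering, worth 45 points

feapder is a simple, fast, lightweight crawler framework, built for rapid development, fast crawling, ease of use, and powerful features. It supports distributed crawlers, batch crawlers, multi-template crawlers, and a comprehensive crawler alerting mechanism.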

Web scraping library and command-line tool for text discovery and extraction (main content, metadata, comments)

Using Selenium with Python to scrape popular YouTube tech channels.

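Latest optimized version of a JD.com Moutai purchase-sniping script (JD Moutai flash sale), with an optimized purchase process queue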

A module for CME that spiders hashes across the domain with a given hash.

Twitter Eye is a Twitter information gathering tool

Google Developer Profile Badge Scraper

IGLS - Instagram Like Scraper CLI tool

A Python script to scrape the overview and reviews of companies from Glassdoor.

Instagram_scrapper - This project allows you to scrape the list of followers, following, or both from a public Instagram account, and to create a CSV or Excel file easily.

Minecraft Item Scraper

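Automatic quiz answering and score farming for the "Four Histories" quiz on China University Students Online (中国大学生在线); currently supports only the Heroes chapter

This is a script that scrapes longitude and latitude data from food.grab.com

Crawler for the Fundamentus.com site using the Scrapy framework, covering both the detailed tab and the summary tab.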

Scrapy-based cyber security news finder

Points push notifications for JD Cloud Wireless Treasure (京东云无线宝), with support for viewing points usage across multiple devices

Automated LinkedIn bot that will improve your visibility and increase your network.

An automated Udemy coupon scraper that scrapes coupons and automatically posts the results to a Blogspot blog

Poolbooru gelscraper - a simple Python script for scraping images off Gelbooru pools.
