SkyScrapers: A collection of various scraping apps

Last update: Feb 17, 2022

Related tags

Web Crawling SkyScrapers

Overview

SkyScrapers

A collection of various web scraping apps.

The web-scrapers involved in this project are:

StockSymbolScraper
UnsplashImagesScraper

Tech

SkyScrapers uses a number of open source projects to work properly:

BeautifulSoup - Beautiful Soup is a Python library for pulling data out of HTML and XML files.
Selenium - Selenium is for automating web applications for testing purposes.
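
To make "pulling data out of HTML" concrete, here is a minimal sketch of the idea. It is not taken from the SkyScrapers code: it uses Python's standard-library html.parser as a dependency-free stand-in for Beautiful Soup, and the names LinkCollector, extract_links, and SAMPLE_HTML (including its stock-symbol links) are made up for illustration. With Beautiful Soup installed, roughly the same result could be obtained by calling find_all("a") on a BeautifulSoup(html, "html.parser") object.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collect (text, href) pairs for every <a href=...> tag in a document."""

    def __init__(self):
        super().__init__()
        self.links = []      # finished (text, href) pairs
        self._href = None    # href of the <a> currently open, if any
        self._text = []      # text fragments seen inside that <a>

    def handle_starttag(self, tag, attrs):
        # Anchors without an href leave _href as None and are skipped.
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        # Only keep text that appears inside an open link.
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append(("".join(self._text).strip(), self._href))
            self._href = None


def extract_links(html):
    """Return a list of (link text, href) tuples found in the HTML string."""
    parser = LinkCollector()
    parser.feed(html)
    parser.close()
    return parser.links


# Hypothetical sample page; a real scraper would download the HTML first.
SAMPLE_HTML = """
<ul>
  <li><a href="/stocks/AAPL">AAPL</a></li>
  <li><a href="/stocks/MSFT">MSFT</a></li>
</ul>
"""

if __name__ == "__main__":
    for text, href in extract_links(SAMPLE_HTML):
        print(text, href)
```

Selenium typically comes in when the page builds its content with JavaScript, so the HTML has to be rendered in a real browser before it can be parsed this way.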

❗ ❗ ❗ Please check the markdown files named THEORY inside the respective project directories for insight into each individual project.

And of course SkyScrapers itself is open source with a public repository on GitHub.

Installation

SkyScrapers requires a recent version of Python to run.

Install the dependencies before you start.

pip install <package_name>

Owner

Biplov Pokhrel

Just started. NITR 2023.

GitHub Repository

The first public repository that provides a free BUBT website scraping API script on GitHub.

BUBT WEBSITE SCRAPPING SCRIPT I think this is the first public repository that provides free BUBT website scraping API script on github. When I was do

3 Feb 10, 2022

Telegram group scraper tool

Telegram Group Scrapper

2 Jan 11, 2022

Scrap-mtg-top-8 - A top 8 mtg scraper using python

1 Jan 24, 2022

This program scrapes information and images for movies and TV shows.

Media-WebScraper This program scrapes information and images for movies and TV shows. Summary For more information on the program, read the WebScrape_

1 Dec 05, 2021

The core packages of security analyzer web crawler

Security Analyzer 🐍 A large scale web crawler (considered also as vulnerability scanner tool) to take an overview about security of Moroccan sites Cu

10 Jul 03, 2022

Comment Webpage Screenshot is a GitHub Action that captures screenshots of web pages and HTML files located in the repository

Comment Webpage Screenshot is a GitHub Action that helps maintainers visually review HTML file changes introduced on a Pull Request by adding comments with the screenshots of the latest HTML file cha

21 Sep 29, 2022

GitHub scraper app, built with the Streamlit and BeautifulSoup Python packages, that scrapes data for a specific user profile

Github Scraper Github scraper app is used to scrape data for a specific user profile. Github scraper app gets a github profile name and check whether

6 Apr 05, 2022

A way to scrape a database of all of the ISEF projects

ISEF Database This is a simple web scraper which gets all of the projects and abstract information from here. My goal for this is for someone to get i

1 Mar 18, 2022

Libextract: extract data from websites

Libextract is a statistics-enabled data extraction library that works on HTML and XML documents and written in Python

499 Dec 09, 2022

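LSpider: a front-end crawler built for passive scanners

LSpider LSpider - A front-end crawler customized for passive scanners. What is LSpider? A front-end crawler made for passive scanners. It consists of five parts: Chrome Headless, the LSpider controller, a MySQL database, RabbitMQ, and a passive scanner.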

321 Dec 12, 2022

Simple Python tool for swapping Latin letters with Cyrillic ones and vice versa in txt, docx, and pdf files in the Serbian language

Alpha Swap English This is a simple Python tool for swapping Latin letters with Cyrillic ones and vice versa, in txt, docx and pdf fil