Scrap the 42 Intranet's elearning videos in a single click

Last update: Oct 27, 2022

Related tags

Web Crawling 42intra_scraper

Overview

42intra_scraper

Scrap the 42 Intranet's elearning videos in a single click.

Why you would want to use it ?

Adjust speed at your convenience. (The intra doesn't allow this)
Working in a remote location where internet is hit or miss ? Download what you need and you'll have it in your computer.
Have a friend that is freeze and can't access the intra's resources ? You can download the videos, compress them and send them via drive.

How to use it:

git clone [email protected]:Dovalich/42intra_scraper.git

pip3 install -r requirements.txt

python3 intra_scraper.py

And then all you have to do is follow the instructions that the program gives you, that is:

enter your 42 intranet username
enter your 42 intranet password
enter the elearning link you want to scrap for example https://elearning.intra.42.fr/tags/38/notions

Here's a short Tutorial gif:

How does it work ?

It's fairly simple.

The program makes a post request to the intranet using your logins (via the requests module).
Once logged-in, it recursively searches for any links that are in the middle of the page (the ones that contain videos).
Once it finds a video link, it downloads it based on the video quality you chose (SD or HD).

Note

As you can see in the code I don't store your user name and password. In fact I only use them once to login. But be careful when using these types of scripts. You should always read the source code before giving away sensitive information.

If you have feedback on the code please let me know! 👨‍🎓

And feel free to use it however you want.

Scrap the 42 Intranet's elearning videos in a single click

Related tags

Overview

42intra_scraper

Why you would want to use it ?

How to use it:

How does it work ?

Note

Owner

Noufel

crypto currency scraping

A Python Covid-19 cases tracker that scrapes data off the web and presents the number of Cases, Recovered Cases, and Deaths that occurred because of the pandemic.

Simple tool to scrape and download cross country ski timings and results from live.skidor.com

NASA APOD Discord Bot - Fetches information from NASA APOD site.

A Happy and lightweight Python Package that searches Google News RSS Feed and returns a usable JSON response and scrap complete article - No need to write scrappers for articles fetching anymore

Google Scholar Web Scraping

Introduction to WebScraping Workshop - Semcomp 24 Beta

Console application for downloading images from Reddit in Python

Scrape all the media from an OnlyFans account - Updated regularly

Find papers by keywords and venues. Then download it automatically

Scrapy-based cyber security news finder

爬虫案例合集。包括但不限于《淘宝、京东、天猫、豆瓣、抖音、快手、微博、微信、阿里、头条、pdd、优酷、爱奇艺、携程、12306、58、搜狐、百度指数、维普万方、Zlibraty、Oalib、小说、招标网、采购网、小红书》

Simple tool to scrape and download cross country ski timings and results from live.skidor.com

This was supposed to be a web scraping project, but somehow I've turned it into a spamming project

Unja is a fast & light tool for fetching known URLs from Wayback Machine

Web Scraping Framework

SearchifyX, predecessor to Searchify, is a fast Quizlet, Quizizz, and Brainly webscraper with various stealth features.

Scrapy uses Request and Response objects for crawling web sites.

A training task for web scraping using python multithreading and a real-time-updated list of available proxy servers.

Python web scrapper