Twitter Scraper

Overview

tweety

Twitter's official API is annoying to work with and has many limitations. Luckily, Twitter's frontend (JavaScript) has its own API, which I reverse-engineered: no API rate limits, no restrictions, and it is extremely fast.

Prerequisites

Before you begin, ensure you have met the following requirements:

  • Internet Connection
  • Python 3.6+
  • BeautifulSoup (Python Module)
  • Requests (Python Module)

All Functions

  • get_tweets()
  • get_user_info()
  • get_trends() (can be used without username)
  • search() (can be used without username)
  • tweet_detail() (can be used without username)

Using tweety

Getting Tweets:

Description:

Get 20 Tweets of a Twitter User

Required Parameter:

  • Username or user profile URL, passed when initializing the Twitter object

Optional Parameter:

  • pages : int (default is 1; pass 2 or more to fetch additional pages) -> get the specified number of pages of tweets
  • include_extras : boolean (default is False) -> get page extras such as Topics

Output:

  • Type -> dictionary
  • Structure
    {
      "p-1": {
        "result": {
          "tweets": []
        }
      },
      "p-2": {
        "result": {
          "tweets": []
        }
      }
    }

Example:

python
Python 3.7.3 (default, Mar 26 2019, 21:43:19) 
[GCC 8.2.1 20181127] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> from tweet import Twitter
>>> all_tweet = Twitter("Username or URL").get_tweets(pages=2)
>>> for i in all_tweet:
...   print(all_tweet[i])

Getting Trends:

Description:

Get 20 trends for the current locale

Output:

  • Type -> dictionary
  • Structure
", "url":" " }, { "name":" ", "url":" " } ] } ">
  {
    "trends":[
      {
        "name":"
      
       "
      ,
        "url":"
      
       "
      
      },
      {
        "name":"
      
       "
      ,
        "url":"
      
       "
      
      }
    ]
  } 

Example:

python
Python 3.7.3 (default, Mar 26 2019, 21:43:19) 
[GCC 8.2.1 20181127] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> from tweet import Twitter
>>> trends = Twitter().get_trends()
>>> for i in trends['trends']:
...   print(i['name'])

Searching a keyword:

Description:

Get 20 Tweets for a specific Keyword or Hashtag

Required Parameter:

  • keyword : str -> the keyword or hashtag to search for

Optional Parameter:

  • latest : boolean (Default is False) -> Get the latest tweets

Output:

  • Type -> list

Example:

python
Python 3.7.3 (default, Mar 26 2019, 21:43:19) 
[GCC 8.2.1 20181127] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> from tweet import Twitter
>>> results = Twitter().search("Pakistan")

Getting User Info:

Description:

Get information about the user

Required Parameter:

  • Username or User profile URL while initiating the Twitter Object

Optional Parameter:

  • banner_extensions : boolean (default is False) -> get more information about the user's banner image
  • image_extensions : boolean (default is False) -> get more information about the user's profile image

Output:

  • Type -> dict

Example:

python
Python 3.7.3 (default, Mar 26 2019, 21:43:19) 
[GCC 8.2.1 20181127] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> from tweet import Twitter
>>> trends = Twitter("Username or URL").get_user_info()

Getting a Tweet Detail:

Description:

Get the details of a tweet, including its replies

Required Parameter:

  • Identifier of the Tweet -> Either Tweet URL OR Tweet ID

Output:

  • Type -> dict
  • Structure
  {
    "conversation_threads":[],
    "tweet": {}
  }

Example:

python
Python 3.7.3 (default, Mar 26 2019, 21:43:19) 
[GCC 8.2.1 20181127] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> from tweet import Twitter
>>> detail = Twitter().tweet_detail("https://twitter.com/Microsoft/status/1442542812197801985")

Updates:

Update 0.1:

  • Get multiple pages of tweets using the pages parameter of get_tweets()
  • The output of get_tweets() has been reworked

Update 0.2:

Update 0.2.1:

  • Fixed Hashtag Search