ASOUL直播间弹幕抓取&&数据分析（更新中）

这些文件用于爬取ASOUL直播间的弹幕（其他直播间也可以）和其他信息，以及简单的数据分析生成。

我们的B站专栏：A-SOUL数据组 😘

运行环境要求

需要python3、requirements.txt中的库以及mysql数据库。

直播信息抓取

简介

这部分文件主要用于抓取直播间的所有信息，例如弹幕、SC、进场等，并且保存到mysql数据库中。

使用方法

安装pip库，在目录下命令行
```
pip install -r requirement.txt
```
修改live/config.txt中room_id（你所关心的直播间号）、target_id（你所关心的主播的UID）、medal _room_id（你认为所有的和你所关心的主播相关的直播间号）。默认是asoul相关。
修改live/config.txt中mysql_config中的host、port、user、password和db。

同目录下新建一个python文件，在新文件添加语句如下：

import live_data_collection

def live_monitor(live_room_id):
    my_live_monitor = live_data_collection.bilibili_live_data(live_room_id)
    my_live_monitor.live_monitor()

room_id = '22625025'     #这里是你要爬取的房间id
live_monitor(room_id)

运行新文件，成功的话，程序会自动生成mysql中的表，并且程序终端会有心跳包等信息输出，可以连接数据库查看新的信息。

其他

关于数据库表结构和字段含义，请在live_data_collection.py等相关文件里面寻找注释。

ASOUL直播间弹幕抓取&&数据分析

Related tags

Overview

ASOUL直播间弹幕抓取&&数据分析（更新中）

目录

运行环境要求

直播信息抓取

简介

相关文件

使用方法

其他

Owner

Python reader for Linked Data in HDF5 files

Python Library for learning (Structure and Parameter) and inference (Statistical and Causal) in Bayesian Networks.

University Challenge 2021 With Python

wikirepo is a Python package that provides a framework to easily source and leverage standardized Wikidata information

Synthetic data need to preserve the statistical properties of real data in terms of their individual behavior and (inter-)dependences

Kennedy Institute of Rheumatology University of Oxford Project November 2019

VHub - An API that permits uploading of vulnerability datasets and return of the serialized data

Stock Analysis dashboard Using Streamlit and Python

Detecting Underwater Objects (DUO)

A lightweight, hub-and-spoke dashboard for multi-account Data Science projects

The Master's in Data Science Program run by the Faculty of Mathematics and Information Science

Pypeln is a simple yet powerful Python library for creating concurrent data pipelines.

Hydrogen (or other pure gas phase species) depressurization calculations

Uses MIT/MEDSL, New York Times, and US Census datasources to analyze per-county COVID-19 deaths.

DefAP is a program developed to facilitate the exploration of a material's defect chemistry

A tool to compare differences between dataframes and create a differences report in Excel

Random dataframe and database table generator

AptaMat is a simple script which aims to measure differences between DNA or RNA secondary structures.

Repositori untuk menyimpan material Long Course STMKGxHMGI tentang Geophysical Python for Seismic Data Analysis

Useful tool for inserting DataFrames into the Excel sheet.