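Maoyan Font Recognition

This GitHub repo helps XJTLU students recognize the distorted (obfuscated) fonts used by Maoyan. It has been packaged and uploaded to PyPI, so it can be installed directly with pip.

Why Maoyan's fonts cannot be read directly, and the approach used to solve this, are explained on 采茶.

Usage: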

import MaoYanFontRecognize

m = MaoYanFontRecognize.MaoYanFont()
rate, rate_num, money = m.translate(rate_raw, rate_num_raw, font_file, money_raw=-1, money_unit=1)

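For more detailed usage, see test.

Note that Maoyan generates a special font for every movie detail page, and the font file is different after every refresh. The font file must therefore be downloaded together with each detail page and passed in as the font_file parameter.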
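
The font for a page has to come from the same response as the scraped values. Below is a minimal sketch of locating it, assuming the detail page references the font through a CSS `url(...)` entry ending in `.woff` (an assumption about the page layout, not something this README states). The helper names `extract_font_url` and `to_font_file` are hypothetical and not part of this package.

```python
import io
import re
from typing import Optional

# Assumption: the font is referenced as url('//host/path/name.woff') in the page's CSS.
WOFF_URL = re.compile(r"""url\(\s*['"]?([^'")\s]+\.woff)['"]?\s*\)""")


def extract_font_url(html: str) -> Optional[str]:
    """Return the first .woff URL found in the page, or None if there is none."""
    match = WOFF_URL.search(html)
    if match is None:
        return None
    url = match.group(1)
    # Protocol-relative URLs ("//host/...") need an explicit scheme.
    return "https:" + url if url.startswith("//") else url


def to_font_file(font_bytes: bytes) -> io.BytesIO:
    """Wrap downloaded font bytes in the io.BytesIO form expected as font_file."""
    return io.BytesIO(font_bytes)
```

The bytes themselves can be fetched with any HTTP client (for example `urllib.request.urlopen(url).read()`) right after the page is downloaded, since a refreshed page would point at a different font.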

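Parameters:

Return values:

rate: the Maoyan rating
rate_num: the number of people who rated the movie on Maoyan
money: the box office in yuan (CNY); amounts given in US dollars are converted at the exchange rate of 2021/10/26.
money_unit: the unit of the box office; there are only 3 units
1. 万 (ten thousand): 1e4,
2. 亿 (hundred million): 1e8,
3. 万美元 (ten thousand US dollars): 63900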
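
The unit list above amounts to a simple multiplication. A minimal sketch of that arithmetic follows; the multipliers are taken from the list, while the function name `to_yuan` and the use of the unit label as a dictionary key are illustrative assumptions, not the package's actual `money_unit` encoding. The value 63900 corresponds to 1e4 × 6.39 yuan per US dollar.

```python
# Multipliers copied from the unit list; the keys are an illustrative choice
# and may not match how the library encodes money_unit.
UNIT_TO_YUAN = {
    "万": 1e4,        # ten thousand yuan
    "亿": 1e8,        # hundred million yuan
    "万美元": 63900,  # ten thousand US dollars at 6.39 yuan per dollar
}


def to_yuan(amount: float, unit: str) -> float:
    """Convert a displayed box-office figure into yuan."""
    return amount * UNIT_TO_YUAN[unit]
```

For instance, a displayed figure of 2 with the unit 亿 corresponds to `to_yuan(2, "亿")`, which is 200 million yuan.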

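Inputs:

rate_raw: the unprocessed Maoyan rating, scraped directly from Maoyan: the contents of the tag after parsing the page with bs4. The code below is an example; the other raw parameters are obtained in much the same way: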
```
spans = soup("span", class_="stonefont")
rate_raw = spans[0].contents[0]
rate_num_raw = spans[1].contents[0]
money_raw = spans[2].contents[0]
```
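rate_num_raw: the unprocessed number of ratings.
font_file: every movie detail page has a newly generated font; this parameter takes that font file as an io.BytesIO() object. It is recommended to download the font referenced by the detail page and pass it in.
money_raw: the unprocessed box office.
money_unit: the unit of the box office.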
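
To see what a raw value looks like before translation, it can help to print its code points. The sketch below assumes the obfuscated digits are Unicode Private Use Area characters (U+E000 to U+F8FF) whose meaning only the page's font defines; that is an assumption about Maoyan's scheme, not something this README confirms. The helper name `split_obfuscated` is hypothetical.

```python
from typing import List, Tuple


def split_obfuscated(raw: str) -> List[Tuple[str, bool]]:
    """Label each character with its code point and whether it lies in the
    Unicode Private Use Area (assumed here to mark an obfuscated digit)."""
    return [(f"U+{ord(ch):04X}", 0xE000 <= ord(ch) <= 0xF8FF) for ch in raw]
```

Characters flagged `True` would need the font file to be decoded, while ordinary characters such as a decimal point can be read as they are.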

Test

Test results:

The output `2 extra bytes in post.stringData array` is caused by the TTFont library and does not affect normal use.
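
If the message is distracting, it can probably be silenced. A minimal sketch, assuming the message is emitted through Python's standard `logging` module under the `fontTools` logger namespace (true of recent fontTools releases as far as I know, but not verified against the version this package uses):

```python
import logging

# Assumption: the warning is logged by a child of the "fontTools" logger.
# Raising the parent's level hides warnings but still lets errors through.
logging.getLogger("fontTools").setLevel(logging.ERROR)
```

This only changes what is printed; the font is parsed the same way either way.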

Give a solution to recognize MaoYan font.


Owner

Aruix
