A Pythonic library for Nvidia Codec.

The project is still in active development; expect breaking changes.

Why another Python library for Nvidia Codec?

Comparison to Video-Processing-Framework

Methodologies

VPF is written fully in C++ and uses pybind to expose Python interfaces. PNC is written fully in Python and uses ctypes to access the Nvidia C interfaces. Our code tends to be more concise, less repetitive, and easier to read and write.
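To illustrate the ctypes approach, here is a minimal sketch of the general pattern: mirror a C struct, declare a C function signature, and turn status codes into Python exceptions. Everything named here (`DecodeCaps`, `get_caps`, `query_caps`, `CodecError`, the field layout, and codec id 8) is hypothetical and not PNC's actual API. The C function is replaced by a Python stand-in so the example runs without a GPU; in real use it would be loaded from the driver's shared library via `ctypes.CDLL`.

```python
import ctypes


class DecodeCaps(ctypes.Structure):
    """Hypothetical C struct; the field layout is illustrative only."""
    _fields_ = [
        ("codec", ctypes.c_int),
        ("is_supported", ctypes.c_ubyte),
        ("max_width", ctypes.c_uint),
        ("max_height", ctypes.c_uint),
    ]


class CodecError(RuntimeError):
    """Raised when a C call returns a non-zero status code."""


# C signature being modelled: int get_caps(DecodeCaps *caps);
GET_CAPS = ctypes.CFUNCTYPE(ctypes.c_int, ctypes.POINTER(DecodeCaps))


def _fake_get_caps(caps_ptr):
    # Stand-in for the driver: fills the struct in place like a C function would.
    caps = caps_ptr.contents
    if caps.codec != 8:  # assumption: pretend only codec id 8 is supported
        return 1
    caps.is_supported = 1
    caps.max_width = 8192
    caps.max_height = 8192
    return 0


get_caps = GET_CAPS(_fake_get_caps)


def query_caps(codec):
    """Pythonic wrapper: allocate the struct, call into C, check the status."""
    caps = DecodeCaps(codec=codec)
    status = get_caps(ctypes.pointer(caps))
    if status != 0:
        raise CodecError(f"get_caps failed with status {status}")
    return caps
```

With a wrapper like this, the caller only sees Python objects and exceptions, e.g. `query_caps(8).max_width`, while the struct handling stays in one place.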

Performance

Preliminary tests show little to no difference in performance, because the heavy lifting is done on the GPU anyway. Both libraries can saturate the GPU decoder. PNC uses more CPU than VPF, as expected from Python vs. C++, but the overhead is still negligible (less than 10% of a single Ryzen 3100 core for 8K*4K HEVC).

Resource Management

In VPF, a Surface given to the user is not owned by the user: it will be overwritten by new frames, which is counter-intuitive. Pictures are not exposed to the user at all; they are always mapped (post-processed and copied) to a Surface so that the Picture is ready for new frames. The latter is inefficient when only a subset of Pictures is needed (e.g. screenshots).
This is because VPF allocates only the bare minimum of resources needed for most decoding tasks. PNC allows the user to specify the amount of resources to be allocated for advanced applications. Users own the resources and decide when, and whether, to process them.
Managing resources is not painful: as in pycuda, we shift the burden of managing host/device resources to the Python garbage collector. Resources (such as Picture and Surface) are automatically freed when the user drops the reference.
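The ownership model described above can be sketched with `weakref.finalize` from the standard library. This is only an illustration under assumptions: `SurfacePool`, `acquire`, and `release` are hypothetical names, and integer slot indices stand in for real device allocations. It is not PNC's implementation.

```python
import weakref


class Surface:
    """Handle to one slot in a pool; the slot is returned when the handle dies."""

    def __init__(self, pool, index):
        self.index = index
        # The finalizer references the pool, not the Surface itself,
        # so it does not keep this handle alive.
        weakref.finalize(self, pool.release, index)


class SurfacePool:
    """Hypothetical pool whose size is chosen by the user up front."""

    def __init__(self, size):
        self.free = list(range(size))

    def acquire(self):
        if not self.free:
            raise RuntimeError("no free surface; drop a reference first")
        return Surface(self, self.free.pop())

    def release(self, index):
        # Called by the finalizer once the user drops the last reference.
        self.free.append(index)
```

A user would create `pool = SurfacePool(2)`, hold on to the surfaces they care about, and simply `del` (or let go of) the ones they no longer need; the slot then becomes available again without an explicit free call.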

Things to come

TODO Cropping and scaling support in postprocessing
TODO Color space conversion from YUV (BT.601/709, full-range/limited-range) to RGB using pycuda
TODO Encoder
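For reference, the arithmetic behind the planned color space conversion can be sketched on the CPU for a single 8-bit pixel. This is a plain-Python illustration, not the pycuda implementation mentioned above; the function name `yuv_to_rgb` and its parameters are assumptions made for this example. The luma coefficients are the standard BT.601 and BT.709 values.

```python
# Luma coefficients (Kr, Kb) defined by the BT.601 and BT.709 standards.
LUMA_COEFFS = {"bt601": (0.299, 0.114), "bt709": (0.2126, 0.0722)}


def yuv_to_rgb(y, cb, cr, standard="bt709", full_range=False):
    """Convert one 8-bit Y'CbCr pixel to 8-bit R'G'B' (hypothetical helper)."""
    kr, kb = LUMA_COEFFS[standard]
    kg = 1.0 - kr - kb

    if full_range:
        # Full range: luma and chroma use the whole 0..255 interval.
        yn = y / 255.0
        pb = (cb - 128) / 255.0
        pr = (cr - 128) / 255.0
    else:
        # Limited range: luma spans 16..235, chroma spans 16..240.
        yn = (y - 16) / 219.0
        pb = (cb - 128) / 224.0
        pr = (cr - 128) / 224.0

    # Invert Pr = (R - Y) / (2 * (1 - Kr)) and Pb = (B - Y) / (2 * (1 - Kb)).
    r = yn + 2.0 * (1.0 - kr) * pr
    b = yn + 2.0 * (1.0 - kb) * pb
    # Recover green from Y = Kr * R + Kg * G + Kb * B.
    g = (yn - kr * r - kb * b) / kg

    def to_byte(v):
        # Scale to 0..255 and clamp out-of-gamut values.
        return max(0, min(255, round(v * 255.0)))

    return to_byte(r), to_byte(g), to_byte(b)
```

A GPU version would apply the same per-pixel formula in a kernel over the whole frame; only the standard and range flags change the constants.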

Acknowledgements

Many thanks to @rarzumanyan for all the help and explanations!

Owner

Zesen Qian
