Graph-based community clustering approach to extract protein domains from a predicted aligned error matrix

Last update: Nov 23, 2022

pae_to_domains

Overview

Given a predicted aligned error (PAE) matrix corresponding to an AlphaFold2 model (e.g. as downloaded from https://alphafold.ebi.ac.uk/), this tool returns a series of lists of residue indices, where each list corresponds to a set of residues that cluster together into a pseudo-rigid domain.
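As a rough illustration of the idea, the sketch below builds a weighted residue graph from a PAE matrix and partitions it with NetworkX's greedy modularity community detection. It is not taken from pae_to_domains.py: the function name `cluster_residues`, the choice of `greedy_modularity_communities`, the averaging of the two PAE directions, the clamp against zero PAE values and the toy matrix are all assumptions made for this example.

```python
import networkx as nx
from networkx.algorithms import community


def cluster_residues(pae, pae_cutoff, pae_power=1.0, resolution=1.0):
    """Group residue indices into communities from a square PAE matrix (list of lists)."""
    n = len(pae)
    graph = nx.Graph()
    graph.add_nodes_from(range(n))
    for i in range(n):
        for j in range(i + 1, n):
            # Assumption: PAE is not symmetric, so average the two directions.
            mean_pae = 0.5 * (pae[i][j] + pae[j][i])
            # Assumption: only pairs below the cutoff are connected.
            if mean_pae < pae_cutoff:
                # Assumption: clamp small values to avoid dividing by zero.
                weight = 1.0 / max(mean_pae, 0.2) ** pae_power
                graph.add_edge(i, j, weight=weight)
    clusters = community.greedy_modularity_communities(
        graph, weight="weight", resolution=resolution
    )
    return [sorted(c) for c in clusters]


# Toy matrix: two blocks of four residues, confident within a block,
# uncertain between blocks. The cutoff of 5.0 is an illustrative value.
toy_pae = [[1.0 if (i < 4) == (j < 4) else 20.0 for j in range(8)] for i in range(8)]
domains = cluster_residues(toy_pae, pae_cutoff=5.0)
for residues in domains:
    print(residues)
```

On the toy matrix no edges cross between the two blocks, so each block ends up as its own cluster.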

Requirements

Python >= 3.7
NetworkX >= 2.6.2

Known Issues

Due to an internal implementation issue in NetworkX (Issue #4992), some combinations of PAE matrix and resolution can lead to a KeyError. Solutions are being explored, and a fix is hoped for in the next NetworkX release.

Usage

While primarily intended as a code snippet to be incorporated into larger projects, this can also be called from the command line. At its simplest:

python pae_to_domains.py pae_file.json

... will yield a .csv file with each line providing the indices for one residue cluster. Full help for the command-line version:

positional arguments:
  pae_file              Name of the PAE JSON file.

optional arguments:
  -h, --help            show this help message and exit
  --output_file OUTPUT_FILE
                        Name of output file (comma-delimited text format).
                        Default: clusters.csv
  --pae_power PAE_POWER
                        Graph edges will be weighted as 1/pae**pae_power.
                        Default: 1.0
  --pae_cutoff PAE_CUTOFF
                        Graph edges will only be created for residue pairs
                        with pae
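To make the effect of --pae_power concrete, the short example below evaluates the weighting formula from the help text, 1/pae**pae_power, for a few PAE values. `edge_weight` is a hypothetical helper written for this illustration, not a function from the script.

```python
def edge_weight(pae, pae_power=1.0):
    """Edge weight as given in the help text: 1/pae**pae_power."""
    return 1.0 / pae ** pae_power


# Compare the default power (1.0) with a steeper weighting (2.0).
for pae in (1.0, 2.0, 4.0):
    print(f"pae={pae}: power 1.0 -> {edge_weight(pae)}, power 2.0 -> {edge_weight(pae, 2.0)}")
```

A higher power widens the gap between low-PAE and high-PAE pairs: at a PAE of 4.0 the weight drops from 0.25 to 0.0625 when the power goes from 1.0 to 2.0.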



Example
Using https://alphafold.ebi.ac.uk/entry/Q9HBA0 as an example case, results were shown for resolution=0.5, resolution=1.0 and resolution=2.0 (figures not available).
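Each run in a comparison like this writes one .csv file of the kind described under Usage, with one residue cluster per line. A minimal sketch for reading such a file back into Python lists follows. It assumes each line holds comma-separated integers; `read_clusters` is a hypothetical helper and the sample contents are made up.

```python
import csv
import io


def read_clusters(handle):
    """Return one list of integer residue indices per non-empty CSV row."""
    return [
        [int(field) for field in row if field.strip()]
        for row in csv.reader(handle)
        if row
    ]


# Stand-in for open("clusters.csv", newline=""); the contents are invented.
# Whether indices are zero- or one-based is not stated here, so none is assumed.
sample = io.StringIO("0,1,2,3\n4,5,6\n")
clusters = read_clusters(sample)
for number, residues in enumerate(clusters, start=1):
    print(f"cluster {number}: {len(residues)} residues")
```

Running the reader on the output of several runs gives a quick way to compare how many clusters each resolution produces.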

Owner

Tristan Croll
