Get mutations in cluster by querying from LAPIS API

Last update: Oct 22, 2021

Related tags

Data Analysis cluster-mutations

Overview

Cluster Mutation Script

Get mutations appearing within user-defined clusters.

Usage

Clusters are defined in the clusters dict in main.py:

clusters = {
    '21A.Delta': ['11514.','4181.','6402.','27752T','28461G','22995A'],
    '21J.Delta': ['4181T','6402T','27752T','28461G','22995A'],
    '21I.Delta': ['5584G', '11514T', '22227T','27752T','28461G','22995A']
}

python main.py

Output is in folder output based on the name of the cluster in clusters dict.

Requirements

Requires internet connection since it queries https://github.com/cevo-public/LAPIS

Owner

neherlab

Computational biology at the Biozentrum, Basel

GitHub Repository

This is an example of how to automate Ridit Analysis for a dataset with large amount of questions and many item attributes

1 Nov 17, 2021

Tablexplore is an application for data analysis and plotting built in Python using the PySide2/Qt toolkit.

81 Dec 26, 2022

fds is a tool for Data Scientists made by DAGsHub to version control data and code at once.

Fast Data Science, AKA fds, is a CLI for Data Scientists to version control data and code at once, by conveniently wrapping git and dvc

359 Dec 22, 2022

A model checker for verifying properties in epistemic models

Epistemic Model Checker This is a model checker for verifying properties in epistemic models. The goal of the model checker is to check for Pluralisti

2 Dec 22, 2021

Advanced Pandas Vault — Utilities, Functions and Snippets (by @firmai).

PandasVault ⁠— Advanced Pandas Functions and Code Snippets The only Pandas utility package you would ever need. It has no exotic external dependencies

374 Jan 07, 2023

Python for Data Analysis, 2nd Edition

Python for Data Analysis, 2nd Edition Materials and IPython notebooks for "Python for Data Analysis" by Wes McKinney, published by O'Reilly Media Buy

18.6k Jan 08, 2023

PipeChain is a utility library for creating functional pipelines.

PipeChain Motivation PipeChain is a utility library for creating functional pipelines. Let's start with a motivating example. We have a list of Austra

2 Aug 07, 2022

Finds, downloads, parses, and standardizes public bikeshare data into a standard pandas dataframe format

Finds, downloads, parses, and standardizes public bikeshare data into a standard pandas dataframe format.

2 Dec 01, 2021

PLStream: A Framework for Fast Polarity Labelling of Massive Data Streams

PLStream: A Framework for Fast Polarity Labelling of Massive Data Streams Motivation When dataset freshness is critical, the annotating of high speed

4 Aug 02, 2022

This tool parses log data and allows to define analysis pipelines for anomaly detection.

logdata-anomaly-miner This tool parses log data and allows to define analysis pipelines for anomaly detection. It was designed to run the analysis wit

32 Nov 27, 2022

Elementary is an open-source data reliability framework for modern data teams. The first module of the framework is data lineage.

Data lineage made simple, reliable, and automated. Effortlessly track the flow of data, understand dependencies and analyze impact. Features Visualiza

898 Jan 09, 2023

Get mutations in cluster by querying from LAPIS API

Related tags

Overview

Cluster Mutation Script

Usage

Requirements

Owner

neherlab

This is an example of how to automate Ridit Analysis for a dataset with large amount of questions and many item attributes

Tablexplore is an application for data analysis and plotting built in Python using the PySide2/Qt toolkit.

fds is a tool for Data Scientists made by DAGsHub to version control data and code at once.

A model checker for verifying properties in epistemic models

Advanced Pandas Vault — Utilities, Functions and Snippets (by @firmai).

Python for Data Analysis, 2nd Edition

PipeChain is a utility library for creating functional pipelines.

Finds, downloads, parses, and standardizes public bikeshare data into a standard pandas dataframe format

PLStream: A Framework for Fast Polarity Labelling of Massive Data Streams

This tool parses log data and allows to define analysis pipelines for anomaly detection.

Elementary is an open-source data reliability framework for modern data teams. The first module of the framework is data lineage.

NumPy and Pandas interface to Big Data

Exploratory Data Analysis for Employee Retention Dataset

Tools for working with MARC data in Catalogue Bridge.

A highly efficient and modular implementation of Gaussian Processes in PyTorch

Desafio proposto pela IGTI em seu bootcamp de Cloud Data Engineer

Import, connect and transform data into Excel

pandas: powerful Python data analysis toolkit

The Spark Challenge Student Check-In/Out Tracking Script

Provide a market analysis (R)