Optimal Randomized Canonical Correlation Analysis

Last update: Nov 21, 2021

Related tags

Machine Learning ORCCA

Overview

ORCCA

Optimal Randomized Canonical Correlation Analysis

This project is a Python implementation of the ORCCA algorithm.

It depends on NumPy for matrix computations and works with any CCA package. We recommend

cca_zoo https://github.com/jameschapman19/cca_zoo

$ pip install cca-zoo

for the CCA computation, since it provides several other CCA algorithms that can be used for comparison. To remove the cca_zoo dependency, delete line 2 of the script and the ORCCA_cor function, then use another CCA package of your choice.
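If you prefer not to install a separate CCA package at all, canonical correlations can also be computed with NumPy alone. The following is a minimal sketch and not the code used in this project: the function name `canonical_correlations` and the `reg` parameter are illustrative assumptions, and the inputs are assumed to be two feature matrices with one row per sample.

```python
import numpy as np

def canonical_correlations(A, B, reg=1e-8):
    """Canonical correlations between A (n x p) and B (n x q).

    Hypothetical helper for illustration; `reg` is a small ridge term
    (an assumption) that keeps the covariance matrices invertible.
    """
    # Center each view
    A = A - A.mean(axis=0)
    B = B - B.mean(axis=0)
    n = A.shape[0]

    # Regularized covariance and cross-covariance matrices
    Caa = A.T @ A / n + reg * np.eye(A.shape[1])
    Cbb = B.T @ B / n + reg * np.eye(B.shape[1])
    Cab = A.T @ B / n

    # Whiten with Cholesky factors: Caa = La La^T, Cbb = Lb Lb^T
    La = np.linalg.cholesky(Caa)
    Lb = np.linalg.cholesky(Cbb)

    # T = La^{-1} Cab Lb^{-T}; its singular values are the canonical correlations
    T = np.linalg.solve(La, Cab)
    T = np.linalg.solve(Lb, T.T).T
    rho = np.linalg.svd(T, compute_uv=False)
    return np.clip(rho, 0.0, 1.0)

# Usage: B is an invertible linear transform of A, so the two views are
# perfectly correlated and every canonical correlation is close to 1.
rng = np.random.default_rng(0)
A = rng.normal(size=(500, 3))
R = np.array([[2.0, 1.0, 0.0], [0.0, 1.0, 1.0], [1.0, 0.0, 3.0]])
rho = canonical_correlations(A, A @ R)
```

The function returns min(p, q) correlations in descending order.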

Some working examples of using ORCCA:

Generate the ORCCA mapping for a given pair of datasets X and Y with 5 reselected random features:

sample = ORCCA(X,Y,width1=0.1)

sample.ORCCA_mapping(m=5)
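For readers unfamiliar with random features, the sketch below shows a plain random Fourier feature map for a Gaussian kernel. It is background only: it does not implement ORCCA's reselection step and is not taken from this project's code. The function name `random_fourier_features` is hypothetical, and treating `width` as a Gaussian kernel bandwidth is an assumption that may or may not match the meaning of `width1` above.

```python
import numpy as np

def random_fourier_features(X, m, width, seed=0):
    """Map X (n x d) to m random Fourier features (hypothetical helper).

    Approximates the Gaussian kernel exp(-||x - y||^2 / (2 * width^2)),
    where `width` is assumed to be the kernel bandwidth.
    """
    rng = np.random.default_rng(seed)
    d = X.shape[1]
    # Frequencies drawn from the kernel's spectral density, N(0, I / width^2)
    W = rng.normal(scale=1.0 / width, size=(d, m))
    # Random phases, uniform on [0, 2*pi)
    b = rng.uniform(0.0, 2.0 * np.pi, size=m)
    # Inner products of rows of the result approximate the kernel values
    return np.sqrt(2.0 / m) * np.cos(X @ W + b)

# Usage: map a small dataset to 5 random features
X_demo = np.random.default_rng(1).normal(size=(20, 3))
Z_demo = random_fourier_features(X_demo, m=5, width=1.5)
```

With only a handful of features the kernel approximation is coarse, which is the motivation for choosing the features carefully rather than keeping an arbitrary random draw.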

Calculate the canonical correlations for a given pair of datasets X and Y with 5 reselected random features:

sample = ORCCA(X,Y,width1=0.1)

sample.ORCCA_cor(m=5)

Owner

Yinsong Wang

I am a Ph.D. student at Northeastern University advised by Prof. Shahin Shahrampour. My research interest lies in general machine learning.

