Predict halo masses from simulations via graph neural networks

Last update: Nov 15, 2022

Overview

HaloGraphNet

Predict halo masses from simulations via Graph Neural Networks.

Given a dark matter halo and its galaxies, creates a graph with information about the 3D position, stellar mass and other properties. Then, it trains a Graph Neural Network to predict the mass of the host halo. Data are taken from the CAMELS hydrodynamic simulations, specially suited for Machine Learning purposes. Neural nets architectures are defined making use of the package PyTorch-geometric.

See the papers arXiv:2111.08683 for more details.

Scripts

Here is a brief description of the codes included:

main.py: main driver to train and test the network.
onlytest.py: tests a pre-trained model.
hyperparams_optimization.py: optimize the hyperparameters using optuna.
camelsplots.py: plot several features of the CAMELS data.
captumtest.py: studies interpretability of the model.
halomass.py: using models trained in CAMELS, predicts the mass of real halos, such as the Milky Way and Andromeda.
visualize_graphs.py: display several halos as graphs in 2D or 3D.

The folder Hyperparameters includes files with lists of default hyperparameters, to be modified by the user. The current files contain the best values for each CAMELS simulation suite and set separately, obtained from hyperparameter optimization.

The folder Models includes some pre-trained models for the hyperparameters defined in Hyperparameters.

In the folder Source, several auxiliary routines are defined:

constants.py: basic constants and initialization.
load_data.py: contains routines to load data from simulation files.
plotting.py: includes functions for displaying the loss evolution and the results from the neural nets.
networks.py: includes the definition of the Graph Neural Networks architectures.
training.py: includes routines for training and testing the net.
galaxies.py: contains data for galaxies from the Milky Way and Andromeda halos.

Requisites

The libraries required for training the models and compute some statistics are:

numpy
pytorch-geometric
matplotlib
scipy
sklearn
optuna (only for optimization in hyperparams_optimization.py)
astropy (only for MW and M31 data in Source/galaxies.py)
captum (only for interpretability in captumtest.py)

Usage

These are some advices to employ the scripts described above:

To perform a search of the optimal hyperparameters, run hyperparams_optimization.py.
To train a model with a given set of parameters defined in params.py, run main.py.
Once a model is trained, run onlytest.py to test in the training simulation suite and cross test it in the other one included in CAMELS (IllustrisTNG and SIMBA).
Run captumtest.py to study the interpretability of the models, feature importance and saliency graphs.
Run halomass.py to infer the mass of the Milky Way and Andromeda, whose data are defined in Source/galaxies.py. For this, note that only models without the stellar mass radius as feature are considered.

Citation

If you use the code, please link this repository, and cite arXiv:2111.08683 and the DOI 10.5281/zenodo.5676528.

Contact

For comments, questions etc. you can contact me at [email protected].

Releases(v1.0)

v1.0(Apr 26, 2022)

Release version of the code.
Source code(tar.gz)
Source code(zip)

Predict halo masses from simulations via graph neural networks

Related tags

Overview

HaloGraphNet

Scripts

Requisites

Usage

Citation

Contact

You might also like...

[CIKM 2019] Code and dataset for "Fi-GNN: Modeling Feature Interactions via Graph Neural Networks for CTR Prediction"

Implementation of "GNNAutoScale: Scalable and Expressive Graph Neural Networks via Historical Embeddings" in PyTorch

Source code of NeurIPS 2021 Paper ''Be Confident! Towards Trustworthy Graph Neural Networks via Confidence Calibration''

Official Implementation of "LUNAR: Unifying Local Outlier Detection Methods via Graph Neural Networks"

My published benchmark for a Kaggle Simulations Competition

Urban mobility simulations with Python3, RLlib (Deep Reinforcement Learning) and Mesa (Agent-based modeling)

This project aims to be a handler for input creation and running of multiple RICEWQ simulations.

TUPÃ was developed to analyze electric field properties in molecular simulations

Complex-Valued Neural Networks (CVNN)Complex-Valued Neural Networks (CVNN)

Releases(v1.0)

v1.0(Apr 26, 2022)

Owner

Pablo Villanueva Domingo

Adversarial examples to the new ConvNeXt architecture

Spatial Single-Cell Analysis Toolkit

Pytorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021

Code for Massive-scale Decoding for Text Generation using Lattices

Pytorch implementation of the popular Improv RNN model originally proposed by the Magenta team.

Official project website for the CVPR 2021 paper "Exploring intermediate representation for monocular vehicle pose estimation"

[NeurIPS 2020] This project provides a strong single-stage baseline for Long-Tailed Classification, Detection, and Instance Segmentation (LVIS).

Exploit Camera Raw Data for Video Super-Resolution via Hidden Markov Model Inference

Towards Flexible Blind JPEG Artifacts Removal (FBCNN, ICCV 2021)

Based on Yolo's low-power, ultra-lightweight universal target detection algorithm, the parameter is only 250k, and the speed of the smart phone mobile terminal can reach ~300fps+

ROS-UGV-Control-Interface - Control interface which can be used in any UGV

Julia and Matlab codes to simulated all problems in El-Hachem, McCue and Simpson (2021)

A modification of Daniel Russell's notebook merged with Katherine Crowson's hq-skip-net changes

💃 VALSE: A Task-Independent Benchmark for Vision and Language Models Centered on Linguistic Phenomena

PyElastica is the Python implementation of Elastica, an open-source software for the simulation of assemblies of slender, one-dimensional structures using Cosserat Rod theory.

Image super-resolution (SR) is a fast-moving field with novel architectures attracting the spotlight

AI Face Mesh: This is a simple face mesh detection program based on Artificial intelligence.

Deep learned, hardware-accelerated 3D object pose estimation

An official implementation of the paper Exploring Sequence Feature Alignment for Domain Adaptive Detection Transformers

Machine learning notebooks in different subjects optimized to run in google collaboratory