Application of the L2HMC algorithm to simulations in lattice QCD.

Last update: Dec 14, 2022

Overview

l2hmc-qcd

📊 Slides

Recent talk on Training Topological Samplers for Lattice Gauge Theory from the Machine Learning for High Energy Physics, on and off the Lattice @ ect* Trento (09/30/2021)

📒 Example Notebook

Accepted to the Deep Learning for Simulation (SimDL) Workshop at ICLR 2021
- 📚 : arXiv:2105.03418
- 📊 : poster

Overview

The L2HMC algorithm aims to improve upon HMC by optimizing a carefully chosen loss function which is designed to minimize autocorrelations within the Markov Chain, thereby improving the efficiency of the sampler.

This work is based on the original implementation: brain-research/l2hmc/.

A detailed description of the L2HMC algorithm can be found in the paper:

Generalizing Hamiltonian Monte Carlo with Neural Network

by Daniel Levy, Matt D. Hoffman and Jascha Sohl-Dickstein.

Broadly, given an analytically described target distribution, π(x), L2HMC provides a statistically exact sampler that:

Quickly converges to the target distribution (fast burn-in).
Quickly produces uncorrelated samples (fast mixing).
Is able to efficiently mix between energy levels.
Is capable of traversing low-density zones to mix between modes (often difficult for generic HMC).

L2HMC for LatticeQCD

Goal: Use L2HMC to efficiently generate gauge configurations for calculating observables in lattice QCD.

A detailed description of the (ongoing) work to apply this algorithm to simulations in lattice QCD (specifically, a 2D U(1) lattice gauge theory model) can be found in doc/main.pdf.

Organization

Dynamics / Network

The base class for the augmented L2HMC leapfrog integrator is implemented in the BaseDynamics (a tf.keras.Model object).

The GaugeDynamics is a subclass of BaseDynamics containing modifications for the 2D U(1) pure gauge theory.

The network is defined in l2hmc-qcd/network/functional_net.py.

Network Architecture

An illustration of the leapfrog layer updating (x, v) --> (x', v') can be seen below.

Lattice

Lattice code can be found in lattice.py, specifically the GaugeLattice object that provides the base structure on which our target distribution exists.

Additionally, the GaugeLattice object implements a variety of methods for calculating physical observables such as the average plaquette, ɸₚ, and the topological charge Q,

Training

The training loop is implemented in l2hmc-qcd/utils/training_utils.py .

To train the sampler on a 2D U(1) gauge model using the parameters specified in bin/train_configs.json:

$ python3 /path/to/l2hmc-qcd/l2hmc-qcd/train.py --json_file=/path/to/l2hmc-qcd/bin/train_configs.json

Or via the bin/train.sh script provided in bin/.

Features

Distributed training (via horovod): If horovod is installed, the model can be trained across multiple GPUs (or CPUs) by:

#!/bin/bash

TRAINER=/path/to/l2hmc-qcd/l2hmc-qcd/train.py
JSON_FILE=/path/to/l2hmc-qcd/bin/train_configs.json

horovodrun -np ${PROCS} python3 ${TRAINER} --json_file=${JSON_FILE}

Contact

Code author: Sam Foreman

Pull requests and issues should be directed to: saforem2

Citation

If you use this code or found this work interesting, please cite our work along with the original paper:

@misc{foreman2021deep,
      title={Deep Learning Hamiltonian Monte Carlo}, 
      author={Sam Foreman and Xiao-Yong Jin and James C. Osborn},
      year={2021},
      eprint={2105.03418},
      archivePrefix={arXiv},
      primaryClass={hep-lat}
}

@article{levy2017generalizing,
  title={Generalizing Hamiltonian Monte Carlo with Neural Networks},
  author={Levy, Daniel and Hoffman, Matthew D. and Sohl-Dickstein, Jascha},
  journal={arXiv preprint arXiv:1711.09268},
  year={2017}
}

Acknowledgement

This research used resources of the Argonne Leadership Computing Facility, which is a DOE Office of Science User Facility supported under contract DE_AC02-06CH11357. This work describes objective technical results and analysis. Any subjective views or opinions that might be expressed in the work do not necessarily represent the views of the U.S. DOE or the United States Government. Declaration of Interests - None.

Comments

Remove upper bound on python_requires

(I'm moving between meetings so can iterate on this more later, so excuse the very brief Issue for now).

At the moment the project has an upper bound on python_requires

https://github.com/saforem2/l2hmc-qcd/blob/2eb6ee63cc0c53b187e6d716f4c12f418c8b8515/setup.py#L165

Assuming that you're intending l2hmc to be a library and not an application, then I would highly recommend removing this for the reasons summarized in Henry's detailed blog post on the subject.

Congrats on getting l2hmc up on PyPI though! :snake: :rocket:

opened by matthewfeickert 2
Alpha
Pull upstream alpha branch into main

Major changes

new src/ hierarchical module organization

Contains skeleton implementation of 4D SU(3) lattice gauge model

src/l2hmc/lattice/gauge/lattice.py

Framework independent configuration

Unified configuration system simplifies logic, same configs used for both tensorflow and pytorch experiments

Plan to be able to specify which backend to use through config option

Unified (and framework independent) configurations between tensorflow and pytorch implementations

Definitions can be found in l2hmc-qcd/src/l2hmc/configs.py

Note: This is still very much a WIP. Many existing features still need to be re-implemented / updated into new code in src/.

Todo

[ ] Write unit tests

[ ] Use simple configs for end-to-end workflow test + integrate into CI

[ ] dynamic learning rate scheduling

[ ] Test 4D SU(3) numpy code

[ ] Write tensorflow and pytorch implementations of LatticeSU3 objects

[ ] Improved / simplified ( / trainable?) annealing schedule

[ ] Distributed training support

[ ] horovod

[ ] DDP for pytorch implementation

[ ] DeepSpeed from Microsoft??

[ ] Testing / inference logic

[ ] Automatic checkpointing

[ ] Metric logging

[ ] Tensorboard?

[ ] Sacred?

[ ] build custom dashboard? plot.ly?

[ ] Setup packaging / distribution through pip

[ ] Resolve issue
opened by saforem2 1
Alpha
Major upgrades to how training is initialized in l2hmc-qcd/utils/training_utils.py, particularly when trying to restore a model from an existing checkpoint.

Significant upgrades to logging mechanics in l2hmc-qcd/utils/logger.py and l2hmc-qcd/utils/logger_config.py which now use a RichHandler to nicely format log messages characterized by severity, including automatic file rotation, etc.

Improvements to test suite in l2hmc-qcd/tests/test_training.py, more robust tests on larger set of possible cases

TODO: Automate using github actions for CI

Improvements to l2hmc-qcd/dynamics/gauge_dynamics.py but still a WIP
opened by saforem2 1
Rich
General improvements, rewrote logging methods to use Rich for better formatting.

Adds dynamic (trainable) step size eps for each separate x and v updates, seems to generally increase the total energy towards the middle of the trajectory but it remains unclear if this corresponds to an improvement in the tunneling rate

Adds methods for calculating autocorrelations of the topological charge, as well as notebooks for generating the plots

Updates to the writeup in doc/main.pdf

Will likely be last changes to writeup before public release of official draft
opened by saforem2 1
Dev
Updates to README

Ability to load network with new training instance

Updates to doc/, removes old sections related to debugging the bias in the plaquette
opened by saforem2 1
Saveable model
Complete rewrite of dynamics.xnet and dynamics.vnet models to use tf.keras.functional Models.

Additional changes include:

Non-Compact Projection update for gauge fields

Ability to specify convolution structure to be prepended at beginning of gauge network
opened by saforem2 1
Dev

Removes models/gauge_model.py entirely.

Instead, a base dynamics class is implemented in dynamics/dynamics.py, and an example subclass is provided in dynamics/gauge_dynamics.py.

opened by saforem2 1
Split networks

Major rewrite of existing codebase.

This pull request updates everything to be compatible with tensorflow >= 2.2 and removes a bunch of redundant legacy code.

opened by saforem2 1
Dev
Dynamics object is now compatible with tf >= 2.0

Running inference on trained model with tensorflow now creates identical graphs and summary files to numpy inference code

Inference with numpy now uses object oriented structure

Adds LaTeX + PDF documentation in doc/
opened by saforem2 1
Cooley dev

Adds new GaugeNetwork architecture as the default for training GaugeModel

Additionally, replaces pickle with joblib for saving data as .z compressed files (as opposed to .pkl files).

opened by saforem2 1
Testing

Implemented nnehmc_loss calculation for an alternative loss function using the approach suggested in https://infoscience.epfl.ch/record/264887/files/robust_parameter_estimation.pdf.

This modified loss function can be chosen (instead of the standard loss described in the original paper) by passing --use_nnehmc_loss as a command line argument.

opened by saforem2 1

Packaging and PyPI distribution?

As you've made a library and are using it as such:

# snippet from toy_distributions.ipynb

# append parent directory to `sys.path`
# to load from modules in `../l2hmc-qcd/`
module_path = os.path.join('..')
if module_path not in sys.path:
    sys.path.append(module_path)

# Local imports
from utils.attr_dict import AttrDict
from utils.training_utils import train_dynamics
from dynamics.config import DynamicsConfig
from dynamics.base_dynamics import BaseDynamics
from dynamics.generic_dynamics import GenericDynamics
from network.config import LearningRateConfig
from config import (State, NetWeights, MonteCarloStates,
                    BASE_DIR, BIN_DIR, TF_FLOAT)

from utils.distributions import (plot_samples2D, contour_potential,
                                 two_moons_potential, sin_potential,
                                 sin_potential1, sin_potential2)

do you have any plans and/or interest in packaging it as a Python library so it can either be pip installed from GitHub or be distributed on PyPI?

opened by matthewfeickert 5

Releases(0.12.0)

0.12.0(Aug 9, 2022)

Source code(tar.gz)
Source code(zip)
0.8.0(Apr 14, 2022)

Full Changelog: https://github.com/saforem2/l2hmc-qcd/compare/0.7.0...0.8.0
Source code(tar.gz)
Source code(zip)
0.7.0(Apr 14, 2022)

pypi release: v0.7.0

Full Changelog: https://github.com/saforem2/l2hmc-qcd/compare/0.4.0...0.7.0
Source code(tar.gz)
Source code(zip)
0.4.0(Apr 8, 2022)

Full Changelog: https://github.com/saforem2/l2hmc-qcd/compare/0.3.0...0.4.0
Source code(tar.gz)
Source code(zip)

Owner

Sam Foreman

Computational science Postdoc at Argonne National Laboratory working on applying machine learning to simulations in lattice QCD.

GitHub Repository https://samforeman.me/l2hmc-qcd

A simple log parser and summariser for IIS web server logs

IISLogFileParser A basic parser tool for IIS Logs which summarises findings from the log file. Inspired by the Gist https://gist.github.com/wh13371/e7

2 Mar 26, 2022

Read and write layered TIFF ImageSourceData and ImageResources tags

Read and write layered TIFF ImageSourceData and ImageResources tags Psdtags is a Python library to read and write the Adobe Photoshop(r) specific Imag

4 Feb 05, 2022

ReferFormer - Official Implementation of ReferFormer

The official implementation of the paper: Language as Queries for Referring Vide

232 Dec 29, 2022

Official repository for the NeurIPS 2021 paper Get Fooled for the Right Reason: Improving Adversarial Robustness through a Teacher-guided curriculum Learning Approach

Get Fooled for the Right Reason Official repository for the NeurIPS 2021 paper Get Fooled for the Right Reason: Improving Adversarial Robustness throu

1 Apr 25, 2022

A benchmark dataset for mesh multi-label-classification based on cube engravings introduced in MeshCNN

Double Cube Engravings This script creates a dataset for multi-label mesh clasification, with an intentionally difficult setup for point cloud classif

1 Nov 30, 2021

The pytorch implementation of SOKD (BMVC2021).

Semi-Online Knowledge Distillation Implementations of SOKD. Requirements This repo was tested with Python 3.8, PyTorch 1.5.1, torchvision 0.6.1, CUDA

4 Dec 19, 2021

Pre-trained NFNets with 99% of the accuracy of the official paper

NFNet Pytorch Implementation This repo contains pretrained NFNet models F0-F6 with high ImageNet accuracy from the paper High-Performance Large-Scale

133 Dec 09, 2022

Neural-Pull: Learning Signed Distance Functions from Point Clouds by Learning to Pull Space onto Surfaces(ICML 2021)

Neural-Pull: Learning Signed Distance Functions from Point Clouds by Learning to Pull Space onto Surfaces(ICML 2021) This repository contains the code

149 Dec 15, 2022

Pytorch implementation of NeurIPS 2021 paper: Geometry Processing with Neural Fields.

Geometry Processing with Neural Fields Pytorch implementation for the NeurIPS 2021 paper: Geometry Processing with Neural Fields Guandao Yang, Serge B

162 Dec 16, 2022

Code for paper: Towards Tokenized Human Dynamics Representation

Video Tokneization Codebase for video tokenization, based on our paper Towards Tokenized Human Dynamics Representation. Prerequisites (tested under Py

20 May 31, 2022

Vision Transformer for 3D medical image registration (Pytorch).

ViT-V-Net: Vision Transformer for Volumetric Medical Image Registration keywords: vision transformer, convolutional neural networks, image registratio

192 Dec 20, 2022

This is a repository for a Semantic Segmentation inference API using the Gluoncv CV toolkit

BMW Semantic Segmentation GPU/CPU Inference API This is a repository for a Semantic Segmentation inference API using the Gluoncv CV toolkit. The train

56 Nov 24, 2022

Here I will explain the flow to deploy your custom deep learning models on Ultra96V2.

Xilinx_Vitis_AI This repo will help you to Deploy your Deep Learning Model on Ultra96v2 Board. Prerequisites Vitis Core Development Kit 2019.2 This co

1 Feb 08, 2022

Implementation of Nalbach et al. 2017 paper.

Deep Shading Convolutional Neural Networks for Screen-Space Shading Our project is based on Nalbach et al. 2017 paper. In this project, a set of buffe

17 Sep 08, 2022

Hierarchical Clustering: O(1)-Approximation for Well-Clustered Graphs

Hierarchical Clustering: O(1)-Approximation for Well-Clustered Graphs This repository contains code to accompany the paper "Hierarchical Clustering: O

3 Sep 25, 2022

This project uses ViT to perform image classification tasks on DATA set CIFAR10.

Vision-Transformer-Multiprocess-DistributedDataParallel-Apex Introduction This project uses ViT to perform image classification tasks on DATA set CIFA

3 Jun 03, 2022

Model Zoo for AI Model Efficiency Toolkit

We provide a collection of popular neural network models and compare their floating point and quantized performance.

137 Jan 03, 2023

Contains modeling practice materials and homework for the Computational Neuroscience course at Okinawa Institute of Science and Technology

A310 Computational Neuroscience - Okinawa Institute of Science and Technology, 2022 This repository contains modeling practice materials and homework

1 Jan 24, 2022

Self-Learning - Books Papers, Courses & more I have to learn soon

968 Jan 02, 2022

Code for "On Memorization in Probabilistic Deep Generative Models"

On Memorization in Probabilistic Deep Generative Models This repository contains the code necessary to reproduce the experiments in On Memorization in

3 Jun 09, 2022

Application of the L2HMC algorithm to simulations in lattice QCD.

Related tags

Overview

l2hmc-qcd

📊 Slides

📒 Example Notebook

Overview

L2HMC for LatticeQCD

Organization

Dynamics / Network

Network Architecture

Lattice

Training

Features

Contact

Citation

Acknowledgement

Comments

Major changes

Todo

Releases(0.12.0)

0.12.0(Aug 9, 2022)

0.8.0(Apr 14, 2022)

0.7.0(Apr 14, 2022)

0.4.0(Apr 8, 2022)

Owner

Sam Foreman

A simple log parser and summariser for IIS web server logs

Read and write layered TIFF ImageSourceData and ImageResources tags

ReferFormer - Official Implementation of ReferFormer

Official repository for the NeurIPS 2021 paper Get Fooled for the Right Reason: Improving Adversarial Robustness through a Teacher-guided curriculum Learning Approach

A benchmark dataset for mesh multi-label-classification based on cube engravings introduced in MeshCNN

The pytorch implementation of SOKD (BMVC2021).

Pre-trained NFNets with 99% of the accuracy of the official paper

Neural-Pull: Learning Signed Distance Functions from Point Clouds by Learning to Pull Space onto Surfaces(ICML 2021)

Pytorch implementation of NeurIPS 2021 paper: Geometry Processing with Neural Fields.

Code for paper: Towards Tokenized Human Dynamics Representation

Vision Transformer for 3D medical image registration (Pytorch).

This is a repository for a Semantic Segmentation inference API using the Gluoncv CV toolkit

Here I will explain the flow to deploy your custom deep learning models on Ultra96V2.

Implementation of Nalbach et al. 2017 paper.

Hierarchical Clustering: O(1)-Approximation for Well-Clustered Graphs

This project uses ViT to perform image classification tasks on DATA set CIFAR10.

Model Zoo for AI Model Efficiency Toolkit

Contains modeling practice materials and homework for the Computational Neuroscience course at Okinawa Institute of Science and Technology

Self-Learning - Books Papers, Courses & more I have to learn soon

Code for "On Memorization in Probabilistic Deep Generative Models"