Our VMAgent is a platform for exploiting Reinforcement Learning (RL) on Virtual Machine (VM) scheduling tasks.

Last update: Dec 12, 2022

Overview

VMAgent is a platform for exploiting Reinforcement Learning (RL) on Virtual Machine (VM) scheduling tasks. VMAgent is constructed based on one month real VM scheduling dataset called Huawei-East-1 from HUAWEI Cloud and it contains multiple practicle VM scheduling scenarios (such as Fading, Rcovering, etc). These scenarios also correspond to the challanges in the RL. Exploiting the design of RL methods in these secenarios help both the RL and VM scheduling communities. To emphasis, more details about VMAgent can be found in our paper VMAgent: Scheduling Simulator for Reinforcement Learning. Our another paper Learning to Schedule Multi-NUMA Virtual Machines via Reinforcement Learning has employed this VMAgent simultor to design RL-based VM scheduling algorithms.

Key Components of VMAgent:

SchedGym (Simulator): it provides many practical scenarios and flexible configurations to define custom scenarios.
SchedAgent (Algorithms): it provides many popular RL methods as the baselines.
SchedVis (Visulization): it provides the visualization of schedlueing dynamics on many metrics.

Scenarios and Baselines

The VMAgent provides multiple practical scenarios:

Scenario	Allow-Deletion	Allow-Expansion	Server Num
Fading	False	False	Small
Recovering	True	False	Small
Expanding	True	True	Small
Recovering-L	True	False	Large

Researchers can also flexibly customized their scenarios in the vmagent/config/ folder.

Besides, we provides many baselines for quick startups. It includes FirstFit, BestFit, DQN, PPO, A2C and SAC. More baselines is coming.

Installation

git clone [email protected]:mail-ecnu/VMAgent.git
cd VMAgent
conda env create -f conda_env.yml
conda activate VMAgent-dev
python3 setup.py develop

Quick Examples

In this quick example, we show how to train a dqn agent in a fading scenario. For more examples and the configurations' concrete definitions, we refer readers to our docs.

config/fading.yaml:

N: 5
cpu: 40 
mem: 90
allow_release: False

config/algs/dqn.yaml:

mac: 'vectormac'
learner: 'q_learner'
agent: 'DQNAgent'

Then

python train.py --env=fading --alg=dqn

It provides the first VM scheudling simulator based on the one month east china data in HUAWEI Cloud. It includes three scenarios in practical cloud: Recovering, Fading and Expansion. Our video is at video. Some demonstrations are listed:

Docs

For more information of our VMAgent, we refer the readers to the document. It describes the detail of SchedGym, SchedAgent and SchedVis.

Data

We collect one month scheduling data in east china region of huawei cloud. The format and the stastical analysis of the data are presented in the docs. one month east china data in huawei cloud.

Visualization

For visualization, see the schedvis directory in detail.

References

Junjie Sheng, Shengliang Cai, Haochuan Cui, Wenhao Li, Yun Hua, Bo Jin, Wenli Zhou, Yiqiu Hu, Lei Zhu, Qian Peng, Hongyuan Zha and Xiangfeng Wang, VMAgent: Scheduling Simulator for Reinforcement Learning. arXiv preprint arXiv:2112.04785, 2021.
Junjie Sheng, Yiqiu Hu, Wenli Zhou, Lei Zhu, Bo Jin, Jun Wang and Xiangfeng Wang, Learning to Schedule Multi-NUMA Virtual Machines via Reinforcement Learning, Pattern Recognition, 121, 2021, pp.108254.

License

Licensed under the MIT License.

Our VMAgent is a platform for exploiting Reinforcement Learning (RL) on Virtual Machine (VM) scheduling tasks.

Related tags

Overview

Scenarios and Baselines

Installation

Quick Examples

Docs

Data

Visualization

References

License

Owner

Gems & Holiday Package Prediction

Tensorflow implementation of "Learning Deep Features for Discriminative Localization"

PyTorch implementation of HDN(Homography Decomposition Networks) for planar object tracking

BC3407-Group-5-Project - BC3407 Group Project With Python

CvT-ASSD: Convolutional vision-Transformerbased Attentive Single Shot MultiBox Detector (ICTAI 2021 CCF-C 会议)The 33rd IEEE International Conference on Tools with Artificial Intelligence

TensorFlow implementation of "TokenLearner: What Can 8 Learned Tokens Do for Images and Videos?"

ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information

[ICCV2021] Official Pytorch implementation for SDGZSL (Semantics Disentangling for Generalized Zero-Shot Learning)

Generic U-Net Tensorflow implementation for image segmentation

A Dynamic Residual Self-Attention Network for Lightweight Single Image Super-Resolution

CONditionals for Ordinal Regression and classification in tensorflow

Automatic Video Captioning Evaluation Metric --- EMScore

This repository contains all the code and materials distributed in the 2021 Q-Programming Summer of Qode.

Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)

Face Mask Detection System built with OpenCV, TensorFlow using Computer Vision concepts

Code for Ditto: Building Digital Twins of Articulated Objects from Interaction

Repo público onde postarei meus estudos de Python, buscando aprender por meio do compartilhamento do aprendizado!

Code release for "Transferable Semantic Augmentation for Domain Adaptation" (CVPR 2021)

Keepsake is a Python library that uploads files and metadata (like hyperparameters) to Amazon S3 or Google Cloud Storage

Attentive Implicit Representation Networks (AIR-Nets)