A TensorFlow implementation of FCN-8s

Last update: Aug 08, 2022

Contents

Overview
Examples and demo video
Dependencies
How to use it
Download pre-trained VGG-16

Overview

This is a TensorFlow implementation of the FCN-8s model architecture for semantic image segmentation introduced by Shelhamer et al. in the paper Fully Convolutional Networks for Semantic Segmentation.

This repository contains only the 'all-at-once' version of the FCN-8s model, which converges significantly faster than the version trained in stages. A convolutionalized VGG-16 model pre-trained on ImageNet classification is provided and serves as the encoder of the FCN-8s. Documentation and a tutorial on how to train and evaluate the model and how to use it for prediction are also provided, and some useful TensorBoard summaries can be recorded out of the box.
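To make the encoder/decoder structure concrete, here is a minimal sketch that only traces the shapes of the class-score maps through the FCN-8s skip fusion as described in the paper. It assumes 'same'-padded layers and input sides divisible by 32, which may not match this repository's exact implementation. The function name `fcn8s_score_shapes` is hypothetical and not part of the repository.

```python
def fcn8s_score_shapes(height, width, num_classes):
    """Trace (height, width, channels) of the score maps through an FCN-8s decoder.

    Assumption: every pooling layer halves the spatial size exactly, so the
    encoder outputs at pool3, pool4 and pool5 have strides 8, 16 and 32.
    """
    if height % 32 or width % 32:
        raise ValueError("this sketch assumes input sides divisible by 32")

    def at_stride(stride):
        return (height // stride, width // stride, num_classes)

    pool3, pool4, pool5 = at_stride(8), at_stride(16), at_stride(32)

    steps = [("score pool5 (stride 32)", pool5)]

    # Upsample the coarsest scores 2x and add the pool4 scores.
    fused16 = (pool5[0] * 2, pool5[1] * 2, num_classes)
    assert fused16 == pool4
    steps.append(("upsample 2x, add pool4 scores (stride 16)", fused16))

    # Upsample again 2x and add the pool3 scores.
    fused8 = (fused16[0] * 2, fused16[1] * 2, num_classes)
    assert fused8 == pool3
    steps.append(("upsample 2x, add pool3 scores (stride 8)", fused8))

    # A final 8x upsampling restores the input resolution, hence "FCN-8s".
    output = (fused8[0] * 8, fused8[1] * 8, num_classes)
    steps.append(("upsample 8x to input resolution", output))
    return steps


if __name__ == "__main__":
    # Illustrative values only: a 512x1024 input with 19 classes.
    for name, shape in fcn8s_score_shapes(512, 1024, 19):
        print(f"{name}: {shape}")
```

In the 'all-at-once' version, all three score branches are trained jointly from the start rather than being added one stage at a time.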

Examples and demo video

Below are some prediction examples of the model trained on the Cityscapes dataset for 13,000 steps at batch size 16, at which point the model achieves a mean IoU of 38.2% on the validation set. This is far from convergence, of course; the purpose of these examples is only to demonstrate that the code works and that the model learns. You can watch the model in action on the Cityscapes demo videos here.
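For readers unfamiliar with the metric, here is a minimal NumPy sketch of how a mean IoU (intersection over union, averaged over classes) is commonly computed from a confusion matrix. It illustrates the metric only and is not necessarily how this repository's evaluation code works. The function names `confusion_matrix` and `mean_iou` are hypothetical, and the sketch assumes that pixels with labels outside `[0, num_classes)` are ignored.

```python
import numpy as np


def confusion_matrix(y_true, y_pred, num_classes):
    """Count pixels per (true class, predicted class) pair."""
    y_true = np.asarray(y_true).ravel()
    y_pred = np.asarray(y_pred).ravel()
    # Assumption: ground-truth labels outside [0, num_classes) mark ignored pixels.
    valid = (y_true >= 0) & (y_true < num_classes)
    index = num_classes * y_true[valid] + y_pred[valid]
    counts = np.bincount(index, minlength=num_classes ** 2)
    return counts.reshape(num_classes, num_classes)


def mean_iou(cm):
    """Average the per-class IoU over all classes that occur at all."""
    true_positives = np.diag(cm).astype(np.float64)
    # Union = predicted as the class + labeled as the class - counted twice.
    union = cm.sum(axis=0) + cm.sum(axis=1) - true_positives
    present = union > 0
    return float((true_positives[present] / union[present]).mean())


if __name__ == "__main__":
    y_true = [[0, 0, 1], [1, 2, 2]]
    y_pred = [[0, 1, 1], [1, 2, 0]]
    print(mean_iou(confusion_matrix(y_true, y_pred, num_classes=3)))
```

In this toy example the per-class IoUs are 1/3, 2/3 and 1/2, so the mean IoU is 0.5.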

Dependencies

Python 3.x
TensorFlow 1.x
NumPy
SciPy
OpenCV (for data augmentation)
tqdm
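A quick way to see which of the packages listed above are missing from an environment is sketched below. It uses only the standard library and checks whether each module can be found, without importing it. The mapping and the function name `missing_dependencies` are illustrative and not part of this repository. The sketch does not check versions, so it will not tell you whether the installed TensorFlow is a 1.x release.

```python
import importlib.util

# Import names for the listed packages; "cv2" is the module name OpenCV installs.
REQUIRED_MODULES = {
    "TensorFlow": "tensorflow",
    "NumPy": "numpy",
    "SciPy": "scipy",
    "OpenCV": "cv2",
    "tqdm": "tqdm",
}


def missing_dependencies(required=REQUIRED_MODULES):
    """Return the display names of packages whose module cannot be found."""
    return [
        name
        for name, module in required.items()
        if importlib.util.find_spec(module) is None
    ]


if __name__ == "__main__":
    missing = missing_dependencies()
    print("Missing:", ", ".join(missing) if missing else "none")
```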

How to use it

fcn8s_tutorial.ipynb explains how to train and evaluate the model and how to make and visualize predictions.

Download pre-trained VGG-16

You can download the pre-trained, convolutionalized VGG-16 model here.

Owner

Pierluigi Ferrari
