Using fully convolutional networks for semantic segmentation with caffe for the cityscapes dataset

Last update: Jun 06, 2022

Overview

Using fully convolutional networks for semantic segmentation (Shelhamer et al.) with caffe for the cityscapes dataset

How to get started

Download the cityscapes dataset and the vgg-16-layer net
Modify the images in the dataset with cut_images.py or downscale_images.py for less resource demanding training and evaluation
Create the 32 pixel stride net with net_32.py
Modify the paths in train.txt and val.txt (first line: path to training/validation images, second line: path to annotations)
Start training with solve_start.py
Run evaluate_models.py to evaluate your model or create_eval_images.py to create images with pixel label ids

Sources

Fully Convolutional Models for Semantic Segmentation:

Shelhamer, Evan, Jonathon Long, and Trevor Darrell. "Fully Convolutional Networks for Semantic Segmentation." PAMI, 2016, URL http://fcn.berkeleyvision.org

Cityscapes Dataset (Semantic Understanding of Urban Street Scenes):

Cordts, Marius, et al. "The cityscapes dataset." CVPR Workshop on The Future of Datasets in Vision. 2015, URL https://www.cityscapes-dataset.com

Caffe Deep Learning Framework:

Jia, Yangqing, et al. "Caffe: Convolutional architecture for fast feature embedding." Proceedings of the 22nd ACM international conference on Multimedia. ACM, 2014, URL http://caffe.berkeleyvision.org

Using fully convolutional networks for semantic segmentation with caffe for the cityscapes dataset

Related tags

Overview

How to get started

Sources

Fully Convolutional Models for Semantic Segmentation:

Cityscapes Dataset (Semantic Understanding of Urban Street Scenes):

Caffe Deep Learning Framework:

Owner

Simon Guist

Omnidirectional Scene Text Detection with Sequential-free Box Discretization (IJCAI 2019). Including competition model, online demo, etc.

Puzzle-CAM: Improved localization via matching partial and full features.

MRI reconstruction (e.g., QSM) using deep learning methods

Linear image-to-image translation

Code for Neurips2021 Paper "Topology-Imbalance Learning for Semi-Supervised Node Classification".

Learning RAW-to-sRGB Mappings with Inaccurately Aligned Supervision (ICCV 2021)

SMCA replication There are no extra compiled components in SMCA DETR and package dependencies are minimal

Yet Another Reinforcement Learning Tutorial

Code for "Retrieving Black-box Optimal Images from External Databases" (WSDM 2022)

Container : Context Aggregation Network

Pre-trained NFNets with 99% of the accuracy of the official paper

This repository contains the official implementation code of the paper Transformer-based Feature Reconstruction Network for Robust Multimodal Sentiment Analysis

This porject is intented to build the most accurate model for predicting the porbability of loan default

An implementation of paper `Real-time Convolutional Neural Networks for Emotion and Gender Classification` with PaddlePaddle.

E2C implementation in PyTorch

Deep Image Search is an AI-based image search engine that includes deep transfor learning features Extraction and tree-based vectorized search.

Bayesian Generative Adversarial Networks in Tensorflow

Spatial Single-Cell Analysis Toolkit

I have created this Virtual Paint Program, in this you can paint(draw) on your screen using hand gestures, created in Python-3 using OpenCV and Mediapipe library. Gestures :- Index Finger for drawing and Index+Middle Finger for changing position and objects.

Multi-Task Temporal Shift Attention Networks for On-Device Contactless Vitals Measurement (NeurIPS 2020)