Segmentation vgg16 fcn - cityscapes

Overview

VGGSegmentation

Segmentation vgg16 fcn - cityscapes Priprema skupa

skripta prepare_dataset_downsampled.py

Iz slika cityscapesa izrezuje haubu automobila, i smanjuje sliku na željenu rezoluciju, to zapisuje u tfrecords formatu. Treba zadati putanju do cityscapesa, izlazni direktorij gdje će se spremati tfrecordsi i zadati željenu rezoluciju.

Priprema težina vgg-a

Da bi se model mogao fine-tuneati treba na disku imati spremljene težine mreže (prethodno naučene na nekom drugom skupu). One se mogu skinuti s interneta u raznim formatima.

Ja sam ih imala spremljene u sljedećim datotekama: conv1_1_biases.bin conv1_1_weights.bin conv1_2_biases.bin conv1_2_weights.bin conv2_1_biases.bin conv2_1_weights.bin conv2_2_biases.bin conv2_2_weights.bin conv3_1_biases.bin conv3_1_weights.bin conv3_2_biases.bin conv3_2_weights.bin conv3_3_biases.bin conv3_3_weights.bin conv4_1_biases.bin conv4_1_weights.bin conv4_2_biases.bin conv4_2_weights.bin conv4_3_biases.bin conv4_3_weights.bin conv5_1_biases.bin conv5_1_weights.bin conv5_2_biases.bin conv5_2_weights.bin conv5_3_biases.bin conv5_3_weights.bin fc6_biases.bin fc6_weights.bin fc7_biases.bin fc7_weights.bin fc8_biases.bin fc8_weights.bin

Ako će se težine učitavati iz ckpt. datoteke npr vgg_16.ckpt, onda će i u kodu trebati mjenjati metodu create_init_op unutar model.py

Konfiguracija

config/cityscapes.py - primjer fajla s konfiguracijom za treniranje

Treba promjeniti putanje

model_path da pokazuje do py fajla s definicijom modela (primjer za takve dvije defincije su model.py i model2.py)

dataset_dir - da pokazuje do foldera s prethodno pripremljenim tfrecordsima (koji sadrzi subdirektorije train i val)

treba paziti pri razlicitim rezolucijama da se promjene zastavice img_width i height

ostale zastavice se većinom odnose na treniranje modela to mjenjati prema potrebi.

subsample_factor zastavica bi označavala faktor za koji se rezolucija mape smanji na kraju mreže. Taj faktor će ovisiti o samome modelu koji se trenira, ako model ima tri pooling sloja 2*2 svaki taj sloj će sliku smanjiti za dva puta pa će ukupno smanjnjenje biti za faktor osam

train.py - skripta koja pokreće skriptu treniranja, nakon svake epohe model se evaluira na skupu za validaciju.

efficient neural audio synthesis in the waveform domain

neural waveshaping synthesis real-time neural audio synthesis in the waveform domain paper • website • colab • audio by Ben Hayes, Charalampos Saitis,

Ben Hayes 169 Dec 23, 2022
In Search of Probeable Generalization Measures

In Search of Probeable Generalization Measures Exciting News! In Search of Probeable Generalization Measures has been accepted to the International Co

Mahdi S. Hosseini 6 Sep 11, 2022
Safe Local Motion Planning with Self-Supervised Freespace Forecasting, CVPR 2021

Safe Local Motion Planning with Self-Supervised Freespace Forecasting By Peiyun Hu, Aaron Huang, John Dolan, David Held, and Deva Ramanan Citing us Yo

Peiyun Hu 90 Dec 01, 2022
Unofficial & improved implementation of NeRF--: Neural Radiance Fields Without Known Camera Parameters

[Unofficial code-base] NeRF--: Neural Radiance Fields Without Known Camera Parameters [ Project | Paper | Official code base ] ⬅️ Thanks the original

Jianfei Guo 239 Dec 22, 2022
Open source person re-identification library in python

Open-ReID Open-ReID is a lightweight library of person re-identification for research purpose. It aims to provide a uniform interface for different da

Tong Xiao 1.3k Jan 01, 2023
LETR: Line Segment Detection Using Transformers without Edges

LETR: Line Segment Detection Using Transformers without Edges Introduction This repository contains the official code and pretrained models for Line S

mlpc-ucsd 157 Jan 06, 2023
Agent-based model simulator for air quality and pandemic risk assessment in architectural spaces

Agent-based model simulation for air quality and pandemic risk assessment in architectural spaces. User Guide archABM is a fast and open source agent-

Vicomtech 10 Dec 05, 2022
CurriculumNet: Weakly Supervised Learning from Large-Scale Web Images

CurriculumNet Introduction This repo contains related code and models from the ECCV 2018 CurriculumNet paper. CurriculumNet is a new training strategy

156 Jul 04, 2022
GPOEO is a micro-intrusive GPU online energy optimization framework for iterative applications

GPOEO GPOEO is a micro-intrusive GPU online energy optimization framework for iterative applications. We also implement ODPP [1] as a comparison. [1]

瑞雪轻飏 8 Sep 10, 2022
Example how to deploy deep learning model with aiohttp.

aiohttp-demos Demos for aiohttp project. Contents Imagetagger Deep Learning Image Classifier URL shortener Toxic Comments Classifier Moderator Slack B

aio-libs 661 Jan 04, 2023
Clean and readable code for Decision Transformer: Reinforcement Learning via Sequence Modeling

Minimal implementation of Decision Transformer: Reinforcement Learning via Sequence Modeling in PyTorch for mujoco control tasks in OpenAI gym

Nikhil Barhate 104 Jan 06, 2023
Codes for ACL-IJCNLP 2021 Paper "Zero-shot Fact Verification by Claim Generation"

Zero-shot-Fact-Verification-by-Claim-Generation This repository contains code and models for the paper: Zero-shot Fact Verification by Claim Generatio

Liangming Pan 47 Jan 01, 2023
Data and codes for ACL 2021 paper: Towards Emotional Support Dialog Systems

Emotional-Support-Conversation Copyright © 2021 CoAI Group, Tsinghua University. All rights reserved. Data and codes are for academic research use onl

126 Dec 21, 2022
Rethinking Transformer-based Set Prediction for Object Detection

Rethinking Transformer-based Set Prediction for Object Detection Here are the code for the ICCV paper. The code is adapted from Detectron2 and AdelaiD

Zhiqing Sun 62 Dec 03, 2022
Image Segmentation and Object Detection in Pytorch

Image Segmentation and Object Detection in Pytorch Pytorch-Segmentation-Detection is a library for image segmentation and object detection with report

Daniil Pakhomov 732 Dec 10, 2022
AttGAN: Facial Attribute Editing by Only Changing What You Want (IEEE TIP 2019)

News 11 Jan 2020: We clean up the code to make it more readable! The old version is here: v1. AttGAN TIP Nov. 2019, arXiv Nov. 2017 TensorFlow impleme

Zhenliang He 568 Dec 14, 2022
Traductor de lengua de señas al español basado en Python con Opencv y MedaiPipe

Traductor de señas Traductor de lengua de señas al español basado en Python con Opencv y MedaiPipe Requerimientos 🔧 Python 3.8 o inferior para evitar

Jahaziel Hernandez Hoyos 3 Nov 12, 2022
An 16kHz implementation of HiFi-GAN for soft-vc.

HiFi-GAN An 16kHz implementation of HiFi-GAN for soft-vc. Relevant links: Official HiFi-GAN repo HiFi-GAN paper Soft-VC repo Soft-VC paper Example Usa

Benjamin van Niekerk 42 Dec 27, 2022
Pgn2tex - Scripts to convert pgn files to latex document. Useful to build books or pdf from pgn studies

Pgn2Latex (WIP) A simple script to make pdf from pgn files and studies. It's sti

12 Jul 23, 2022
Mask-invariant Face Recognition through Template-level Knowledge Distillation

Mask-invariant Face Recognition through Template-level Knowledge Distillation This is the official repository of "Mask-invariant Face Recognition thro

Fadi Boutros 35 Dec 06, 2022