A PyTorch Implementation of Single Shot Scale-invariant Face Detector.

Last update: Jan 07, 2023

Related tags

Overview

S³FD: Single Shot Scale-invariant Face Detector

A PyTorch Implementation of Single Shot Scale-invariant Face Detector.

Eval

python wider_eval_pytorch.py

cd eval/eval_tools_old-version
octave wider_eval_pytorch.m

Model

s3fd_convert.7z

Test

python test.py --model data/s3fd_convert.pth --path data/test01.jpg

References

SFD

Comments

RGB <-> BGR

From this line, I assume you use RGB: img = img - np.array([104,117,123])

However opencv uses BGR, so this line returns BGR: if args.path=='CAMERA': ret, img = cap.read()

Then BGR is fed to the network bboxlist = detect(net,img)

I fed RGB to the network and got worse results. Is it possible that you meant RGB in all places but the network is actually trained for BGR? (If then it should be img = img - np.array([123,117,104]))

opened by elbaro 3
How Convert Weights

Dear @clcarwin, Thank you for your nice work. Would you please tell me how you can convert Caffe weights and model of S3FD into PyTorch? Can you convert the model & pre-trained weights of RefineDet into PyTorch?

opened by ahkarami 2
evaluation accuracy is not good as the original paper

hi @clcarwin,

I test you evaluation results on wider face as (easy 92.8, medium 91.5, hard 84.2). But with the original model provided by sfzhang15/SFD, I can get (easy 93.8, medium 92.4, hard 85.1).

Did I test correctly? If so, why there is accuracy loss?

Great work! Best,

opened by marvis 2
'float' object cannot be interpreted as an integer??

Sir,I'm sorry to disturb you about this object. I run this object on windows 10,python 3.5.2 ,pytorch 0.3. After : python test.py --model data/s3fd_convert.pth --path data/test01.jpg, the screen display: D:\Python\Pytorch_cw_sfd\SFD_pytorch>python test.py --model data/s3fd_convert.pth --path data/test01.jpg Traceback (most recent call last): File "test.py", line 71, in bboxlist = detect(net,img) File "test.py", line 27, in detect for i in range(len(olist)/2): olist[i2] = F.softmax(olist[i2]) TypeError: 'float' object cannot be interpreted as an integer

Why ???

opened by door5719 1
padding size of fc6

Hi @clcarwin,

Why do you set the padding size of fc6 to 3? This is inconsistent with the original paper. See https://github.com/clcarwin/SFD_pytorch/blob/master/net_s3fd.py#L42

Best,

opened by marvis 1
Optimization

Good: It is accurate.

Bad: The inference time is more than 80 ms for realtime usage. To make it work for realtime image has to be resized to less than 200x200 which reduces accuracy.

So in order to make it usable the only way is to make it faster. Have you tried using TensorRT or TVM or Pytorch serving in C++ ?

opened by jamessmith90 0
Several speed & code updates

Seems nobody's looking at PR's here, but letting others know I've made a number of improvements.

It runs smoothly on modern pytorch (1.3) and refactored the code to eliminate redundant code. I also added some convenient methods that make it easier to do common things, like detect_faces. Also, added integration tests.

I independently found the same speed-up as @kir-dan in https://github.com/clcarwin/SFD_pytorch/pull/4 and moved all that code into pytorch instead of numpy, so it can be fully run on GPU.

opened by leopd 0
Very high GPU memory usage

Hi, I have been running the model using test.py and modified it run multiple files. The GPU memory keeps on increasing,from 3gigs to 9 gigs. Is this due to poor garbage collection?

opened by vaishnavm217 2
Change Anchor Boxes Aspect Ratio

Dear @clcarwin, If one wants to change the aspect ratio of anchor boxes, must just changed the detect method in test.py? For example, line https://github.com/clcarwin/SFD_pytorch/blob/96fdfbe22eef176a04802d915834b82a131a854d/test.py#L39 or other methods moreover must changed?

opened by ahkarami 0
About data augmentation

When I use the Tensorflow to build the project, I have some trouble in data augmentation which describe in the paper. Can you tell the details of the data augmentation or show your data augmentation code to me. Thank you

opened by ckqsars 0

Releases(v0.1)

v0.1(Nov 21, 2017)

Source code(tar.gz)
Source code(zip)
s3fd_convert.7z(8.14 MB)

Owner

carwin

GitHub Repository

Automated Attendance Project Using Face Recognition

dependencies for project: cmake 3.22.1 dlib 19.22.1 face-recognition 1.3.0 openc

1 Jan 09, 2022

Swin-Transformer is basically a hierarchical Transformer whose representation is computed with shifted windows.

Swin-Transformer Swin-Transformer is basically a hierarchical Transformer whose representation is computed with shifted windows. For more details, ple

9 Mar 14, 2022

zeus is a Python implementation of the Ensemble Slice Sampling method.

zeus is a Python implementation of the Ensemble Slice Sampling method. Fast & Robust Bayesian Inference, Efficient Markov Chain Monte Carlo (MCMC), Bl

197 Dec 04, 2022

Gauge equivariant mesh cnn

Geometric Mesh CNN The code in this repository is an implementation of the Gauge Equivariant Mesh CNN introduced in the paper Gauge Equivariant Mesh C

50 Dec 18, 2022

Run Effective Large Batch Contrastive Learning on Limited Memory GPU

Gradient Cache Gradient Cache is a simple technique for unlimitedly scaling contrastive learning batch far beyond GPU memory constraint. This means tr

198 Dec 29, 2022

Syntax-Aware Action Targeting for Video Captioning

Syntax-Aware Action Targeting for Video Captioning Code for SAAT from "Syntax-Aware Action Targeting for Video Captioning" (Accepted to CVPR 2020). Th

59 Oct 13, 2022

Official PyTorch Implementation of Learning Self-Similarity in Space and Time as Generalized Motion for Video Action Recognition, ICCV 2021

26 Dec 07, 2022

An example showing how to use jax to train resnet50 on multi-node multi-GPU

jax-multi-gpu-resnet50-example This repo shows how to use jax for multi-node multi-GPU training. The example is adapted from the resnet50 example in d

20 Jul 04, 2022

StyleTransfer - Open source style transfer project, based on VGG19

9 Dec 13, 2021

TAPEX: Table Pre-training via Learning a Neural SQL Executor

TAPEX: Table Pre-training via Learning a Neural SQL Executor The official repository which contains the code and pre-trained models for our paper TAPE

157 Dec 28, 2022

Code and results accompanying our paper titled Mixture Proportion Estimation and PU Learning: A Modern Approach at Neurips 2021 (Spotlight)

Mixture Proportion Estimation and PU Learning: A Modern Approach This repository is the official implementation of Mixture Proportion Estimation and P

23 Dec 28, 2022

A PyTorch Implementation of Single Shot Scale-invariant Face Detector.

Related tags

Overview

S³FD: Single Shot Scale-invariant Face Detector

Eval

Model

Test

References

Comments

Releases(v0.1)

v0.1(Nov 21, 2017)

Owner

carwin

Automated Attendance Project Using Face Recognition

Swin-Transformer is basically a hierarchical Transformer whose representation is computed with shifted windows.

zeus is a Python implementation of the Ensemble Slice Sampling method.

Gauge equivariant mesh cnn

Run Effective Large Batch Contrastive Learning on Limited Memory GPU

Syntax-Aware Action Targeting for Video Captioning

Official PyTorch Implementation of Learning Self-Similarity in Space and Time as Generalized Motion for Video Action Recognition, ICCV 2021

An example showing how to use jax to train resnet50 on multi-node multi-GPU

StyleTransfer - Open source style transfer project, based on VGG19

TAPEX: Table Pre-training via Learning a Neural SQL Executor

Code and results accompanying our paper titled Mixture Proportion Estimation and PU Learning: A Modern Approach at Neurips 2021 (Spotlight)

Learning from graph data using Keras

Public repo for the ICCV2021-CVAMD paper "Is it Time to Replace CNNs with Transformers for Medical Images?"

Cognate Detection Repository

Bolt Online Learning Toolbox

Posterior predictive distributions quantify uncertainties ignored by point estimates.

DECAF: Deep Extreme Classification with Label Features

Exadel CompreFace is a free and open-source face recognition GitHub project

Generative Handwriting using LSTM Mixture Density Network with TensorFlow

JDet is Object Detection Framework based on Jittor.