s-gupta/fast-rcnn: Fast R-CNN (original)

Saurabh Gupta, Judy Hoffman, Jitendra Malik

This codebase allows use of RGB-D object detection models from this arXiv tech report.

License

This code base is built on Fast R-CNN. License for Fast R-CNN can be found in LICENSE_fast_rcnn.

Citing

If you find this code base and models useful in your research, please consider citing an appropriate subset of the following papers:

@article{gupta2015cross,
  title={Cross Modal Distillation for Supervision Transfer},
  author={Gupta, Saurabh and Hoffman, Judy and Malik, Jitendra},
  journal={arXiv preprint arXiv:1507.00448},
  year={2015}
}

@incollection{gupta2014learning,
  title={Learning rich features from RGB-D images for object detection and segmentation},
  author={Gupta, Saurabh and Girshick, Ross and Arbel{\'a}ez, Pablo and Malik, Jitendra},
  booktitle={Computer Vision--ECCV 2014},
  pages={345--360},
  year={2014},
  publisher={Springer}
}

@article{girshick15fastrcnn,
    Author = {Ross Girshick},
    Title = {Fast R-CNN},
    Journal = {arXiv preprint arXiv:1504.08083},
    Year = {2015}
}

Requirements: software
Requirements: hardware
Basic installation

Requirements: software

Requirements for Caffe and pycaffe (see: Caffe installation instructions)

Note: Caffe must be built with support for Python layers!

In your Makefile.config, make sure this line is uncommented:

WITH_PYTHON_LAYER := 1
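Before starting a long build, it can help to confirm the flag is actually active. The following is a minimal sketch, assuming Makefile.config uses `#` for comments; the helper name `python_layer_enabled` is hypothetical and is not part of Caffe or this repository.

```python
def python_layer_enabled(makefile_text):
    """Return True if an uncommented 'WITH_PYTHON_LAYER := 1' line is present."""
    for line in makefile_text.splitlines():
        # Drop anything after '#', so commented-out settings are ignored.
        setting = line.split("#", 1)[0]
        # Remove all whitespace so 'WITH_PYTHON_LAYER:=1' and
        # 'WITH_PYTHON_LAYER := 1' are treated the same way.
        if "".join(setting.split()) == "WITH_PYTHON_LAYER:=1":
            return True
    return False
```

To use it, read the contents of your Makefile.config into a string and pass that string to the function.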

Python packages you might not have: cython, python-opencv, easydict
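A quick way to see which of these packages are missing is to try importing them. This is only a sketch: the import names (`Cython`, `cv2`, `easydict`) are the usual ones for these packages but are assumptions here, and `missing_packages` is a hypothetical helper, not something shipped with this code base.

```python
import importlib

# Maps the package name listed above to the module name it is usually
# imported as (assumed, not taken from this repository).
REQUIRED_MODULES = {
    "cython": "Cython",
    "python-opencv": "cv2",
    "easydict": "easydict",
}

def missing_packages(required=REQUIRED_MODULES):
    """Return the package names whose module cannot be imported."""
    missing = []
    for package, module in required.items():
        try:
            importlib.import_module(module)
        except ImportError:
            missing.append(package)
    return missing
```

Calling `missing_packages()` returns an empty list when all three packages can be imported.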

Requirements: hardware

For training smaller networks (CaffeNet, VGG_CNN_M_1024), a good GPU (e.g., Titan, K20, K40, ...) with at least 3G of memory suffices.
For training with VGG16, you'll need a K40 (~11G of memory).
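The memory figures above can be turned into a rough lookup. This sketch only restates the approximate numbers from the list; actual memory use depends on batch size and Caffe configuration, and `trainable_networks` is a hypothetical helper.

```python
# Approximate minimum GPU memory (in GB) per network, copied from the
# hardware notes above. These are rough guidelines, not measured limits.
MIN_GPU_MEMORY_GB = {
    "CaffeNet": 3,
    "VGG_CNN_M_1024": 3,
    "VGG16": 11,
}

def trainable_networks(gpu_memory_gb):
    """Return the networks whose approximate memory need fits the GPU."""
    return sorted(
        name
        for name, needed in MIN_GPU_MEMORY_GB.items()
        if gpu_memory_gb >= needed
    )
```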

Installation (sufficient for the demo)

Clone the repository

Clone the python code

git clone https://github.com/s-gupta/fast-rcnn.git

We'll call the directory that you cloned Fast R-CNN into FRCN_ROOT.

Clone Caffe with roi_pooling_layers:

cd $FRCN_ROOT
git clone https://github.com/rbgirshick/caffe-fast-rcnn.git caffe-fast-rcnn
cd caffe-fast-rcnn

caffe-fast-rcnn needs to be on the fast-rcnn branch (or equivalent detached state).

git checkout fast-rcnn

Build the Cython modules

Build Caffe and pycaffe

cd $FRCN_ROOT/caffe-fast-rcnn

Now follow the Caffe installation instructions here:

http://caffe.berkeleyvision.org/installation.html

If you're experienced with Caffe and have all of the requirements installed and your Makefile.config in place, then simply run the make command that follows. Make sure Caffe is built with Python layers.

make -j8 && make pycaffe
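Once the build finishes, one way to confirm that pycaffe is usable is to put the build's `python` directory on `sys.path` and try importing `caffe`. The sketch below assumes Caffe was cloned into `caffe-fast-rcnn` under FRCN_ROOT as described above; both function names are hypothetical.

```python
import os
import sys

def add_pycaffe_to_path(frcn_root):
    """Prepend FRCN_ROOT/caffe-fast-rcnn/python to sys.path and return it."""
    caffe_python = os.path.join(frcn_root, "caffe-fast-rcnn", "python")
    if caffe_python not in sys.path:
        sys.path.insert(0, caffe_python)
    return caffe_python

def pycaffe_importable(frcn_root):
    """Return True if 'import caffe' succeeds after adjusting sys.path."""
    add_pycaffe_to_path(frcn_root)
    try:
        import caffe  # noqa: F401 (imported only to test availability)
    except ImportError:
        return False
    return True
```

If `pycaffe_importable` returns False, the most likely causes are that `make pycaffe` did not complete or that a Python dependency of Caffe is missing.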

Download models and data

Download the NYUD2 data

cd $FRCN_ROOT
./data/scripts/fetch_nyud2_data.sh

Download the NYUD2 MCG boxes

cd $FRCN_ROOT
./data/scripts/fetch_nyud2_mcg_boxes.sh

Download the ImageNet and Supervision Transfer Models

cd $FRCN_ROOT
./data/scripts/fetch_init_models.sh

Fetch NYUD2 Object Detector Models.

cd $FRCN_ROOT
./output/scripts/fetch_nyud2_detectors.sh
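The four download steps above can also be run in sequence from a small driver. This is a sketch, assuming the scripts are executable and are meant to be run from FRCN_ROOT as shown; `fetch_all` is a hypothetical helper, and the `run` parameter exists only so the call can be replaced for a dry run.

```python
import os
import subprocess

# Script paths relative to FRCN_ROOT, in the order listed above.
FETCH_SCRIPTS = [
    os.path.join("data", "scripts", "fetch_nyud2_data.sh"),
    os.path.join("data", "scripts", "fetch_nyud2_mcg_boxes.sh"),
    os.path.join("data", "scripts", "fetch_init_models.sh"),
    os.path.join("output", "scripts", "fetch_nyud2_detectors.sh"),
]

def fetch_all(frcn_root, run=subprocess.check_call):
    """Run each fetch script with FRCN_ROOT as the working directory.

    subprocess.check_call raises CalledProcessError if a script exits with
    a non-zero status, so a failed download stops the sequence.
    """
    executed = []
    for script in FETCH_SCRIPTS:
        script_path = os.path.join(frcn_root, script)
        run([script_path], cwd=frcn_root)
        executed.append(script_path)
    return executed
```

For a dry run, pass a `run` function that only prints the command it receives instead of executing it.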

Usage

See experiments/test_pretrained_models.sh for how to use the pretrained models, and experiments/train_models.sh for how to train your own models.