GitHub - microsoft/X-Decoder at seginw (original) (raw)

pip3 install torch==1.13.1 torchvision==0.14.1 --extra-index-url https://download.pytorch.org/whl/cu113 python -m pip install 'git+https://github.com/MaureenZOU/detectron2-xyz.git' pip install git+https://github.com/cocodataset/panopticapi.git python -m pip install -r requirements.txt

sometime this script will report an error when download the zip, please simply copy the command (This is cuased by bad character embedding)

sh install_cococapeval.sh

.
└── seginw/
    ├── Airplane-Parts/
    │   ├── train/
    │   │   ├── *.jpg
    │   │   └── _annotations_min1cat.coco.json
    │   ├── train_10shot/
    │   │   └── ...
    │   └── valid/
    │       └── ...
    ├── Bottles/
    │   └── ...
    └── ...

Note: Due to zero-padding, filling a single gpu with multiple images may decrease the performance.

.
├── ade.json/
│   └── {"ADE150-mIoU": "x", "ADE150-PQ": "x", "ADE150-mAP": "x", "ADE847-mIoU": "x"} 
├── seginw_Airplane-Parts_val.json/
│   └── coco format
├── seginw_Bottles_val.json
├── seginw_Brain-Tumor_val.json
├── seginw_Chicken_val.json
└── ...
@article{zou2022xdecoder,
  author      = {Zou, Xueyan and Dou, Zi-Yi and Yang, Jianwei and Gan, Zhe and Li, Linjie and Li, Chunyuan and Dai, Xiyang and Wang, Jianfeng and Yuan, Lu and Peng, Nanyun and Wang, Lijuan and Lee, Yong Jae and Gao, Jianfeng},
  title       = {Generalized Decoding for Pixel, Image and Language},
  publisher   = {arXiv},
  year        = {2022},
}