Ishan Misra (original) (raw)

Director, Research Scientist @ GenAI (Meta)

I work on computer vision and machine learning research specifically in generative AI and self-supervised learning. I am a Director, Research Scientist in the GenAI group at Meta where I lead the research efforts on video generation models. I was the tech lead for Meta's Movie Gen project for foundation models in video generation, video editing, video personalization, and audio generation.

Previously, I was part of the FAIR team at Meta where I worked on self-supervised learning in computer vision and multimodal learning.

News

2025 March

2024 October

Research on Movie Gen series of foundation media models announced (played role of Tech Lead for the full project). Covered in NY Times, Financial Times, Forbes.

2024 October

Giving four talks at ECCV 2024 Workshops and Tutorials on Generative Video Models

2024 September

2024 July

Mark Zuckerberg announces the release of Llama3 (with our efforts on video recognition).

2024 July

2024 March

2024 June

4 papers accepted at CVPR

2024 June

Emu Video now powers "animate" on meta.ai that converts images to videos!

2024 June

2023 Nov

2023 May

Mark Zuckerberg announced our recent foundational multimodal model ImageBind

2023 April

Mark Zuckerberg announced our recent foundational self-supervised model DINO-v2

2022 April

2021 March

Publications

Mainly publish on video and image recognition, video and image generation, object detection/segmentation, multimodal learning, and self-supervised learning.

Movie Gen: A Cast of Media Foundation Models

The Movie Gen Team (Overall Tech Lead; Core Contributor)

Meta Research 2024

PDF Blog

Generative AI Video Generation Foundation Models

The Llama 3 Herd of Models

The Llama3 Team (played role of a Core Contributor for video recognition)

arxiv 2024

PDF Code

Generative AI LLM Foundation Models Image Recognition Video Recognition

Emu Video: Factorizing Text-to-Video Generation by Explicit Image Conditioning

Rohit Girdhar^* , Mannat Singh^* , Andrew Brown* , Quentin Duval* , Samaneh Azadi* , Sai Saketh Rambhatla, Akbar Shah, Xi Yin, Devi Parikh, Ishan Misra*

ECCV 2024

PDF BibTeX Powers Meta's /animate and Emu Reels products *Authors contributed equally

Generative AI Diffusion Models Video Generation Foundation Models

@inproceedings{emuvideo2023, title={Emu Video: Factorizing Text-to-Video Generation by Explicit Image Conditioning}, author={Rohit Girdhar and Mannat Singh and Andrew Brown and Quentin Duval and Samaneh Azadi and Sai Saketh Rambhatla and Akbar Shah and Xi Yin and Devi Parikh and Ishan Misra}, inproceedings={ECCV}, year={2024}, }

FlowVid: Taming Imperfect Optical Flows for Consistent Video-to-Video Synthesis

Feng Liang, Bichen Wu, Jialiang Wang, Licheng Yu, Kunpeng Li, Yinan Zhao, Ishan Misra, Jia-Bin Huang, Peizhao Zhang, Peter Vajda, Diana Marculescu

CVPR 2024

PDF BibTeX Highlight

Generative AI Diffusion Models Video Generation

@inproceedings{liang2024flowvid, title={FlowVid: Taming Imperfect Optical Flows for Consistent Video-to-Video Synthesis}, author={Feng Liang and Bichen Wuand Jialiang Wang and Licheng Yu and Kunpeng Li and Yinan Zhao and Ishan Misra and Jia-Bin Huang and Peizhao Zhang and Peter Vajda and Diana Marculescu}, booktitle={CVPR}, year={2024}, }

InstanceDiffusion: Instance-level Control for Image Generation

Xudong Wang, Trevor Darrell, Sai Saketh Rambhatla, Rohit Girdhar, Ishan Misra

CVPR 2024

@inproceedings{wang2024instance, title={InstanceDiffusion: Instance-level Control for Image Generation}, author={Xudong Wang and Trevor Darrell and Sai Saketh Rambhatla and Rohit Girdhar and Ishan Misra}, booktitle={CVPR}, year={2024}, }

Generating Illustrated Instructions

Sachit Menon, Ishan Misra, Rohit Girdhar

CVPR 2024

PDF BibTeX

Generative AI Diffusion Models LLM

@inproceedings{menon2024illustrated, title={Generating Illustrated Instructions}, author={Sachit Menon and Ishan Misra and Rohit Girdhar}, booktitle={CVPR}, year={2024}, }

VideoCutLER: Surprisingly Simple Unsupervised Video Instance Segmentation

Xudong Wang, Ishan Misra, Ziyun Zheng, Rohit Girdhar, Trevor Darrell

CVPR 2024

PDF BibTeX

Self-Supervised Learning Video Recognition Object Discovery

@inproceedings{wang2024vcutler, title={VideoCutLER: Surprisingly Simple Unsupervised Video Instance Segmentation}, author={Xudong Wang and Ishan Misra and Ziyun Zheng and Rohit Girdhar and Trevor Darrell}, booktitle={CVPR}, year={2024}, }

The effectiveness of MAE pre-pretraining for billion-scale pretraining

Mannat Singh* , Quentin Duval* , Kalyan Vasudev Alwala* , Haoqi Fan, Vaibhav Aggarwal, Aaron Adcock, Armand Joulin, Piotr Dollár, Christoph Feichtenhofer, Ross Girshick, Rohit Girdhar, Ishan Misra

ICCV 2023

PDF Code

Colab

BibTeX *Authors contributed equally

Self-Supervised Learning Weakly-Supervised Learning Large Scale Foundation Models

@inproceedings{singh2023effectiveness, title={The effectiveness of MAE pre-pretraining for billion-scale pretraining}, author={Singh, Mannat and Duval, Quentin and Alwala, Kalyan Vasudev and Fan, Haoqi and Aggarwal, Vaibhav and Adcock, Aaron and Joulin, Armand and Doll{'a}r, Piotr and Feichtenhofer, Christoph and Girshick, Ross and Girdhar, Rohit and Misra, Ishan}, booktitle={ICCV}, year={2023}, }

MOST: Multiple Object localization with Self-supervised Transformers for object discovery.

Sai Saketh Rambhatla, Ishan Misra, Rama Chellappa, Abhinav Shrivastava

ICCV 2023

@inproceedings{rambhatla2023most, title={MOST: Multiple Object localization with Self-supervised Transformers for object discovery}, author={Sai Saketh Rambhatla and Ishan Misra and Rama Chellappa and Abhinav Shrivastava}, booktitle={ICCV}, year={2023}, }

MonoNeRF: Learning Generalizable NeRFs from Monocular Videos without Camera Poses

Yang Fu, Ishan Misra, Xiaolong Wang

ICML 2023

@inproceedings{fu2023mononerf, title={MonoNeRF: Learning Generalizable NeRFs from Monocular Videos without Camera Poses}, author={Yang Fu and Ishan Misra and Xiaolong Wang}, booktitle={ICML}, year={2023}, }

ImageBind: One Embedding Space To Bind Them All

Rohit Girdhar* , Alaaeldin El-Nouby* , Zhuang Liu, Mannat Singh, Kalyan Vasudev Alwala, Armand Joulin, Ishan Misra*

CVPR 2023

Demo PDF Code Demo BibTeX Highlighted paper *Authors contributed equally

Multimodal Learning Self-Supervised Learning Foundation Models

@inproceedings{girdhar2023imagebind, title={ImageBind: One Embedding Space To Bind Them All}, author={Girdhar, Rohit and El-Nouby, Alaaeldin and Liu, Zhuang and Singh, Mannat and Alwala, Kalyan Vasudev and Joulin, Armand and Misra, Ishan}, booktitle={CVPR}, year={2023}, }

Cut and Learn for Unsupervised Object Detection and Instance Segmentation

Xudong Wang, Rohit Girdhar, Stella X. Yu, Ishan Misra

CVPR 2023

@inproceedings{wang2023cut, title={Cut and Learn for Unsupervised Object Detection and Instance Segmentation}, author={Wang, Xudong and Girdhar, Rohit and Yu, Stella X and Misra, Ishan}, booktitle={CVPR}, year={2023}, }

Learning Video Representations from Large Language Models

Yue Zhao, Ishan Misra, Philipp Krahenbuhl, Rohit Girdhar

CVPR 2023

@inproceedings{zhao2022lavila, title={Learning Video Representations from Large Language Models}, author={Zhao, Yue and Misra, Ishan and Kr{"a}henb{"u}hl, Philipp and Girdhar, Rohit}, booktitle=CVPR, year={2023}, }

The Hidden Uniform Cluster Prior in Self-Supervised Learning

Mahmoud Assran, Randall Balestriero, Quentin Duval, Florian Bordes, Ishan Misra, Piotr Bojanowski, Pascal Vincent, Michael Rabbat, Nicolas Ballas

ICLR 2023

PDF BibTeX

Self-supervised Learning Representation Learning

OmniMAE: Single Model Masked Pretraining on Images and Videos

Rohit Girdhar* , Alaaeldin El-Nouby* , Mannat Singh* , Kalyan Vasudev Alwala* , Armand Joulin, Ishan Misra*

CVPR 2023

PDF Code BibTeX *Authors contributed equally

Self-supervised Learning Representation Learning Video Recognition

@inproceedings{girdhar2022omnimae, title={OmniMAE: Single Model Masked Pretraining on Images and Videos}, author={Girdhar, Rohit and El-Nouby, Alaaeldin and Singh, Mannat and Alwala, Kalyan Vasudev and Joulin, Armand and Misra, Ishan}, booktitle={CVPR}, year={2023}, }

Masked Siamese Networks for Label-Efficient Learning

Mahmoud Assran, Mathilde Caron, Ishan Misra, Piotr Bojanowski, Florian Bordes, Pascal Vincent, Armand Joulin, Michael Rabbat, Nicolas Ballas

ECCV 2022

PDF Code BibTeX

Self-supervised Learning Representation Learning Image Recognition

@inproceedings{assran2022masked, title={Masked Siamese Networks for Label-Efficient Learning}, author={Assran, Mahmoud, and Caron, Mathilde, and Misra, Ishan, and Bojanowski, Piotr, and Bordes, Florian and Vincent, Pascal, and Joulin, Armand, and Rabbat, Michael, and Ballas, Nicolas}, booktitle={ECCV}, year={2022}, }

Detecting Twenty-thousand Classes using Image-level Supervision

Xingyi Zhou, Rohit Girdhar, Armand Joulin, Phillip Krahenbuhl, Ishan Misra

ECCV 2022

PDF Code BibTeX

Object Detection Open World Recognition Instance Recognition

@inproceedings{zhou2021detecting, title={Detecting Twenty-thousand Classes using Image-level Supervision}, author={Zhou, Xingyi and Girdhar, Rohit and Joulin, Armand and Kr{"a}henb{"u}hl, Philipp and Misra, Ishan}, booktitle={ECCV}, year={2022}, }

Vision Models Are More Robust And Fair When Pretrained On Uncurated Images Without Supervision

Priya Goyal, Quentin Duval, Isaac Seessel, Mathilde Caron, Ishan Misra, Levent Sagun, Armand Joulin, Piotr Bojanowski

Arxiv 2022

PDF

Self-supervised Learning Image Recognition Foundation Models

Omnivore: A Single Model for Many Visual Modalities

Rohit Girdhar* , Mannat Singh* , Nikhila Ravi* , Laurens van der Maaten, Armand Joulin, Ishan Misra*

CVPR 2022

PDF Code BibTeX Oral *Authors contributed equally

Self-supervised Learning Image Recognition Video Recognition Multimodal Learning Foundation Models

@inproceedings{girdhar2022omnivore, title={{Omnivore: A Single Model for Many Visual Modalities}}, author={Girdhar, Rohit and Singh, Mannat and Ravi, Nikhila and van der Maaten, Laurens and Joulin, Armand and Misra, Ishan}, booktitle={CVPR}, year={2022}, }

Masked-attention Mask Transformer for Universal Image Segmentation

Bowen Cheng, Ishan Misra, Alexander G. Schwing, Alexander Kirillov, Rohit Girdhar

CVPR 2022

PDF Code BibTeX

Semantic Segmentation Panoptic Segmentation Instance Recognition

@inproceedings{cheng2021mask2former, title={Masked-attention Mask Transformer for Universal Image Segmentation}, author={Bowen Cheng and Ishan Misra and Alexander G. Schwing and Alexander Kirillov and Rohit Girdhar}, booktitle={CVPR}, year={2022}, }

An End-to-End Transformer Model for 3D Object Detection

Ishan Misra, Rohit Girdhar, Armand Joulin

ICCV 2021

@inproceedings{misra2021-3detr, title={{An End-to-End Transformer Model for 3D Object Detection}}, author={Misra, Ishan and Girdhar, Rohit and Joulin, Armand}, booktitle={{ICCV}}, year={2021}, }

Emerging Properties in Self-Supervised Vision Transformers

Mathilde Caron, Hugo Touvron, Ishan Misra, Hervé Jégou, Julien Mairal, Piotr Bojanowski, Armand Joulin

ICCV 2021

PDF Code

Self-supervised Learning Image Recognition Foundation Models

Self-Supervised Pretraining of 3D Features on any Point-Cloud

Zaiwei Zhang, Rohit Girdhar, Armand Joulin, Ishan Misra

ICCV 2021

PDF Code BibTeX

Self-supervised Learning Image Recognition Foundation Models Representation Learning

@inproceedings{zhang_depth_contrast, title={Self-Supervised Pretraining of 3D Features on any Point-Cloud}, author={Zhang, Zaiwei and Girdhar, Rohit and Joulin, Armand and Misra, Ishan}, journal={arXiv preprint arXiv:2101.02691}, year={2021}, }

MDETR : Modulated Detection for End-to-End Multi-Modal Understanding

Aishwarya Kamath, Mannat Singh, Yann LeCun, Ishan Misra, Gabriel Synnaeve, Nicolas Carion

ICCV 2021

PDF Code Oral

Multimodal learning Instance Recognition Foundation Models

Audio-Visual Instance Discrimination with Cross-Modal Agreement

Pedro Morgado, Nuno Vasconcelos, Ishan Misra

CVPR 2021

PDF Code BibTeX Best Paper Candidate

Multimodal learning Self-supervised Learning Audio Recognition

@article{morgado2020avid, title={Audio-Visual Instance Discrimination with Cross-Modal Agreement}, author={Pedro Morgado and Nuno Vasconcelos and Ishan Misra}, year={2020}, journal={https://arxiv.org/abs/2004.12943}, }

Robust Audio-Visual Instance Discrimination

Pedro Morgado, Ishan Misra, Nuno Vasconcelos

CVPR 2021

PDF BibTeX Oral

Multimodal learning Self-supervised Learning Audio Recognition

@ InProceedings{morgado2021_robust_xid, title={Robust Audio-Visual Instance Discrimination}, author={Pedro Morgado, Ishan Misra, Nuno Vasconcelos}, booktitle = {{CVPR}}, year={2021}, }

Barlow Twins: Self-Supervised Learning via Redundancy Reduction

Jure Zbontar* , Li Jing* , Ishan Misra, Yann LeCun, Stéphane Deny

ICML 2021

PDF Code BibTeX *Authors contributed equally

Self-supervised Learning Image Recognition Representation Learning

@inproceedings{zbontar_barlowtwins, title={Barlow Twins: Self-Supervised Learning via Redundancy Reduction}, author={Jure Zbontar, Li Jing, Ishan Misra, Yann LeCun, Stephane Deny}, booktitle={ICML}, year={2021}, }

3D Spatial Recognition without Spatially Labeled 3D

Zhongzheng Ren, Ishan Misra, Alexander G. Schwing, Rohit Girdhar

CVPR 2021

PDF

3D Recognition Instance Recognition

Unsupervised Learning of Visual Features by Contrasting Cluster Assignments

Mathilde Caron, Ishan Misra, Julien Mairal, Priya Goyal, Piotr Bojanowski, Armand Joulin

NeurIPS 2020

PDF Code BibTeX

Self-supervised Learning Image Recognition Representation Learning

@inproceedings{caron2020swav, title={Unsupervised Learning of Visual Features by Contrasting Cluster Assignments}, author={Mathilde Caron, Ishan Misra, Julien Mairal, Priya Goyal, Piotr Bojanowski, Armand Joulin}, year={2020}, booktitle={NeurIPS}, }

Self-Supervised Learning of Pretext-Invariant Representations

Ishan Misra, Laurens van der Maaten

CVPR 2020

PDF Code BibTeX

Self-supervised Learning Image Recognition Representation Learning

@inproceedings{misra2020pirl, title={Self-Supervised Learning of Pretext-Invariant Representations}, author={Misra, Ishan and van der Maaten, Laurens}, booktitle={CVPR}, year={2020}, }

ClusterFit: Improving Generalization of Visual Representations

Xueting Yan* , Ishan Misra* , Abhinav Gupta, Deepti Ghadiyaram* , Dhruv Mahajan*

CVPR 2020

PDF Code BibTeX *Authors contributed equally

Image Recognition Representation Learning

@inproceedings{yan2020cluster, title={{ClusterFit: Improving Generalization of Visual Representations}}, author={Xueting Yan, Ishan Misra, Abhinav Gupta, Deepti Ghadiyaram, Dhruv Mahajan}, booktitle={CVPR}, year={2020}, }

In Defense of Grid Features for Visual Question Answering

Huaizu Jiang, Ishan Misra, Marcus Rohrbach, Erik Learned-Miller, Xinlei Chen

CVPR 2020

PDF Code BibTeX

Image Recognition Multimodal Learning Visual Question Answering

@inproceedings{jiang2020grid, title={In Defense of Grid Features for Visual Question Answering}, author={Huaizu Jiang, Ishan Misra, Marcus Rohrbach, Erik Learned-Miller, Xinlei Chen}, booktitle={CVPR}, year={2020}, }

3D-RelNet: Joint Object and Relational Network for 3D Prediction

Nilesh Kulkarni, Ishan Misra, Shubham Tulsiani, Abhinav Gupta

ICCV 2019

PDF Code BibTeX

3D Recognition Object detection Visual Question Answering

@inproceedings{kulkarni20193drel, title={{3D-RelNet: Joint Object and Relational Network for 3D Prediction}}, author={Nilesh Kulkarni and Ishan Misra and Shubham Tulsiani and Abhinav Gupta}, booktitle={ICCV}, year={2019}, }

Scaling and Benchmarking Self-Supervised Visual Representation Learning

Priya Goyal, Dhruv Mahajan, Abhinav Gupta* , Ishan Misra*

ICCV 2019

PDF Code BibTeX *Authors contributed equally

Self-supervised Learning Image Recognition Representation Learning

@inproceedings{goyal2019self, title={{Scaling and Benchmarking Self-Supervised Visual Representation Learning}}, author={Priya Goyal and Dhruv Mahajan and Abhinav Gupta and Ishan Misra}, booktitle={ICCV}, year={2019}, }

Binary Image Selection (BISON): Interpretable Evaluation of Visual Grounding

Hexiang Hu, Ishan Misra, Laurens van der Maaten

ICCV Workshop on Vision and Language 2019

@article{hexiang2018bison, title={{Binary Image Selection (BISON): Interpretable Evaluation of Visual Grounding}}, author={Hu, Hexiang and Misra, Ishan and van der Maaten, Laurens}, journal={arXiv preprint arXiv:1901.06595}, year={2019}, }

Does Object Recognition Work for Everyone?

Terrance DeVries* , Ishan Misra* , Changhan Wang* , Laurens van der Maaten

CVPR 2019

PDF BibTeX *Authors contributed equally

Fairness Image Recognition

@inproceedings{devries2019fairness, title={{Does Object Recognition Work for Everyone?}}, author={Terrance DeVries and Ishan Misra and Changhan Wang and Laurens van der Maaten}, booktitle={CVPR 2019 Workshop on Computer Vision for Global Challenges}, year={2019}, }

Mainstream: Dynamic Stem-Sharing for Multi-Tenant Video Processing

Angela Jiang, Daniel L.-K. Wong, Christopher Canel, Ishan Misra, Michael Kaminsky, Michael Kozuch, Padmanabhan Pillai, David G. Andersen and Gregory Ganger

USENIX Annual Technical Conference 2018

@inproceedings {jiangmainstream18, title = {Mainstream: Dynamic Stem-Sharing for Multi-Tenant Video Processing}, authors = {Angela Jiang and Daniel L.-K. Wong and Christopher Canel and Ishan Misra and Michael Kaminsky and Michael Kozuch and Padmanabhan Pillai and David G. Andersen and Gregory Ganger}, booktitle = {{USENIX} Annual Technical Conference ({USENIX} {ATC} 18)}, year = {2018}, address = {Boston, MA}, url = {https://www.usenix.org/conference/atc18/presentation/jiang}, publisher = {{USENIX} Association}, }

Learning by Asking Questions

Ishan Misra, Ross Girshick, Rob Fergus, Martial Hebert, Abhinav Gupta, Laurens van der Maaten

CVPR 2018

PDF BibTeX Oral

Multimodal Learning Visual Question Answering

@inproceedings{misra2017lba, Author = {Ishan Misra and Ross Girshick and Rob Fergus and, Martial Hebert and Abhinav Gupta and Laurens van der Maaten}, Title = {{Learning by Asking Questions}}, Booktitle = {{CVPR}}, Year = {2018}, }

Cut Paste and Learn: Surprisingly Easy Synthesis for Instance Detection

Debidatta Dwibedi, Ishan Misra, Martial Hebert

ICCV 2017

@inproceedings{debi2017cutpaste, title={{Cut, Paste and Learn: Surprisingly Easy Synthesis for Instance Detection}}, author={Dwibedi, Debidatta and Misra, Ishan and Hebert, Martial}, booktitle={ICCV}, year={2017}, }

From Red Wine to Red Tomato: Composition with Context

Ishan Misra, Abhinav Gupta, Martial Hebert

CVPR 2017

@inproceedings{misra2017composing, title={{From Red Wine to Red Tomato: Composition with Context}}, author={Misra, Ishan and Gupta, Abhinav and Hebert, Martial}, booktitle={CVPR}, year={2017}, }

Shuffle and Learn: Unsupervised Learning using Temporal Order Verification

Ishan Misra, C. Lawrence Zitnick, Martial Hebert

ECCV 2016

PDF Code BibTeX

Self-supervised Learning Representation Learning Video Recognition

@inproceedings{misra2016unsupervised, title={{Shuffle and Learn: Unsupervised Learning using Temporal Order Verification}}, author={Misra, Ishan and Zitnick, C. Lawrence and Hebert, Martial}, booktitle={ECCV}, year={2016}, }

Seeing through the Human Reporting Bias: Visual Classifiers from Noisy

Ishan Misra, C. Lawrence Zitnick, Margaret Mitchell, Ross Girshick

CVPR 2016

@inproceedings{MisraNoisy16, Author = {Ishan Misra and C. Lawrence Zitnick and Margaret Mitchell and Ross Girshick}, Booktitle = {CVPR}, Title = {{Seeing through the Human Reporting Bias: Visual Classifiers from Noisy Human-Centric Labels}}, Year = {2016}, } ,

Cross-stitch Networks for Multi-Task Learning

Ishan Misra* , Abhinav Shrivastava* , Abhinav Gupta, Martial Hebert

CVPR 2016

PDF BibTeX Spotlight *Authors contributed equally

Multi-task Learning Image Recognition 3D Recognition

@inproceedings{MisraCrossMTL16, Author = {Ishan Misra and Abhinav Shrivastava and Abhinav Gupta and Martial Hebert}, Booktitle = {CVPR}, Title = {{Cross-stitch Networks for Multi-task Learning}}, Year = {2016}, } ,

Generating Natural Questions About an Image

Nasrin Mostafazadeh, Ishan Misra, Jacob Devlin, Margaret Mitchell, Xiaodong He, Lucy Vanderwende

ACL 2016

@article{mostafazadeh2016generating, title={Generating Natural Questions About an Image}, author={Mostafazadeh, Nasrin and Misra, Ishan and Devlin, Jacob and Mitchell, Margaret and He, Xiaodong and Vanderwende, Lucy}, journal={arXiv preprint arXiv:1603.06059}, year={2016}, } ,

Visual Storytelling

Ting-Hao Huang, Francis Ferraro, Nasrin Mostafazadeh, Ishan Misra, Jacob Devlin, Aishwarya Agrawal, Ross Girshick, Xiaodong He, Pushmeet Kohli, et al.

NAACL 2016

@article{ferraro2016visual, title={Visual storytelling}, author={Ferraro, Francis and Mostafazadeh, Nasrin and Misra, Ishan and Agrawal, Aishwarya and Devlin, Jacob and Girshick, Ross and He, Xiaodong and Kohli, Pushmeet and Batra, Dhruv and Zitnick, C Lawrence and Parikh, Devi and Vanderwende, Lucy and Galley, Michel and Mitchell, Margaret}, journal={arXiv preprint arXiv:1604.03968}, year={2016}, }

Watch and Learn: Semi-Supervised Learning of Object Detectors from Video

Ishan Misra, Abhinav Shrivastava, Martial Hebert

CVPR 2015

PDF BibTeX

Semi-supervised Learning Video Recognition Instance Recognition

@inproceedings{MisraSSL15, Author = {Ishan Misra and Abhinav Shrivastava and Martial Hebert}, Booktitle = {CVPR}, Title = {Watch and Learn: Semi-Supervised Learning of Object Detectors from Videos}, Year = {2015}, } ,

Applying artificial vision models to human scene understanding

Elissa Aminoff, M. Toneva, Abhinav Shrivastava, Xinlei Chen, Ishan Misra, et al.

Journal of Frontiers in Computational Neuroscience 2015

Data-driven Exemplar Model Selection

Ishan Misra, Abhinav Shrivastava, Martial Hebert

WACV 2014

PDF BibTeX Best Student Paper

Image Recognition Instance Recognition

@inproceedings{MisraExemplarSelection, Author = {Ishan Misra and Abhinav Shrivastava and Martial Hebert}, Booktitle = {IEEE Winter Conference on Applications of Computer Vision (WACV)}, Title = {Data-driven Exemplar Model Selection}, Year = {2014}, } ,