Angel Xuan Chang

I am an Associate Professor at Simon Fraser University. Prior to this, I was a visiting research scientist at Facebook AI Research and a research scientist at Eloquent Labs working on dialogue. I received my Ph.D. in Computer Science from Stanford, where I was part of the Natural Language Processing Group and advised by Chris Manning. My research focuses on connecting language to 3D representations of shapes and scenes and grounding of language for embodied agents in indoor environments. I have worked on methods for synthesizing 3D scenes and shapes from natural language, and various datasets for 3D scene understanding. In general, I am interested in the semantics of shapes and scenes, the representation and acquisition of common sense knowledge, and reasoning using probabilistic models. My group also works on using machine learning for biodiversity monitoring, specifically with DNA barcodes as part of the larger BIOSCAN project. Some of my other interests include drawing and dance.

Angel Xuan Chang

angelx-{at}-sfu-[dot]-ca

Associate Professor
School of Computing Science
Simon Fraser University
3DLG | GrUVi | SFU NatLang
SFU AI/ML | VINCI
Canada CIFAR AI Chair (Amii)
TUM-IAS Hans Fischer Fellow (2018-2022)
Google Scholar

Research Themes

Grounding language to 3D

ViGiL3D: A Linguistically Diverse Dataset for 3D Visual Grounding

Duoduo CLIP: Efficient 3D Understanding with Multi-View Images

TriCoLo: Trimodal Contrastive Loss for Text to Shape Retrieval

Multi3DRefer: Grounding Text Description to Multiple 3D Objects

UniT3D: A Unified Transformer for 3D Dense Captioning and Visual Grounding

D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding

3DVQA: Visual Question Answering for 3D Environments

Scan2Cap: Context-aware Dense Captioning in RGB-D Scans

ScanRefer: 3D Object Localization in RGB-D Scans using Natural Language

Language based content creation

HSM: Hierarchical Scene Motifs for Multi-Scale Indoor Scene Generation

SceneEval: Evaluating Semantic Coherence in Text-Conditioned 3D Indoor Scene Synthesis

SceneMotifCoder: Example-driven Visual Program Learning for Generating 3D Object Arrangements

Text-to-3D Shape Generation

Understanding Pure CLIP Guidance for Voxel Grid NeRF Models

Text2Shape: Generating Shapes from Natural Language by Learning Joint Embeddings

Text to 3D Scene Generation with Rich Lexical Grounding

Learning Spatial Knowledge for Text to 3D Scene Generation

BIOSCAN

CLIBD: Bridging Vision and Genomics for Biodiversity Monitoring at Scale

BIOSCAN-5M: A Multimodal Dataset for Insect Biodiversity

Zahra Gharaee, Scott C Lowe, Zeming Gong, Pablo Millan Arias, Nicholas Pellegrino, Austin T. Wang, Joakim Bruslund Haurum, Iuliia Zarubiieva, Lila Kari, Dirk Steinke, Graham W Taylor, Paul Fieguth, Angel X. Chang
NeurIPS Datasets and Benchmarks 2024, arXiv:2406.12723 [cs.LG], June 2024
pdf | code | webpage

A Step Towards Worldwide Biodiversity Assessment: The BIOSCAN-1M Insect Dataset

Zahra Gharaee, Zeming Gong, Nicholas Pellegrino, Iuliia Zarubiieva, Joakim Bruslund Haurum, Scott C Lowe, Jaclyn TA McKeown, Chris CY Ho, Joschka McLeod, Yi-Yun C Wei, Jireh Agda, Sujeevan Ratnasingham, Dirk Steinke, Angel X. Chang, Graham W Taylor, Paul Fieguth
NeurIPS Datasets and Benchmarks 2023
pdf | code | webpage

Embodied AI

Zero-shot Object-Centric Instruction Following: Integrating Foundation Models with Traditional Navigation

MOPA: Modular Object Navigation with PointGoal Agents

HomeRobot: Open Vocabulary Mobile Manipulation

Sriram Yenamandra, Arun Ramachandran, Karmesh Yadav, Austin Wang, Mukul Khanna, Theo Gervet, Tsung-Yen Yang, Vidhi Jain, Alexander William Clegg, John Turner, Zsolt Kira, Manolis Savva, Angel X. Chang, Devendra Singh Chaplot, Dhruv Batra, Roozbeh Mottaghi, Yonatan Bisk, Chris Paxton
CoRL 2023
pdf | code | challenge | webpage

Exploiting Proximity-Aware Tasks for Embodied Social Navigation

Language-Aligned Waypoint (LAW) Supervision for Vision-and-Language Navigation in Continuous Environments

Interpretation of Emergent Communication in Heterogeneous Collaborative Embodied Agents

Rearrangement: A Challenge for Embodied AI

Dhruv Batra, Angel X. Chang, Sonia Chernova, Andrew J. Davison, Jia Deng, Vladlen Koltun, Sergey Levine, Jitendra Malik, Igor Mordatch, Roozbeh Mottaghi, Manolis Savva, Hao Su
arXiv:2011.01975 [cs.AI], November 2020
pdf

Multi-ON: Benchmarking Semantic Map Memory using Multi-Object Navigation

On evaluation of embodied navigation agents

Peter Anderson, Angel X. Chang, Devendra Singh Chaplot, Alexey Dosovitskiy, Saurabh Gupta, Vladlen Koltun, Jana Kosecka, Jitendra Malik, Roozbeh Mottaghi, Manolis Savva, Amir R. Zamir
arXiv:1807.06757 [cs.AI], July 2018
pdf

Simulation platforms

Habitat 2.0: Training Home Assistants to Rearrange their Habitat

Andrew Szot, Alexander Clegg, Eric Undersander, Erik Wijmans, Yili Zhao, John Turner, Noah Maestre, Mustafa Mukadam, Devendra Singh Chaplot, Oleksandr Maksymets, Aaron Gokaslan, Vladimír Vondrus, Sameer Dharur, Franziska Meier, Wojciech Galuba, Angel X. Chang, Zsolt Kira, Vladlen Koltun, Jitendra Malik, Manolis Savva, Dhruv Batra
NeurIPS 2021
pdf | code | post

SAPIEN: a SimulAted Part-based Interactive ENvironment

Fanbo Xiang, Yuzhe Qin, Kaichun Mo, Yikuan Xia, Hao Zhu, Fangchen Liu, Minghua Liu, Hanxiao (Shawn) Jiang, Yifu Yuan, He Wang, Li Yi, Angel X. Chang, Leonidas Guibas, Hao Su
CVPR 2020
pdf | webpage

MINOS: Multimodal Indoor Simulator for Navigation in Complex Environments

Articulated objects for interactive environments

SINGAPO: Single Image Controlled Generation of Articulated Parts in Objects

S2O: Static to Openable Enhancement for Articulated 3D Objects

OPDMulti: Openable Part Detection for Multiple Objects

MultiScan: Scalable RGBD scanning for 3D environments with articulated objects

Articulated 3D Human-Object Interactions from RGB Videos: An Empirical Analysis of Approaches and Challenges

OPD: Single-view 3D Openable Part Detection

Motion Annotation Programs: A Scalable Approach to Annotating Kinematic Articulations in Large 3D Shape Collections

Datasets for 3D deep learning

Habitat Synthetic Scenes Dataset (HSSD-200): An Analysis of 3D Scene Scale and Realism Tradeoffs for ObjectGoal Navigation

Mukul Khanna, Yongsen Mao, Hanxiao (Shawn) Jiang, Sanjay Haresh, Brennan Shacklett, Dhruv Batra, Alexander William Clegg, Eric Undersander, Angel X. Chang, Manolis Savva
CVPR 2024, arXiv:2306.11290 [cs.CV], June 2023
pdf | code | data | webpage

Habitat-Matterport 3D Semantics Dataset

Karmesh Yadav, Ram Ramrakhya, Santhosh K. Ramakrishnan, Theo Gervet, John Turner, Aaron Gokaslan, Noah Maestre, Angel X. Chang, Dhruv Batra, Manolis Savva, Alexander William Clegg, Devendra Singh Chaplot
CVPR 2023
pdf | webpage

Habitat-Matterport 3D Dataset (HM3D): 1000 Large-scale 3D Environments for Embodied AI

Santhosh K. Ramakrishnan, Aaron Gokaslan, Erik Wijmans, Oleksandr Maksymets, Alexander Clegg, John Turner, Eric Undersander, Wojciech Galuba, Andrew Westbury, Angel X. Chang, Manolis Savva, Yili Zhao, Dhruv Batra
NeurIPS Datasets and Benchmarks Track 2021
pdf | webpage

Mirror3D: Depth Refinement for Mirror Surfaces

PartNet: A Large-scale Benchmark for Fine-grained and Hierarchical Part-level 3D Object Understanding

Scan2CAD: Learning CAD Model Alignment in RGB-D Scans

Matterport3D: Learning from RGB-D Data in Indoor Environments

ScanNet: Richly-annotated 3D Reconstructions of Indoor Scenes

ShapeNet: An Information-Rich 3D Model Repository

Angel X. Chang, Thomas Funkhouser, Leonidas Guibas, Pat Hanrahan, Qixing Huang, Zimo Li, Silvio Savarese, Manolis Savva, Shuran Song, Hao Su, Jianxiong Xiao, Li Yi, Fisher Yu
arXiv:1512.03012 [cs.GR], Dec 2015
pdf | bib | code | webpage

3D scene understanding and generation

Diorama: Unleashing Zero-shot Single-view 3D Scene Modeling

Roominoes: Learning to Assemble 3D Rooms into Floor Plans

Plan2Scene: Converting Floorplans to 3D Scenes

PlanIT: Planning and Instantiating Indoor Scenes with Relation Graph and Spatial Prior Networks

Hierarchy Denoising Recursive Autoencoders for 3D Scene Layout Prediction

Deep Convolutional Priors for Indoor Scene Synthesis

Semantic Scene Completion from a Single Depth Image