Session Six

MAIN CONFERENCE

All papers will be presented in the same manner. Each paper will have a five-minute pre-recorded video and a PDF of the poster. An asynchronous text chat will be available for each paper. Attendees can view the papers and videos on demand at any time. Authors will also hold individual Q&A sessions at the times posted below.

All posted times are in EDT, but the chart linked below gives conversions for all time zones. Once the virtual site is up, you will be able to select the sessions you are interested in, and the site will populate your personal schedule.

Presentation Schedule

  • All times are Eastern Daylight Time

Date: Wednesday, June 23, 2021   6:00–8:30
Paper Session Six:

Paper ID Paper Title Authors
5142 Read Like Humans: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Recognition Shancheng Fang, Hongtao Xie, Yuxin Wang, Zhendong Mao, Yongdong Zhang
2270 MultiBodySync: Multi-Body Segmentation and Motion Estimation via 3D Scan Synchronization Jiahui Huang, He Wang, Tolga Birdal, Minhyuk Sung, Federica Arrigoni, Shi-Min Hu, Leonidas J. Guibas
5041 NeuTex: Neural Texture Mapping for Volumetric Neural Rendering Fanbo Xiang, Zexiang Xu, Miloš Hašan, Yannick Hold-Geoffroy, Kalyan Sunkavalli, Hao Su
414 UnsupervisedR&R: Unsupervised Point Cloud Registration via Differentiable Rendering Mohamed El Banani, Luya Gao, Justin Johnson
6098 RangeIoUDet: Range Image Based Real-Time 3D Object Detector Optimized by Intersection Over Union Zhidong Liang, Zehan Zhang, Ming Zhang, Xian Zhao, Shiliang Pu
10357 Architectural Adversarial Robustness: The Case for Deep Pursuit George Cazenavette, Calvin Murdock, Simon Lucey
457 SimPoE: Simulated Character Control for 3D Human Pose Estimation Ye Yuan, Shih-En Wei, Tomas Simon, Kris Kitani, Jason Saragih
5450 CodedStereo: Learned Phase Masks for Large Depth-of-Field Stereo Shiyu Tan, Yicheng Wu, Shoou-I Yu, Ashok Veeraraghavan
7815 PSD: Principled Synthetic-to-Real Dehazing Guided by Physical Priors Zeyuan Chen, Yangchao Wang, Yang Yang, Dong Liu
10082 OpenRooms: An Open Framework for Photorealistic Indoor Scene Datasets Zhengqin Li, Ting-Wei Yu, Shen Sang, Sarah Wang, Meng Song, Yuhan Liu, Yu-Ying Yeh, Rui Zhu, Nitesh Gundavarapu, Jia Shi, Sai Bi, Hong-Xing Yu, Zexiang Xu, Kalyan Sunkavalli, Miloš Hašan, Ravi Ramamoorthi, Manmohan Chandraker
7067 A Closer Look at Fourier Spectrum Discrepancies for CNN-Generated Images Detection Keshigeyan Chandrasegaran, Ngoc-Trung Tran, Ngai-Man Cheung
3702 NeRF in the Wild: Neural Radiance Fields for Unconstrained Photo Collections Ricardo Martin-Brualla, Noha Radwan, Mehdi S. M. Sajjadi, Jonathan T. Barron, Alexey Dosovitskiy, Daniel Duckworth
7596 ID-Unet: Iterative Soft and Hard Deformation for View Synthesis Mingyu Yin, Li Sun, Qingli Li
4877 GeoSim: Realistic Video Simulation via Geometry-Aware Composition for Self-Driving Yun Chen, Frieda Rong, Shivam Duggal, Shenlong Wang, Xinchen Yan, Sivabalan Manivasagam, Shangjie Xue, Ersin Yumer, Raquel Urtasun
4070 All Labels Are Not Created Equal: Enhancing Semi-Supervision via Label Grouping and Co-Training Islam Nassar, Samitha Herath, Ehsan Abbasnejad, Wray Buntine, Gholamreza Haffari
11399 Orthogonal Over-Parameterized Training Weiyang Liu, Rongmei Lin, Zhen Liu, James M. Rehg, Liam Paull, Li Xiong, Le Song, Adrian Weller
11659 DeepTag: An Unsupervised Deep Learning Method for Motion Tracking on Cardiac Tagging Magnetic Resonance Images Meng Ye, Mikael Kanski, Dong Yang, Qi Chang, Zhennan Yan, Qiaoying Huang, Leon Axel, Dimitris Metaxas
2819 Transferable Query Selection for Active Domain Adaptation Bo Fu, Zhangjie Cao, Jianmin Wang, Mingsheng Long
5388 When Age-Invariant Face Recognition Meets Face Age Synthesis: A Multi-Task Learning Framework Zhizhong Huang, Junping Zhang, Hongming Shan
3899 Simpler Certified Radius Maximization by Propagating Covariances Xingjian Zhen, Rudrasis Chakraborty, Vikas Singh
3741 Improving Panoptic Segmentation at All Scales Lorenzo Porzi, Samuel Rota Bulò, Peter Kontschieder
4411 Learning Triadic Belief Dynamics in Nonverbal Communication From Videos Lifeng Fan, Shuwen Qiu, Zilong Zheng, Tao Gao, Song-Chun Zhu, Yixin Zhu
4551 Guided Interactive Video Object Segmentation Using Reliability-Based Attention Maps Yuk Heo, Yeong Jun Koh, Chang-Su Kim
4263 Less Is More: ClipBERT for Video-and-Language Learning via Sparse Sampling Jie Lei, Linjie Li, Luowei Zhou, Zhe Gan, Tamara L. Berg, Mohit Bansal, Jingjing Liu
5359 Im2Vec: Synthesizing Vector Graphics Without Vector Supervision Pradyumna Reddy, Michaël Gharbi, Michal Lukáč, Niloy J. Mitra
4163 FSCE: Few-Shot Object Detection via Contrastive Proposal Encoding Bo Sun, Banghuai Li, Shengcai Cai, Ye Yuan, Chi Zhang
4621 Beyond Max-Margin: Class Margin Equilibrium for Few-Shot Object Detection Bohao Li, Boyu Yang, Chang Liu, Feng Liu, Rongrong Ji, Qixiang Ye
10441 Dynamic Head: Unifying Object Detection Heads With Attentions Xiyang Dai, Yinpeng Chen, Bin Xiao, Dongdong Chen, Mengchen Liu, Lu Yuan, Lei Zhang
7270 Dictionary-Guided Scene Text Recognition Nguyen Nguyen, Thu Nguyen, Vinh Tran, Minh-Triet Tran, Thanh Duc Ngo, Thien Huu Nguyen, Minh Hoai
6857 Progressive Contour Regression for Arbitrary-Shape Scene Text Detection Pengwen Dai, Sanyi Zhang, Hua Zhang, Xiaochun Cao
1920 Strengthen Learning Tolerance for Weakly Supervised Object Localization Guangyu Guo, Junwei Han, Fang Wan, Dingwen Zhang
2132 StruMonoNet: Structure-Aware Monocular 3D Prediction Zhenpei Yang, Li Erran Li, Qixing Huang
5382 Fully Understanding Generic Objects: Modeling, Segmentation, and Reconstruction Feng Liu, Luan Tran, Xiaoming Liu
7225 Exploiting & Refining Depth Distributions With Triangulation Light Curtains Yaadhav Raaj, Siddharth Ancha, Robert Tamburo, David Held, Srinivasa G. Narasimhan
2662 PMP-Net: Point Cloud Completion by Learning Multi-Step Point Moving Paths Xin Wen, Peng Xiang, Zhizhong Han, Yan-Pei Cao, Pengfei Wan, Wen Zheng, Yu-Shen Liu
4464 TearingNet: Point Cloud Autoencoder To Learn Topology-Friendly Representations Jiahao Pang, Duanshun Li, Dong Tian
10827 3D Object Detection With Pointformer Xuran Pan, Zhuofan Xia, Shiji Song, Li Erran Li, Gao Huang
552 NeuroMorph: Unsupervised Shape Interpolation and Correspondence in One Go Marvin Eisenberger, David Novotny, Gael Kerchenbaum, Patrick Labatut, Natalia Neverova, Daniel Cremers, Andrea Vedaldi
5459 Towards Part-Based Understanding of RGB-D Scans Alexey Bokhovkin, Vladislav Ishimtsev, Emil Bogomolov, Denis Zorin, Alexey Artemov, Evgeny Burnaev, Angela Dai
2454 NeRV: Neural Reflectance and Visibility Fields for Relighting and View Synthesis Pratul P. Srinivasan, Boyang Deng, Xiuming Zhang, Matthew Tancik, Ben Mildenhall, Jonathan T. Barron
2709 Probabilistic Model Distillation for Semantic Correspondence Xin Li, Deng-Ping Fan, Fan Yang, Ao Luo, Hong Cheng, Zicheng Liu
1180 SceneGraphFusion: Incremental 3D Scene Graph Prediction From RGB-D Sequences Shun-Cheng Wu, Johanna Wald, Keisuke Tateno, Nassir Navab, Federico Tombari
5815 Self-Supervised Learning of Depth Inference for Multi-View Stereo Jiayu Yang, Jose M. Alvarez, Miaomiao Liu
2995 Mesoscopic Photogrammetry With an Unstabilized Phone Camera Kevin C. Zhou, Colin Cooke, Jaehee Park, Ruobing Qian, Roarke Horstmeyer, Joseph A. Izatt, Sina Farsiu
1579 LiDAR R-CNN: An Efficient and Universal 3D Object Detector Zhichao Li, Feng Wang, Naiyan Wang
4987 Monocular 3D Object Detection: An Extrinsic Parameter Free Approach Yunsong Zhou, Yuan He, Hongzi Zhu, Cheng Wang, Hongyang Li, Qinhong Jiang
2066 Beyond Short Clips: End-to-End Video-Level Learning With Collaborative Memories Xitong Yang, Haoqi Fan, Lorenzo Torresani, Larry S. Davis, Heng Wang
2514 Multimodal Motion Prediction With Stacked Transformers Yicheng Liu, Jinghuai Zhang, Liangji Fang, Qinhong Jiang, Bolei Zhou
3822 Weakly Supervised Action Selection Learning in Video Junwei Ma, Satya Krishna Gorti, Maksims Volkovs, Guangwei Yu
2236 BASAR: Black-Box Attack on Skeletal Action Recognition Yunfeng Diao, Tianjia Shao, Yong-Liang Yang, Kun Zhou, He Wang
10059 Adversarial Robustness Across Representation Spaces Pranjal Awasthi, George Yu, Chun-Sung Ferng, Andrew Tomkins, Da-Cheng Juan
7277 img2pose: Face Alignment and Detection via 6DoF, Face Pose Estimation Vítor Albiero, Xingyu Chen, Xi Yin, Guan Pang, Tal Hassner
7375 OSTeC: One-Shot Texture Completion Baris Gecer, Jiankang Deng, Stefanos Zafeiriou
2691 Locally Aware Piecewise Transformation Fields for 3D Human Mesh Registration Shaofei Wang, Andreas Geiger, Siyu Tang
2831 Monocular 3D Multi-Person Pose Estimation by Integrating Top-Down and Bottom-Up Networks Yu Cheng, Bo Wang, Bo Yang, Robby T. Tan
3651 Feature Decomposition and Reconstruction Learning for Effective Facial Expression Recognition Delian Ruan, Yan Yan, Shenqi Lai, Zhenhua Chai, Chunhua Shen, Hanzi Wang
227 SDD-FIQA: Unsupervised Face Image Quality Assessment With Similarity Distribution Distance Fu-Zhao Ou, Xingyu Chen, Ruixin Zhang, Yuge Huang, Shaoxin Li, Jilin Li, Yong Li, Liujuan Cao, Yuan-Gen Wang
7423 Facial Action Unit Detection With Transformers Geethu Miriam Jacob, Björn Stenger
2426 Anchor-Free Person Search Yichao Yan, Jinpeng Li, Jie Qin, Song Bai, Shengcai Liao, Li Liu, Fan Zhu, Ling Shao
4173 Neural Camera Simulators Hao Ouyang, Zifan Shi, Chenyang Lei, Ka Lung Law, Qifeng Chen
6987 Neural Auto-Exposure for High-Dynamic Range Object Detection Emmanuel Onzon, Fahim Mannan, Felix Heide
3264 ARVo: Learning All-Range Volumetric Correspondence for Video Deblurring Dongxu Li, Chenchen Xu, Kaihao Zhang, Xin Yu, Yiran Zhong, Wenqi Ren, Hanna Suominen, Hongdong Li
288 Memory Oriented Transfer Learning for Semi-Supervised Image Deraining Huaibo Huang, Aijing Yu, Ran He
11692 Robust Representation Learning With Feedback for Single Image Deraining Chenghao Chen, Hao Li
4408 A Multi-Task Network for Joint Specular Highlight Detection and Removal Gang Fu, Qing Zhang, Lei Zhu, Ping Li, Chunxia Xiao
4827 Panoramic Image Reflection Removal Yuchen Hong, Qian Zheng, Lingran Zhao, Xudong Jiang, Alex C. Kot, Boxin Shi
3051 Turning Frequency to Resolution: Video Super-Resolution via Event Cameras Yongcheng Jing, Yiding Yang, Xinchao Wang, Mingli Song, Dacheng Tao
951 SRWarp: Generalized Image Super-Resolution under Arbitrary Transformation Sanghyun Son, Kyoung Mu Lee
6495 Learning Scene Structure Guidance via Cross-Task Knowledge Transfer for Single Depth Super-Resolution Baoli Sun, Xinchen Ye, Baopu Li, Haojie Li, Zhihui Wang, Rui Xu
7173 Gated Spatio-Temporal Attention-Guided Video Deblurring Maitreya Suin, A. N. Rajagopalan
3908 Detection, Tracking, and Counting Meets Drones in Crowds: A Benchmark Longyin Wen, Dawei Du, Pengfei Zhu, Qinghua Hu, Qilong Wang, Liefeng Bo, Siwei Lyu
10684 Objectron: A Large Scale Dataset of Object-Centric Videos in the Wild With Pose Annotations Adel Ahmadyan, Liangkai Zhang, Artsiom Ablavatski, Jianing Wei, Matthias Grundmann
11470 Dynamic Domain Adaptation for Efficient Inference Shuang Li, JinMing Zhang, Wenxuan Ma, Chi Harold Liu, Wei Li
3225 General Instance Distillation for Object Detection Xing Dai, Zeren Jiang, Zhao Wu, Yiping Bao, Zhicheng Wang, Si Liu, Erjin Zhou
3109 Data-Free Knowledge Distillation for Image Super-Resolution Yiman Zhang, Hanting Chen, Xinghao Chen, Yiping Deng, Chunjing Xu, Yunhe Wang
8145 Improving Accuracy of Binary Neural Networks Using Unbalanced Activation Distribution Hyungjun Kim, Jihoon Park, Changhun Lee, Jae-Joon Kim
7214 Hijack-GAN: Unintended-Use of Pretrained, Black-Box GANs Hui-Po Wang, Ning Yu, Mario Fritz
8263 Cross Modal Focal Loss for RGBD Face Anti-Spoofing Anjith George, Sébastien Marcel
6288 On the Difficulty of Membership Inference Attacks Shahbaz Rezaei, Xin Liu
6705 Lifelong Person Re-Identification via Adaptive Knowledge Accumulation Nan Pu, Wei Chen, Yu Liu, Erwin M. Bakker, Michael S. Lew
1970 Stereo Radiance Fields (SRF): Learning View Synthesis for Sparse Views of Novel Scenes Julian Chibane, Aayush Bansal, Verica Lazova, Gerard Pons-Moll
514 Regularizing Generative Adversarial Networks Under Limited Data Hung-Yu Tseng, Lu Jiang, Ce Liu, Ming-Hsuan Yang, Weilong Yang
7418 Automatic Correction of Internal Units in Generative Neural Networks Ali Tousi, Haedong Jeong, Jiyeon Han, Hwanil Choi, Jaesik Choi
1523 HistoGAN: Controlling Colors of GAN-Generated and Real Images via Color Histograms Mahmoud Afifi, Marcus A. Brubaker, Michael S. Brown
6085 Prior Based Human Completion Zibo Zhao, Wen Liu, Yanyu Xu, Xianing Chen, Weixin Luo, Lei Jin, Bohui Zhu, Tong Liu, Binqiang Zhao, Shenghua Gao
3618 Diverse Semantic Image Synthesis via Probability Distribution Modeling Zhentao Tan, Menglei Chai, Dongdong Chen, Jing Liao, Qi Chu, Bin Liu, Gang Hua, Nenghai Yu
8297 Adaptive Convolutions for Structure-Aware Style Transfer Prashanth Chandran, Gaspard Zoss, Paulo Gotardo, Markus Gross, Derek Bradley
6677 PISE: Person Image Synthesis and Editing With Decoupled GAN Jinsong Zhang, Kun Li, Yu-Kun Lai, Jingyu Yang
4642 Semi-Supervised Synthesis of High-Resolution Editable Textures for 3D Humans Bindita Chaudhuri, Nikolaos Sarafianos, Linda Shapiro, Tony Tung
3444 CDFI: Compression-Driven Network Design for Frame Interpolation Tianyu Ding, Luming Liang, Zhihui Zhu, Ilya Zharkov
7537 Few-Shot Classification With Feature Map Reconstruction Networks Davis Wertheimer, Luming Tang, Bharath Hariharan
5931 Augmentation Strategies for Learning With Noisy Labels Kento Nishi, Yi Ding, Alex Rich, Tobias Höllerer
1454 Activate or Not: Learning Customized Activation Ningning Ma, Xiangyu Zhang, Ming Liu, Jian Sun
10160 Background Splitting: Finding Rare Classes in a Sea of Background Ravi Teja Mullapudi, Fait Poms, William R. Mark, Deva Ramanan, Kayvon Fatahalian
2915 CLCC: Contrastive Learning for Color Constancy Yi-Chen Lo, Chia-Che Chang, Hsuan-Chao Chiu, Yu-Hao Huang, Chia-Ping Chen, Yu-Lin Chang, Kevin Jou
3115 Dynamic Region-Aware Convolution Jin Chen, Xijun Wang, Zichao Guo, Xiangyu Zhang, Jian Sun
5480 Learning Dynamics via Graph Neural Networks for Human Pose Estimation and Tracking Yiding Yang, Zhou Ren, Haoxiang Li, Chunluan Zhou, Xinchao Wang, Gang Hua
10624 Searching for Fast Model Families on Datacenter Accelerators Sheng Li, Mingxing Tan, Ruoming Pang, Andrew Li, Liqun Cheng, Quoc V. Le, Norman P. Jouppi
6563 Discrete-Continuous Action Space Policy Gradient-Based Attention for Image-Text Matching Shiyang Yan, Li Yu, Yuan Xie
8266 Quantifying Explainers of Graph Neural Networks in Computational Pathology Guillaume Jaume, Pushpak Pati, Behzad Bozorgtabar, Antonio Foncubierta, Anna Maria Anniciello, Florinda Feroce, Tilman Rau, Jean-Philippe Thiran, Maria Gabrani, Orcun Goksel
2486 Forecasting Irreversible Disease via Progression Learning Botong Wu, Sijie Ren, Jing Li, Xinwei Sun, Shi-Ming Li, Yizhou Wang
6901 Transformer Tracking Xin Chen, Bin Yan, Jiawen Zhu, Dong Wang, Xiaoyun Yang, Huchuan Lu
6124 Online Multiple Object Tracking With Cross-Task Synergy Song Guo, Jingya Wang, Xinchao Wang, Dacheng Tao
6219 Learning 3D Shape Feature for Texture-Insensitive Person Re-Identification Jiaxing Chen, Xinyang Jiang, Fudong Wang, Jun Zhang, Feng Zheng, Xing Sun, Wei-Shi Zheng
2479 Regularizing Neural Networks via Adversarial Model Perturbation Yaowei Zheng, Richong Zhang, Yongyi Mao
2835 Task-Aware Variational Adversarial Active Learning Kwanyoung Kim, Dongwon Park, Kwang In Kim, Se Young Chun
8179 VDSM: Unsupervised Video Disentanglement With State-Space Modeling and Deep Mixtures of Experts Matthew J. Vowels, Necati Cihan Camgoz, Richard Bowden
2960 Multi-Target Domain Adaptation With Collaborative Consistency Learning Takashi Isobe, Xu Jia, Shuaijun Chen, Jianzhong He, Yongjie Shi, Jianzhuang Liu, Huchuan Lu, Shengjin Wang
6105 Learning To Relate Depth and Semantics for Unsupervised Domain Adaptation Suman Saha, Anton Obukhov, Danda Pani Paudel, Menelaos Kanakis, Yuhua Chen, Stamatios Georgoulis, Luc Van Gool
4346 Adversarially Adaptive Normalization for Single Domain Generalization Xinjie Fan, Qifei Wang, Junjie Ke, Feng Yang, Boqing Gong, Mingyuan Zhou
4189 Rainbow Memory: Continual Learning With a Memory of Diverse Samples Jihwan Bang, Heesu Kim, YoungJoon Yoo, Jung-Woo Ha, Jonghyun Choi
3378 Asymmetric Metric Learning for Knowledge Transfer Mateusz Budnik, Yannis Avrithis
10164 Scalability vs. Utility: Do We Have To Sacrifice One for the Other in Data Importance Quantification? Ruoxi Jia, Fan Wu, Xuehui Sun, Jiacen Xu, David Dao, Bhavya Kailkhura, Ce Zhang, Bo Li, Dawn Song
8027 Self-Supervised Learning on 3D Point Clouds by Learning Discrete Generative Models Benjamin Eckart, Wentao Yuan, Chao Liu, Jan Kautz
843 Multi-view Depth Estimation using Epipolar Spatio-Temporal Networks Xiaoxiao Long, Lingjie Liu, Wei Li, Christian Theobalt, Wenping Wang
9958 Beyond Image to Depth: Improving Depth Prediction Using Echoes Kranti Kumar Parida, Siddharth Srivastava, Gaurav Sharma
3042 Deeply Shape-Guided Cascade for Instance Segmentation Hao Ding, Siyuan Qiao, Alan Yuille, Wei Shen
1130 Linguistic Structures As Weak Supervision for Visual Scene Graph Generation Keren Ye, Adriana Kovashka
1210 Semantic Segmentation With Generative Models: Semi-Supervised Learning and Strong Out-of-Domain Generalization Daiqing Li, Junlin Yang, Karsten Kreis, Antonio Torralba, Sanja Fidler
3575 Self-Guided and Cross-Guided Learning for Few-Shot Segmentation Bingfeng Zhang, Jimin Xiao, Terry Qin
2934 Scene Essence Jiayan Qiu, Yiding Yang, Xinchao Wang, Dacheng Tao
578 Adaptive Prototype Learning and Allocation for Few-Shot Segmentation Gen Li, Varun Jampani, Laura Sevilla-Lara, Deqing Sun, Jonghyun Kim, Joongkyu Kim
305 Cluster, Split, Fuse, and Update: Meta-Learning for Open Compound Domain Adaptive Semantic Segmentation Rui Gong, Yuhua Chen, Danda Pani Paudel, Yawei Li, Ajad Chhatkuli, Wen Li, Dengxin Dai, Luc Van Gool
3383 Unsupervised Part Segmentation Through Disentangling Appearance and Shape Shilong Liu, Lei Zhang, Xiao Yang, Hang Su, Jun Zhu
3751 Temporal Action Segmentation From Timestamp Supervision Zhe Li, Yazan Abu Farha, Jürgen Gall
7251 RAFT-3D: Scene Flow Using Rigid-Motion Embeddings Zachary Teed, Jia Deng
4949 Coarse-Fine Networks for Temporal Activity Detection in Videos Kumara Kahatapitiya, Michael S. Ryoo
2082 Learning Discriminative Prototypes With Dynamic Time Warping Xiaobin Chang, Frederick Tung, Greg Mori
6302 Learning Dynamic Network Using a Reuse Gate Function in Semi-Supervised Video Object Segmentation Hyojin Park, Jayeon Yoo, Seohyeong Jeong, Ganesh Venkatesh, Nojun Kwak
274 Probabilistic Embeddings for Cross-Modal Retrieval Sanghyuk Chun, Seong Joon Oh, Rafael Sampaio de Rezende, Yannis Kalantidis, Diane Larlus
6167 Towards Bridging Event Captioner and Sentence Localizer for Weakly Supervised Dense Event Captioning Shaoxiang Chen, Yu-Gang Jiang
11068 Positive Sample Propagation Along the Audio-Visual Event Line Jinxing Zhou, Liang Zheng, Yiran Zhong, Shijie Hao, Meng Wang
2404 Embracing Uncertainty: Decoupling and De-Bias for Robust Temporal Grounding Hao Zhou, Chongyang Zhang, Yan Luo, Yanjun Chen, Chuanping Hu
2791 Structured Scene Memory for Vision-Language Navigation Hanqing Wang, Wenguan Wang, Wei Liang, Caiming Xiong, Jianbing Shen
1720 Found a Reason for me? Weakly-supervised Grounded Visual Question Answering using Capsules Aisha Urooj, Hilde Kuehne, Kevin Duarte, Chuang Gan, Niels Lobo, Mubarak Shah
8500 Hierarchical Layout-Aware Graph Convolutional Network for Unified Aesthetics Assessment Dongyu She, Yu-Kun Lai, Gaoxiong Yi, Kun Xu
1993 Parser-Free Virtual Try-On via Distilling Appearance Flows Yuying Ge, Yibing Song, Ruimao Zhang, Chongjian Ge, Wei Liu, Ping Luo
7368 Self-Supervised Simultaneous Multi-Step Prediction of Road Dynamics and Cost Map Elmira Amirloo, Mohsen Rohani, Ershad Banijamali, Jun Luo, Pascal Poupart
5096 StyleMeUp: Towards Style-Agnostic Sketch-Based Image Retrieval Aneeshan Sain, Ayan Kumar Bhunia, Yongxin Yang, Tao Xiang, Yi-Zhe Song