MAIN CONFERENCE
All papers will be presented in the same manner. Each paper will have a five-minute pre-recorded video and a PDF of the poster. An asynchronous text chat will be available for each paper. Attendees can view the papers and videos on demand at any time. Authors will also have individual Q&A sessions at the posted times below.
All posted times are EDT, but the chart linked below gives conversions for all time zones. When the virtual site is up, you will be able to select the sessions you are interested in, and the site will populate your own schedule.
Presentation Schedule
All times are Eastern Daylight Time
Date: Wednesday, June 23, 2021, 6:00–8:30
Paper Session Six:
Paper ID | Paper Title | Authors |
5142 | Read Like Humans: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Recognition | Shancheng Fang, Hongtao Xie, Yuxin Wang, Zhendong Mao, Yongdong Zhang |
2270 | MultiBodySync: Multi-Body Segmentation and Motion Estimation via 3D Scan Synchronization | Jiahui Huang, He Wang, Tolga Birdal, Minhyuk Sung, Federica Arrigoni, Shi-Min Hu, Leonidas J. Guibas |
5041 | NeuTex: Neural Texture Mapping for Volumetric Neural Rendering | Fanbo Xiang, Zexiang Xu, Miloš Hašan, Yannick Hold-Geoffroy, Kalyan Sunkavalli, Hao Su |
414 | UnsupervisedR&R: Unsupervised Point Cloud Registration via Differentiable Rendering | Mohamed El Banani, Luya Gao, Justin Johnson |
6098 | RangeIoUDet: Range Image Based Real-Time 3D Object Detector Optimized by Intersection Over Union | Zhidong Liang, Zehan Zhang, Ming Zhang, Xian Zhao, Shiliang Pu |
10357 | Architectural Adversarial Robustness: The Case for Deep Pursuit | George Cazenavette, Calvin Murdock, Simon Lucey |
457 | SimPoE: Simulated Character Control for 3D Human Pose Estimation | Ye Yuan, Shih-En Wei, Tomas Simon, Kris Kitani, Jason Saragih |
5450 | CodedStereo: Learned Phase Masks for Large Depth-of-Field Stereo | Shiyu Tan, Yicheng Wu, Shoou-I Yu, Ashok Veeraraghavan |
7815 | PSD: Principled Synthetic-to-Real Dehazing Guided by Physical Priors | Zeyuan Chen, Yangchao Wang, Yang Yang, Dong Liu |
10082 | OpenRooms: An Open Framework for Photorealistic Indoor Scene Datasets | Zhengqin Li, Ting-Wei Yu, Shen Sang, Sarah Wang, Meng Song, Yuhan Liu, Yu-Ying Yeh, Rui Zhu, Nitesh Gundavarapu, Jia Shi, Sai Bi, Hong-Xing Yu, Zexiang Xu, Kalyan Sunkavalli, Miloš Hašan, Ravi Ramamoorthi, Manmohan Chandraker |
7067 | A Closer Look at Fourier Spectrum Discrepancies for CNN-Generated Images Detection | Keshigeyan Chandrasegaran, Ngoc-Trung Tran, Ngai-Man Cheung |
3702 | NeRF in the Wild: Neural Radiance Fields for Unconstrained Photo Collections | Ricardo Martin-Brualla, Noha Radwan, Mehdi S. M. Sajjadi, Jonathan T. Barron, Alexey Dosovitskiy, Daniel Duckworth |
7596 | ID-Unet: Iterative Soft and Hard Deformation for View Synthesis | Mingyu Yin, Li Sun, Qingli Li |
4877 | GeoSim: Realistic Video Simulation via Geometry-Aware Composition for Self-Driving | Yun Chen, Frieda Rong, Shivam Duggal, Shenlong Wang, Xinchen Yan, Sivabalan Manivasagam, Shangjie Xue, Ersin Yumer, Raquel Urtasun |
4070 | All Labels Are Not Created Equal: Enhancing Semi-Supervision via Label Grouping and Co-Training | Islam Nassar, Samitha Herath, Ehsan Abbasnejad, Wray Buntine, Gholamreza Haffari |
11399 | Orthogonal Over-Parameterized Training | Weiyang Liu, Rongmei Lin, Zhen Liu, James M. Rehg, Liam Paull, Li Xiong, Le Song, Adrian Weller |
11659 | DeepTag: An Unsupervised Deep Learning Method for Motion Tracking on Cardiac Tagging Magnetic Resonance Images | Meng Ye, Mikael Kanski, Dong Yang, Qi Chang, Zhennan Yan, Qiaoying Huang, Leon Axel, Dimitris Metaxas |
2819 | Transferable Query Selection for Active Domain Adaptation | Bo Fu, Zhangjie Cao, Jianmin Wang, Mingsheng Long |
5388 | When Age-Invariant Face Recognition Meets Face Age Synthesis: A Multi-Task Learning Framework | Zhizhong Huang, Junping Zhang, Hongming Shan |
3899 | Simpler Certified Radius Maximization by Propagating Covariances | Xingjian Zhen, Rudrasis Chakraborty, Vikas Singh |
3741 | Improving Panoptic Segmentation at All Scales | Lorenzo Porzi, Samuel Rota Bulò, Peter Kontschieder |
4411 | Learning Triadic Belief Dynamics in Nonverbal Communication From Videos | Lifeng Fan, Shuwen Qiu, Zilong Zheng, Tao Gao, Song-Chun Zhu, Yixin Zhu |
4551 | Guided Interactive Video Object Segmentation Using Reliability-Based Attention Maps | Yuk Heo, Yeong Jun Koh, Chang-Su Kim |
4263 | Less Is More: ClipBERT for Video-and-Language Learning via Sparse Sampling | Jie Lei, Linjie Li, Luowei Zhou, Zhe Gan, Tamara L. Berg, Mohit Bansal, Jingjing Liu |
5359 | Im2Vec: Synthesizing Vector Graphics Without Vector Supervision | Pradyumna Reddy, Michaël Gharbi, Michal Lukáč, Niloy J. Mitra |
4163 | FSCE: Few-Shot Object Detection via Contrastive Proposal Encoding | Bo Sun, Banghuai Li, Shengcai Cai, Ye Yuan, Chi Zhang |
4621 | Beyond Max-Margin: Class Margin Equilibrium for Few-Shot Object Detection | Bohao Li, Boyu Yang, Chang Liu, Feng Liu, Rongrong Ji, Qixiang Ye |
10441 | Dynamic Head: Unifying Object Detection Heads With Attentions | Xiyang Dai, Yinpeng Chen, Bin Xiao, Dongdong Chen, Mengchen Liu, Lu Yuan, Lei Zhang |
7270 | Dictionary-Guided Scene Text Recognition | Nguyen Nguyen, Thu Nguyen, Vinh Tran, Minh-Triet Tran, Thanh Duc Ngo, Thien Huu Nguyen, Minh Hoai |
6857 | Progressive Contour Regression for Arbitrary-Shape Scene Text Detection | Pengwen Dai, Sanyi Zhang, Hua Zhang, Xiaochun Cao |
1920 | Strengthen Learning Tolerance for Weakly Supervised Object Localization | Guangyu Guo, Junwei Han, Fang Wan, Dingwen Zhang |
2132 | StruMonoNet: Structure-Aware Monocular 3D Prediction | Zhenpei Yang, Li Erran Li, Qixing Huang |
5382 | Fully Understanding Generic Objects: Modeling, Segmentation, and Reconstruction | Feng Liu, Luan Tran, Xiaoming Liu |
7225 | Exploiting & Refining Depth Distributions With Triangulation Light Curtains | Yaadhav Raaj, Siddharth Ancha, Robert Tamburo, David Held, Srinivasa G. Narasimhan |
2662 | PMP-Net: Point Cloud Completion by Learning Multi-Step Point Moving Paths | Xin Wen, Peng Xiang, Zhizhong Han, Yan-Pei Cao, Pengfei Wan, Wen Zheng, Yu-Shen Liu |
4464 | TearingNet: Point Cloud Autoencoder To Learn Topology-Friendly Representations | Jiahao Pang, Duanshun Li, Dong Tian |
10827 | 3D Object Detection With Pointformer | Xuran Pan, Zhuofan Xia, Shiji Song, Li Erran Li, Gao Huang |
552 | NeuroMorph: Unsupervised Shape Interpolation and Correspondence in One Go | Marvin Eisenberger, David Novotny, Gael Kerchenbaum, Patrick Labatut, Natalia Neverova, Daniel Cremers, Andrea Vedaldi |
5459 | Towards Part-Based Understanding of RGB-D Scans | Alexey Bokhovkin, Vladislav Ishimtsev, Emil Bogomolov, Denis Zorin, Alexey Artemov, Evgeny Burnaev, Angela Dai |
2454 | NeRV: Neural Reflectance and Visibility Fields for Relighting and View Synthesis | Pratul P. Srinivasan, Boyang Deng, Xiuming Zhang, Matthew Tancik, Ben Mildenhall, Jonathan T. Barron |
2709 | Probabilistic Model Distillation for Semantic Correspondence | Xin Li, Deng-Ping Fan, Fan Yang, Ao Luo, Hong Cheng, Zicheng Liu |
1180 | SceneGraphFusion: Incremental 3D Scene Graph Prediction From RGB-D Sequences | Shun-Cheng Wu, Johanna Wald, Keisuke Tateno, Nassir Navab, Federico Tombari |
5815 | Self-Supervised Learning of Depth Inference for Multi-View Stereo | Jiayu Yang, Jose M. Alvarez, Miaomiao Liu |
2995 | Mesoscopic Photogrammetry With an Unstabilized Phone Camera | Kevin C. Zhou, Colin Cooke, Jaehee Park, Ruobing Qian, Roarke Horstmeyer, Joseph A. Izatt, Sina Farsiu |
1579 | LiDAR R-CNN: An Efficient and Universal 3D Object Detector | Zhichao Li, Feng Wang, Naiyan Wang |
4987 | Monocular 3D Object Detection: An Extrinsic Parameter Free Approach | Yunsong Zhou, Yuan He, Hongzi Zhu, Cheng Wang, Hongyang Li, Qinhong Jiang |
2066 | Beyond Short Clips: End-to-End Video-Level Learning With Collaborative Memories | Xitong Yang, Haoqi Fan, Lorenzo Torresani, Larry S. Davis, Heng Wang |
2514 | Multimodal Motion Prediction With Stacked Transformers | Yicheng Liu, Jinghuai Zhang, Liangji Fang, Qinhong Jiang, Bolei Zhou |
3822 | Weakly Supervised Action Selection Learning in Video | Junwei Ma, Satya Krishna Gorti, Maksims Volkovs, Guangwei Yu |
2236 | BASAR: Black-Box Attack on Skeletal Action Recognition | Yunfeng Diao, Tianjia Shao, Yong-Liang Yang, Kun Zhou, He Wang |
10059 | Adversarial Robustness Across Representation Spaces | Pranjal Awasthi, George Yu, Chun-Sung Ferng, Andrew Tomkins, Da-Cheng Juan |
7277 | img2pose: Face Alignment and Detection via 6DoF, Face Pose Estimation | Vítor Albiero, Xingyu Chen, Xi Yin, Guan Pang, Tal Hassner |
7375 | OSTeC: One-Shot Texture Completion | Baris Gecer, Jiankang Deng, Stefanos Zafeiriou |
2691 | Locally Aware Piecewise Transformation Fields for 3D Human Mesh Registration | Shaofei Wang, Andreas Geiger, Siyu Tang |
2831 | Monocular 3D Multi-Person Pose Estimation by Integrating Top-Down and Bottom-Up Networks | Yu Cheng, Bo Wang, Bo Yang, Robby T. Tan |
3651 | Feature Decomposition and Reconstruction Learning for Effective Facial Expression Recognition | Delian Ruan, Yan Yan, Shenqi Lai, Zhenhua Chai, Chunhua Shen, Hanzi Wang |
227 | SDD-FIQA: Unsupervised Face Image Quality Assessment With Similarity Distribution Distance | Fu-Zhao Ou, Xingyu Chen, Ruixin Zhang, Yuge Huang, Shaoxin Li, Jilin Li, Yong Li, Liujuan Cao, Yuan-Gen Wang |
7423 | Facial Action Unit Detection With Transformers | Geethu Miriam Jacob, Björn Stenger |
2426 | Anchor-Free Person Search | Yichao Yan, Jinpeng Li, Jie Qin, Song Bai, Shengcai Liao, Li Liu, Fan Zhu, Ling Shao |
4173 | Neural Camera Simulators | Hao Ouyang, Zifan Shi, Chenyang Lei, Ka Lung Law, Qifeng Chen |
6987 | Neural Auto-Exposure for High-Dynamic Range Object Detection | Emmanuel Onzon, Fahim Mannan, Felix Heide |
3264 | ARVo: Learning All-Range Volumetric Correspondence for Video Deblurring | Dongxu Li, Chenchen Xu, Kaihao Zhang, Xin Yu, Yiran Zhong, Wenqi Ren, Hanna Suominen, Hongdong Li |
288 | Memory Oriented Transfer Learning for Semi-Supervised Image Deraining | Huaibo Huang, Aijing Yu, Ran He |
11692 | Robust Representation Learning With Feedback for Single Image Deraining | Chenghao Chen, Hao Li |
4408 | A Multi-Task Network for Joint Specular Highlight Detection and Removal | Gang Fu, Qing Zhang, Lei Zhu, Ping Li, Chunxia Xiao |
4827 | Panoramic Image Reflection Removal | Yuchen Hong, Qian Zheng, Lingran Zhao, Xudong Jiang, Alex C. Kot, Boxin Shi |
3051 | Turning Frequency to Resolution: Video Super-Resolution via Event Cameras | Yongcheng Jing, Yiding Yang, Xinchao Wang, Mingli Song, Dacheng Tao |
951 | SRWarp: Generalized Image Super-Resolution under Arbitrary Transformation | Sanghyun Son, Kyoung Mu Lee |
6495 | Learning Scene Structure Guidance via Cross-Task Knowledge Transfer for Single Depth Super-Resolution | Baoli Sun, Xinchen Ye, Baopu Li, Haojie Li, Zhihui Wang, Rui Xu |
7173 | Gated Spatio-Temporal Attention-Guided Video Deblurring | Maitreya Suin, A. N. Rajagopalan |
3908 | Detection, Tracking, and Counting Meets Drones in Crowds: A Benchmark | Longyin Wen, Dawei Du, Pengfei Zhu, Qinghua Hu, Qilong Wang, Liefeng Bo, Siwei Lyu |
10684 | Objectron: A Large Scale Dataset of Object-Centric Videos in the Wild With Pose Annotations | Adel Ahmadyan, Liangkai Zhang, Artsiom Ablavatski, Jianing Wei, Matthias Grundmann |
11470 | Dynamic Domain Adaptation for Efficient Inference | Shuang Li, JinMing Zhang, Wenxuan Ma, Chi Harold Liu, Wei Li |
3225 | General Instance Distillation for Object Detection | Xing Dai, Zeren Jiang, Zhao Wu, Yiping Bao, Zhicheng Wang, Si Liu, Erjin Zhou |
3109 | Data-Free Knowledge Distillation for Image Super-Resolution | Yiman Zhang, Hanting Chen, Xinghao Chen, Yiping Deng, Chunjing Xu, Yunhe Wang |
8145 | Improving Accuracy of Binary Neural Networks Using Unbalanced Activation Distribution | Hyungjun Kim, Jihoon Park, Changhun Lee, Jae-Joon Kim |
7214 | Hijack-GAN: Unintended-Use of Pretrained, Black-Box GANs | Hui-Po Wang, Ning Yu, Mario Fritz |
8263 | Cross Modal Focal Loss for RGBD Face Anti-Spoofing | Anjith George, Sébastien Marcel |
6288 | On the Difficulty of Membership Inference Attacks | Shahbaz Rezaei, Xin Liu |
6705 | Lifelong Person Re-Identification via Adaptive Knowledge Accumulation | Nan Pu, Wei Chen, Yu Liu, Erwin M. Bakker, Michael S. Lew |
1970 | Stereo Radiance Fields (SRF): Learning View Synthesis for Sparse Views of Novel Scenes | Julian Chibane, Aayush Bansal, Verica Lazova, Gerard Pons-Moll |
514 | Regularizing Generative Adversarial Networks Under Limited Data | Hung-Yu Tseng, Lu Jiang, Ce Liu, Ming-Hsuan Yang, Weilong Yang |
7418 | Automatic Correction of Internal Units in Generative Neural Networks | Ali Tousi, Haedong Jeong, Jiyeon Han, Hwanil Choi, Jaesik Choi |
1523 | HistoGAN: Controlling Colors of GAN-Generated and Real Images via Color Histograms | Mahmoud Afifi, Marcus A. Brubaker, Michael S. Brown |
6085 | Prior Based Human Completion | Zibo Zhao, Wen Liu, Yanyu Xu, Xianing Chen, Weixin Luo, Lei Jin, Bohui Zhu, Tong Liu, Binqiang Zhao, Shenghua Gao |
3618 | Diverse Semantic Image Synthesis via Probability Distribution Modeling | Zhentao Tan, Menglei Chai, Dongdong Chen, Jing Liao, Qi Chu, Bin Liu, Gang Hua, Nenghai Yu |
8297 | Adaptive Convolutions for Structure-Aware Style Transfer | Prashanth Chandran, Gaspard Zoss, Paulo Gotardo, Markus Gross, Derek Bradley |
6677 | PISE: Person Image Synthesis and Editing With Decoupled GAN | Jinsong Zhang, Kun Li, Yu-Kun Lai, Jingyu Yang |
4642 | Semi-Supervised Synthesis of High-Resolution Editable Textures for 3D Humans | Bindita Chaudhuri, Nikolaos Sarafianos, Linda Shapiro, Tony Tung |
3444 | CDFI: Compression-Driven Network Design for Frame Interpolation | Tianyu Ding, Luming Liang, Zhihui Zhu, Ilya Zharkov |
7537 | Few-Shot Classification With Feature Map Reconstruction Networks | Davis Wertheimer, Luming Tang, Bharath Hariharan |
5931 | Augmentation Strategies for Learning With Noisy Labels | Kento Nishi, Yi Ding, Alex Rich, Tobias Höllerer |
1454 | Activate or Not: Learning Customized Activation | Ningning Ma, Xiangyu Zhang, Ming Liu, Jian Sun |
10160 | Background Splitting: Finding Rare Classes in a Sea of Background | Ravi Teja Mullapudi, Fait Poms, William R. Mark, Deva Ramanan, Kayvon Fatahalian |
2915 | CLCC: Contrastive Learning for Color Constancy | Yi-Chen Lo, Chia-Che Chang, Hsuan-Chao Chiu, Yu-Hao Huang, Chia-Ping Chen, Yu-Lin Chang, Kevin Jou |
3115 | Dynamic Region-Aware Convolution | Jin Chen, Xijun Wang, Zichao Guo, Xiangyu Zhang, Jian Sun |
5480 | Learning Dynamics via Graph Neural Networks for Human Pose Estimation and Tracking | Yiding Yang, Zhou Ren, Haoxiang Li, Chunluan Zhou, Xinchao Wang, Gang Hua |
10624 | Searching for Fast Model Families on Datacenter Accelerators | Sheng Li, Mingxing Tan, Ruoming Pang, Andrew Li, Liqun Cheng, Quoc V. Le, Norman P. Jouppi |
6563 | Discrete-Continuous Action Space Policy Gradient-Based Attention for Image-Text Matching | Shiyang Yan, Li Yu, Yuan Xie |
8266 | Quantifying Explainers of Graph Neural Networks in Computational Pathology | Guillaume Jaume, Pushpak Pati, Behzad Bozorgtabar, Antonio Foncubierta, Anna Maria Anniciello, Florinda Feroce, Tilman Rau, Jean-Philippe Thiran, Maria Gabrani, Orcun Goksel |
2486 | Forecasting Irreversible Disease via Progression Learning | Botong Wu, Sijie Ren, Jing Li, Xinwei Sun, Shi-Ming Li, Yizhou Wang |
6901 | Transformer Tracking | Xin Chen, Bin Yan, Jiawen Zhu, Dong Wang, Xiaoyun Yang, Huchuan Lu |
6124 | Online Multiple Object Tracking With Cross-Task Synergy | Song Guo, Jingya Wang, Xinchao Wang, Dacheng Tao |
6219 | Learning 3D Shape Feature for Texture-Insensitive Person Re-Identification | Jiaxing Chen, Xinyang Jiang, Fudong Wang, Jun Zhang, Feng Zheng, Xing Sun, Wei-Shi Zheng |
2479 | Regularizing Neural Networks via Adversarial Model Perturbation | Yaowei Zheng, Richong Zhang, Yongyi Mao |
2835 | Task-Aware Variational Adversarial Active Learning | Kwanyoung Kim, Dongwon Park, Kwang In Kim, Se Young Chun |
8179 | VDSM: Unsupervised Video Disentanglement With State-Space Modeling and Deep Mixtures of Experts | Matthew J. Vowels, Necati Cihan Camgoz, Richard Bowden |
2960 | Multi-Target Domain Adaptation With Collaborative Consistency Learning | Takashi Isobe, Xu Jia, Shuaijun Chen, Jianzhong He, Yongjie Shi, Jianzhuang Liu, Huchuan Lu, Shengjin Wang |
6105 | Learning To Relate Depth and Semantics for Unsupervised Domain Adaptation | Suman Saha, Anton Obukhov, Danda Pani Paudel, Menelaos Kanakis, Yuhua Chen, Stamatios Georgoulis, Luc Van Gool |
4346 | Adversarially Adaptive Normalization for Single Domain Generalization | Xinjie Fan, Qifei Wang, Junjie Ke, Feng Yang, Boqing Gong, Mingyuan Zhou |
4189 | Rainbow Memory: Continual Learning With a Memory of Diverse Samples | Jihwan Bang, Heesu Kim, YoungJoon Yoo, Jung-Woo Ha, Jonghyun Choi |
3378 | Asymmetric Metric Learning for Knowledge Transfer | Mateusz Budnik, Yannis Avrithis |
10164 | Scalability vs. Utility: Do We Have To Sacrifice One for the Other in Data Importance Quantification? | Ruoxi Jia, Fan Wu, Xuehui Sun, Jiacen Xu, David Dao, Bhavya Kailkhura, Ce Zhang, Bo Li, Dawn Song |
8027 | Self-Supervised Learning on 3D Point Clouds by Learning Discrete Generative Models | Benjamin Eckart, Wentao Yuan, Chao Liu, Jan Kautz |
843 | Multi-view Depth Estimation using Epipolar Spatio-Temporal Networks | Xiaoxiao Long, Lingjie Liu, Wei Li, Christian Theobalt, Wenping Wang |
9958 | Beyond Image to Depth: Improving Depth Prediction Using Echoes | Kranti Kumar Parida, Siddharth Srivastava, Gaurav Sharma |
3042 | Deeply Shape-Guided Cascade for Instance Segmentation | Hao Ding, Siyuan Qiao, Alan Yuille, Wei Shen |
1130 | Linguistic Structures As Weak Supervision for Visual Scene Graph Generation | Keren Ye, Adriana Kovashka |
1210 | Semantic Segmentation With Generative Models: Semi-Supervised Learning and Strong Out-of-Domain Generalization | Daiqing Li, Junlin Yang, Karsten Kreis, Antonio Torralba, Sanja Fidler |
3575 | Self-Guided and Cross-Guided Learning for Few-Shot Segmentation | Bingfeng Zhang, Jimin Xiao, Terry Qin |
2934 | Scene Essence | Jiayan Qiu, Yiding Yang, Xinchao Wang, Dacheng Tao |
578 | Adaptive Prototype Learning and Allocation for Few-Shot Segmentation | Gen Li, Varun Jampani, Laura Sevilla-Lara, Deqing Sun, Jonghyun Kim, Joongkyu Kim |
305 | Cluster, Split, Fuse, and Update: Meta-Learning for Open Compound Domain Adaptive Semantic Segmentation | Rui Gong, Yuhua Chen, Danda Pani Paudel, Yawei Li, Ajad Chhatkuli, Wen Li, Dengxin Dai, Luc Van Gool |
3383 | Unsupervised Part Segmentation Through Disentangling Appearance and Shape | Shilong Liu, Lei Zhang, Xiao Yang, Hang Su, Jun Zhu |
3751 | Temporal Action Segmentation From Timestamp Supervision | Zhe Li, Yazan Abu Farha, Jürgen Gall |
7251 | RAFT-3D: Scene Flow Using Rigid-Motion Embeddings | Zachary Teed, Jia Deng |
4949 | Coarse-Fine Networks for Temporal Activity Detection in Videos | Kumara Kahatapitiya, Michael S. Ryoo |
2082 | Learning Discriminative Prototypes With Dynamic Time Warping | Xiaobin Chang, Frederick Tung, Greg Mori |
6302 | Learning Dynamic Network Using a Reuse Gate Function in Semi-Supervised Video Object Segmentation | Hyojin Park, Jayeon Yoo, Seohyeong Jeong, Ganesh Venkatesh, Nojun Kwak |
274 | Probabilistic Embeddings for Cross-Modal Retrieval | Sanghyuk Chun, Seong Joon Oh, Rafael Sampaio de Rezende, Yannis Kalantidis, Diane Larlus |
6167 | Towards Bridging Event Captioner and Sentence Localizer for Weakly Supervised Dense Event Captioning | Shaoxiang Chen, Yu-Gang Jiang |
11068 | Positive Sample Propagation Along the Audio-Visual Event Line | Jinxing Zhou, Liang Zheng, Yiran Zhong, Shijie Hao, Meng Wang |
2404 | Embracing Uncertainty: Decoupling and De-Bias for Robust Temporal Grounding | Hao Zhou, Chongyang Zhang, Yan Luo, Yanjun Chen, Chuanping Hu |
2791 | Structured Scene Memory for Vision-Language Navigation | Hanqing Wang, Wenguan Wang, Wei Liang, Caiming Xiong, Jianbing Shen |
1720 | Found a Reason for me? Weakly-supervised Grounded Visual Question Answering using Capsules | Aisha Urooj, Hilde Kuehne, Kevin Duarte, Chuang Gan, Niels Lobo, Mubarak Shah |
8500 | Hierarchical Layout-Aware Graph Convolutional Network for Unified Aesthetics Assessment | Dongyu She, Yu-Kun Lai, Gaoxiong Yi, Kun Xu |
1993 | Parser-Free Virtual Try-On via Distilling Appearance Flows | Yuying Ge, Yibing Song, Ruimao Zhang, Chongjian Ge, Wei Liu, Ping Luo |
7368 | Self-Supervised Simultaneous Multi-Step Prediction of Road Dynamics and Cost Map | Elmira Amirloo, Mohsen Rohani, Ershad Banijamali, Jun Luo, Pascal Poupart |
5096 | StyleMeUp: Towards Style-Agnostic Sketch-Based Image Retrieval | Aneeshan Sain, Ayan Kumar Bhunia, Yongxin Yang, Tao Xiang, Yi-Zhe Song |