MAIN CONFERENCE
All papers will be presented in the same manner. Each paper will have a five minute pre-recorded video and a PDF of the poster. An asynchronous text chat will be available for each paper. Attendees can view the papers and videos on demand at any time. Authors will also have individual Q&A sessions at the posted times below.
All posted times are EDT but the chart linked below has all time zones’ conversions. When the virtual site is up, you will be able to select which sessions you are interested in and it will populate your own schedule.
Presentation Schedule
-
All times are Eastern Daylight Time
Date: Friday, June 25, 2021 6:00– 8:30
Paper Session Twelve:
Paper ID | Paper Title | Authors |
4741 | Deep Polarization Imaging for 3D Shape and SVBRDF Acquisition | Valentin Deschaintre, Yiming Lin, Abhijeet Ghosh |
7791 | Self-Point-Flow: Self-Supervised Scene Flow Estimation From Point Clouds With Optimal Transport and Random Walk | Ruibo Li, Guosheng Lin, Lihua Xie |
1195 | Exploring Data-Efficient 3D Scene Understanding With Contrastive Scene Contexts | Ji Hou, Benjamin Graham, Matthias Nießner, Saining Xie |
3433 | NeuralRecon: Real-Time Coherent 3D Reconstruction From Monocular Video | Jiaming Sun, Yiming Xie, Linghao Chen, Xiaowei Zhou, Hujun Bao |
9966 | FrameExit: Conditional Early Exiting for Efficient Video Recognition | Amir Ghodrati, Babak Ehteshami Bejnordi, Amirhossein Habibian |
5180 | Inverting Generative Adversarial Renderer for Face Reconstruction | Jingtan Piao, Keqiang Sun, Quan Wang, Kwan-Yee Lin, Hongsheng Li |
4453 | Spherical Confidence Learning for Face Recognition | Shen Li, Jianqing Xu, Xiaqing Xu, Pengcheng Shen, Shaoxin Li, Bryan Hooi |
8049 | Event-Based Bispectral Photometry Using Temporally Modulated Illumination | Tsuyoshi Takatani, Yuzuha Ito, Ayaka Ebisu, Yinqiang Zheng, Takahito Aoto |
1565 | AdderSR: Towards Energy Efficient Image Super-Resolution | Dehua Song, Yunhe Wang, Hanting Chen, Chang Xu, Chunjing Xu, Dacheng Tao |
1442 | Diversifying Sample Generation for Accurate Data-Free Quantization | Xiangguo Zhang, Haotong Qin, Yifu Ding, Ruihao Gong, Qinghua Yan, Renshuai Tao, Yuhang Li, Fengwei Yu, Xianglong Liu |
2057 | How Privacy-Preserving Are Line Clouds? Recovering Scene Details From 3D Lines | Kunal Chelani, Fredrik Kahl, Torsten Sattler |
10978 | Unsupervised Learning of Depth and Depth-of-Field Effect From Natural Images With Aperture Rendering Generative Adversarial Networks | Takuhiro Kaneko |
7633 | Stylized Neural Painting | Zhengxia Zou, Tianyang Shi, Shuang Qiu, Yi Yuan, Zhenwei Shi |
4812 | Permute, Quantize, and Fine-Tune: Efficient Compression of Neural Networks | Julieta Martinez, Jashan Shewakramani, Ting Wei Liu, Ioan Andrei Bârsan, Wenyuan Zeng, Raquel Urtasun |
3053 | Amalgamating Knowledge From Heterogeneous Graph Neural Networks | Yongcheng Jing, Yiding Yang, Xinchao Wang, Mingli Song, Dacheng Tao |
4015 | XProtoNet: Diagnosis in Chest Radiography With Global and Local Explanations | Eunji Kim, Siwon Kim, Minji Seo, Sungroh Yoon |
2782 | LQF: Linear Quadratic Fine-Tuning | Alessandro Achille, Aditya Golatkar, Avinash Ravichandran, Marzia Polito, Stefano Soatto |
1934 | DECOR-GAN: 3D Shape Detailization by Conditional Refinement | Zhiqin Chen, Vladimir G. Kim, Matthew Fisher, Noam Aigerman, Hao Zhang, Siddhartha Chaudhuri |
4418 | Exploring Simple Siamese Representation Learning | Xinlei Chen, Kaiming He |
2219 | Camera Pose Matters: Improving Depth Prediction by Mitigating Pose Distribution Bias | Yunhan Zhao, Shu Kong, Charless Fowlkes |
2080 | DANNet: A One-Stage Domain Adaptation Network for Unsupervised Nighttime Semantic Segmentation | Xinyi Wu, Zhenyao Wu, Hao Guo, Lili Ju, Song Wang |
4113 | OTCE: A Transferability Metric for Cross-Domain Cross-Task Representations | Yang Tan, Yang Li, Shao-Lun Huang |
524 | Learning the Best Pooling Strategy for Visual Semantic Embedding | Jiacheng Chen, Hexiang Hu, Hao Wu, Yuning Jiang, Changhu Wang |
10362 | Divide-and-Conquer for Lane-Aware Diverse Trajectory Prediction | Sriram Narayanan, Ramin Moslemi, Francesco Pittaluga, Buyu Liu, Manmohan Chandraker |
4722 | Magic Layouts: Structural Prior for Component Detection in User Interface Designs | Dipu Manandhar, Hailin Jin, John Collomosse |
368 | Dense Label Encoding for Boundary Discontinuity Free Rotation Detection | Xue Yang, Liping Hou, Yue Zhou, Wentao Wang, Junchi Yan |
1273 | MetaHTR: Towards Writer-Adaptive Handwritten Text Recognition | Ayan Kumar Bhunia, Shuvozit Ghose, Amandeep Kumar, Pinaki Nath Chowdhury, Aneeshan Sain, Yi-Zhe Song |
6128 | PSRR-MaxpoolNMS: Pyramid Shifted MaxpoolNMS With Relationship Recovery | Tianyi Zhang, Jie Lin, Peng Hu, Bin Zhao, Mohamed M. Sabry Aly |
5013 | End-to-End Object Detection With Fully Convolutional Network | Jianfeng Wang, Lin Song, Zeming Li, Hongbin Sun, Jian Sun, Nanning Zheng |
5688 | PointDSC: Robust Point Cloud Registration Using Deep Spatial Consistency | Xuyang Bai, Zixin Luo, Lei Zhou, Hongkai Chen, Lei Li, Zeyu Hu, Hongbo Fu, Chiew-Lan Tai |
3328 | Wide-Depth-Range 6D Object Pose Estimation in Space | Yinlin Hu, Sébastien Speierer, Wenzel Jakob, Pascal Fua, Mathieu Salzmann |
8649 | Fostering Generalization in Single-View 3D Reconstruction by Learning a Hierarchy of Local and Global Shape Priors | Jan Bechtold, Maxim Tatarchenko, Volker Fischer, Thomas Brox |
1024 | View-Guided Point Cloud Completion | Xuancheng Zhang, Yutong Feng, Siqi Li, Changqing Zou, Hai Wan, Xibin Zhao, Yandong Guo, Yue Gao |
3200 | Regularization Strategy for Point Cloud via Rigidly Mixed Sample | Dogyoon Lee, Jaeha Lee, Junhyeop Lee, Hyeongmin Lee, Minhyeok Lee, Sungmin Woo, Sangyoun Lee |
6624 | PWCLO-Net: Deep LiDAR Odometry in 3D Point Clouds Using Hierarchical Embedding Mask Optimization | Guangming Wang, Xinrui Wu, Zhe Liu, Hesheng Wang |
5368 | Co-Attention for Conditioned Image Matching | Olivia Wiles, Sébastien Ehrhardt, Andrew Zisserman |
6480 | A Dual Iterative Refinement Method for Non-Rigid Shape Matching | Rui Xiang, Rongjie Lai, Hongkai Zhao |
3192 | NeRD: Neural 3D Reflection Symmetry Detector | Yichao Zhou, Shichen Liu, Yi Ma |
2612 | Deep Lucas-Kanade Homography for Multimodal Image Alignment | Yiming Zhao, Xinming Huang, Ziming Zhang |
1822 | DeepI2P: Image-to-Point Cloud Registration via Deep Classification | Jiaxin Li, Gim Hee Lee |
1997 | NormalFusion: Real-Time Acquisition of Surface Normals for High-Resolution RGB-D Scanning | Hyunho Ha, Joo Ho Lee, Andreas Meuleman, Min H. Kim |
5496 | LASR: Learning Articulated Shape Reconstruction From a Monocular Video | Gengshan Yang, Deqing Sun, Varun Jampani, Daniel Vlasic, Forrester Cole, Huiwen Chang, Deva Ramanan, William T. Freeman, Ce Liu |
10153 | Mirror3D: Depth Refinement for Mirror Surfaces | Jiaqi Tan, Weijie Lin, Angel X. Chang, Manolis Savva |
10259 | To the Point: Efficient 3D Object Detection in the Range Image With Graph Convolution Kernels | Yuning Chai, Pei Sun, Jiquan Ngiam, Weiyue Wang, Benjamin Caine, Vijay Vasudevan, Xiao Zhang, Dragomir Anguelov |
686 | CoLA: Weakly-Supervised Temporal Action Localization With Snippet Contrastive Learning | Can Zhang, Meng Cao, Dongming Yang, Jie Chen, Yuexian Zou |
5629 | MoViNets: Mobile Video Networks for Efficient Video Recognition | Dan Kondratyuk, Liangzhe Yuan, Yandong Li, Li Zhang, Mingxing Tan, Matthew Brown, Boqing Gong |
534 | Few-Shot Transformation of Common Actions Into Time and Space | Pengwan Yang, Pascal Mettes, Cees G. M. Snoek |
7883 | Learning Asynchronous and Sparse Human-Object Interaction in Videos | Romero Morais, Vuong Le, Svetha Venkatesh, Truyen Tran |
6251 | Adversarial Imaging Pipelines | Buu Phan, Fahim Mannan, Felix Heide |
6511 | Adversarial Laser Beam: Effective Physical-World Attack to DNNs in a Blink | Ranjie Duan, Xiaofeng Mao, A. K. Qin, Yuefeng Chen, Shaokai Ye, Yuan He, Yun Yang |
5887 | ViPNAS: Efficient Video Pose Estimation via Neural Architecture Search | Lumin Xu, Yingda Guan, Sheng Jin, Wentao Liu, Chen Qian, Ping Luo, Wanli Ouyang, Xiaogang Wang |
428 | SCALE: Modeling Clothed Humans with a Surface Codec of Articulated Local Elements | Qianli Ma, Shunsuke Saito, Jinlong Yang, Siyu Tang, Michael J. Black |
6228 | Probabilistic 3D Human Shape and Pose Estimation From Multiple Unconstrained Images in the Wild | Akash Sengupta, Ignas Budvytis, Roberto Cipolla |
8832 | Graph Stacked Hourglass Networks for 3D Human Pose Estimation | Tianhan Xu, Wataru Takano |
6885 | High-Fidelity and Arbitrary Face Editing | Yue Gao, Fangyun Wei, Jianmin Bao, Shuyang Gu, Dong Chen, Fang Wen, Zhouhui Lian |
4816 | IronMask: Modular Architecture for Protecting Deep Face Template | Sunpill Kim, Yunseong Jeong, Jinsu Kim, Jungkon Kim, Hyung Tae Lee, Jae Hong Seo |
1198 | Meta-Mining Discriminative Samples for Kinship Verification | Wanhua Li, Shiwei Wang, Jiwen Lu, Jianjiang Feng, Jie Zhou |
5182 | Generalizable Person Re-Identification With Relevance-Aware Mixture of Experts | Yongxing Dai, Xiaotong Li, Jun Liu, Zekun Tong, Ling-Yu Duan |
3301 | Time Lens: Event-Based Video Frame Interpolation | Stepan Tulyakov, Daniel Gehrig, Stamatios Georgoulis, Julius Erbach, Mathias Gehrig, Yuanyou Li, Davide Scaramuzza |
6839 | How To Exploit the Transferability of Learned Image Compression to Conventional Codecs | Jan P. Klopp, Keng-Chi Liu, Liang-Gee Chen, Shao-Yi Chien |
3377 | Pseudo 3D Auto-Correlation Network for Real Image Denoising | Xiaowan Hu, Ruijun Ma, Zhihong Liu, Yuanhao Cai, Xiaole Zhao, Yulun Zhang, Haoqian Wang |
2207 | Ultra-High-Definition Image Dehazing via Multi-Guided Bilateral Learning | Zhuoran Zheng, Wenqi Ren, Xiaochun Cao, Xiaobin Hu, Tao Wang, Fenglong Song, Xiuyi Jia |
4502 | HLA-Face: Joint High-Low Adaptation for Low Light Face Detection | Wenjing Wang, Wenhan Yang, Jiaying Liu |
3345 | Zero-Shot Single Image Restoration Through Controlled Perturbation of Koschmieder’s Model | Aupendu Kar, Sobhan Kanti Dhara, Debashis Sen, Prabir Kumar Biswas |
6576 | Deep Gaussian Scale Mixture Prior for Spectral Compressive Imaging | Tao Huang, Weisheng Dong, Xin Yuan, Jinjian Wu, Guangming Shi |
6893 | Multi-View 3D Reconstruction of a Texture-Less Smooth Surface of Unknown Generic Reflectance | Ziang Cheng, Hongdong Li, Yuta Asano, Yinqiang Zheng, Imari Sato |
2382 | Tackling the Ill-Posedness of Super-Resolution Through Adaptive Target Generation | Younghyun Jo, Seoung Wug Oh, Peter Vajda, Seon Joo Kim |
8381 | Memory-Efficient Network for Large-Scale Video Compressive Sensing | Ziheng Cheng, Bo Chen, Guanliang Liu, Hao Zhang, Ruiying Lu, Zhengjue Wang, Xin Yuan |
6387 | Troubleshooting Blind Image Quality Models in the Wild | Zhihua Wang, Haotao Wang, Tianlong Chen, Zhangyang Wang, Kede Ma |
6737 | UAV-Human: A Large Benchmark for Human Behavior Understanding With Unmanned Aerial Vehicles | Tianjiao Li, Jun Liu, Wei Zhang, Yun Ni, Wenqian Wang, Zhiheng Li |
4956 | FBNetV3: Joint Architecture-Recipe Search Using Predictor Pretraining | Xiaoliang Dai, Alvin Wan, Peizhao Zhang, Bichen Wu, Zijian He, Zhen Wei, Kan Chen, Yuandong Tian, Matthew Yu, Peter Vajda, Joseph E. Gonzalez |
10777 | Domain-Independent Dominance of Adaptive Methods | Pedro Savarese, David McAllester, Sudarshan Babu, Michael Maire |
10552 | Wasserstein Contrastive Representation Distillation | Liqun Chen, Dong Wang, Zhe Gan, Jingjing Liu, Ricardo Henao, Lawrence Carin |
6316 | The Lottery Tickets Hypothesis for Supervised and Self-Supervised Pre-Training in Computer Vision Models | Tianlong Chen, Jonathan Frankle, Shiyu Chang, Sijia Liu, Yang Zhang, Michael Carbin, Zhangyang Wang |
3034 | Generalizing Face Forgery Detection With High-Frequency Features | Yuchen Luo, Yong Zhang, Junchi Yan, Wei Liu |
5703 | CAMERAS: Enhanced Resolution and Sanity Preserving Class Activation Mapping for Image Saliency | Mohammad A. A. K. Jalwana, Naveed Akhtar, Mohammed Bennamoun, Ajmal Mian |
1591 | See Through Gradients: Image Batch Recovery via GradInversion | Hongxu Yin, Arun Mallya, Arash Vahdat, Jose M. Alvarez, Jan Kautz, Pavlo Molchanov |
2586 | Seeking the Shape of Sound: An Adaptive Framework for Learning Voice-Face Association | Peisong Wen, Qianqian Xu, Yangbangyan Jiang, Zhiyong Yang, Yuan He, Qingming Huang |
5248 | Prototype-Supervised Adversarial Network for Targeted Attack of Deep Hashing | Xunguang Wang, Zheng Zhang, Baoyuan Wu, Fumin Shen, Guangming Lu |
6256 | Intrinsic Image Harmonization | Zonghui Guo, Haiyong Zheng, Yufeng Jiang, Zhaorui Gu, Bing Zheng |
4968 | DivCo: Diverse Conditional Image Synthesis via Contrastive Generative Adversarial Network | Rui Liu, Yixiao Ge, Ching Lam Choi, Xiaogang Wang, Hongsheng Li |
2074 | Combining Semantic Guidance and Deep Reinforcement Learning for Generating Human Level Paintings | Jaskirat Singh, Liang Zheng |
6376 | No Shadow Left Behind: Removing Objects and Their Shadows Using Approximate Lighting and Geometry | Edward Zhang, Ricardo Martin-Brualla, Janne Kontkanen, Brian L. Curless |
1129 | The Spatially-Correlative Loss for Various Image Translation Tasks | Chuanxia Zheng, Tat-Jen Cham, Jianfei Cai |
6351 | Unpaired Image-to-Image Translation via Latent Energy Transport | Yang Zhao, Changyou Chen |
2859 | Model-Aware Gesture-to-Gesture Translation | Hezhen Hu, Weilun Wang, Wengang Zhou, Weichao Zhao, Houqiang Li |
3833 | Layout-Guided Novel View Synthesis From a Single Indoor Panorama | Jiale Xu, Jia Zheng, Yanyu Xu, Rui Tang, Shenghua Gao |
2122 | Progressive Temporal Feature Alignment Network for Video Inpainting | Xueyan Zou, Linjie Yang, Ding Liu, Yong Jae Lee |
8234 | On Robustness and Transferability of Convolutional Neural Networks | Josip Djolonga, Jessica Yung, Michael Tschannen, Rob Romijnders, Lucas Beyer, Alexander Kolesnikov, Joan Puigcerver, Matthias Minderer, Alexander D'Amour, Dan Moldovan, Sylvain Gelly, Neil Houlsby, Xiaohua Zhai, Mario Lucic |
6331 | Boosting Ensemble Accuracy by Revisiting Ensemble Diversity Metrics | Yanzhao Wu, Ling Liu, Zhongwei Xie, Ka-Ho Chow, Wenqi Wei |
4378 | General Multi-Label Image Classification With Transformers | Jack Lanchantin, Tianlu Wang, Vicente Ordonez, Yanjun Qi |
741 | Improving Calibration for Long-Tailed Recognition | Zhisheng Zhong, Jiequan Cui, Shu Liu, Jiaya Jia |
937 | Hardness Sampling for Self-Training Based Transductive Zero-Shot Learning | Liu Bo, Qiulei Dong, Zhanyi Hu |
4152 | Saliency-Guided Image Translation | Lai Jiang, Mai Xu, Xiaofei Wang, Leonid Sigal |
10487 | Bottleneck Transformers for Visual Recognition | Aravind Srinivas, Tsung-Yi Lin, Niki Parmar, Jonathon Shlens, Pieter Abbeel, Ashish Vaswani |
3193 | One-Shot Neural Ensemble Architecture Search by Diversity-Guided Search Space Shrinking | Minghao Chen, Jianlong Fu, Haibin Ling |
7842 | Multi-Perspective LSTM for Joint Visual Representation Learning | Alireza Sepas-Moghaddam, Fernando Pereira, Paulo Lobato Correia, Ali Etemad |
5337 | Multiple Instance Captioning: Learning Representations From Histopathology Textbooks and Articles | Jevgenij Gamper, Nasir Rajpoot |
2257 | clDice – A Novel Topology-Preserving Loss Function for Tubular Structure Segmentation | Suprosanna Shit, Johannes C. Paetzold, Anjany Sekuboyina, Ivan Ezhov, Alexander Unger, Andrey Zhylka, Josien P. W. Pluim, Ulrich Bauer, Bjoern H. Menze |
2146 | Learning To Fuse Asymmetric Feature Maps in Siamese Trackers | Wencheng Han, Xingping Dong, Fahad Shahbaz Khan, Ling Shao, Jianbing Shen |
7264 | Track, Check, Repeat: An EM Approach to Unsupervised Tracking | Adam W. Harley, Yiming Zuo, Jing Wen, Ayush Mangal, Shubhankar Potdar, Ritwick Chaudhry, Katerina Fragkiadaki |
3221 | Learning Optical Flow From a Few Matches | Shihao Jiang, Yao Lu, Hongdong Li, Richard Hartley |
5036 | AutoDO: Robust AutoAugment for Biased Data With Label Noise via Scalable Probabilistic Implicit Differentiation | Denis Gudovskiy, Luca Rigazio, Shun Ishizaka, Kazuki Kozuka, Sotaro Tsukizawa |
2415 | GDR-Net: Geometry-Guided Direct Regression Network for Monocular 6D Object Pose Estimation | Gu Wang, Fabian Manhardt, Federico Tombari, Xiangyang Ji |
10200 | Robust and Accurate Object Detection via Adversarial Learning | Xiangning Chen, Cihang Xie, Mingxing Tan, Li Zhang, Cho-Jui Hsieh, Boqing Gong |
2228 | Domain Adaptation With Auxiliary Target Domain-Oriented Classifier | Jian Liang, Dapeng Hu, Jiashi Feng |
3555 | MetaAlign: Coordinating Domain Alignment and Classification for Unsupervised Domain Adaptation | Guoqiang Wei, Cuiling Lan, Wenjun Zeng, Zhibo Chen |
11107 | Partial Feature Selection and Alignment for Multi-Source Domain Adaptation | Yangye Fu, Ming Zhang, Xing Xu, Zuo Cao, Chao Ma, Yanli Ji, Kai Zuo, Huimin Lu |
10645 | MongeNet: Efficient Sampler for Geometric Deep Learning | Léo Lebrat, Rodrigo Santa Cruz, Clinton Fookes, Olivier Salvado |
7483 | Continual Learning via Bit-Level Information Preserving | Yujun Shi, Li Yuan, Yunpeng Chen, Jiashi Feng |
1480 | Believe It or Not: Counterfactual Bias Manipulation in Visual Question Answering | Muli Yang, Cheng Deng, Junchi Yan |
4765 | Propagate Yourself: Exploring Pixel-Level Consistency for Unsupervised Visual Representation Learning | Zhenda Xie, Yutong Lin, Zheng Zhang, Yue Cao, Stephen Lin, Han Hu |
289 | Toward Joint Thing-and-Stuff Mining for Weakly Supervised Panoptic Segmentation | Yunhang Shen, Liujuan Cao, Zhiwei Chen, Feihong Lian, Baochang Zhang, Chi Su, Yongjian Wu, Feiyue Huang, Rongrong Ji |
4720 | S3: Learnable Sparse Signal Superdensity for Guided Depth Estimation | Yu-Kai Huang, Yueh-Cheng Liu, Tsung-Han Wu, Hung-Ting Su, Yu-Cheng Chang, Tsung-Lin Tsou, Yu-An Wang, Winston H. Hsu |
1394 | Residential Floor Plan Recognition and Reconstruction | Xiaolei Lv, Shengchu Zhao, Xinyang Yu, Binqiang Zhao |
8148 | ColorRL: Reinforced Coloring for End-to-End Instance Segmentation | Tran Anh Tuan, Nguyen Tuan Khoa, Tran Minh Quan, Won-Ki Jeong |
4585 | Harmonious Semantic Line Detection via Maximal Weight Clique Selection | Dongkwon Jin, Wonhui Park, Seong-Gyun Jeong, Chang-Su Kim |
2403 | LPSNet: A Lightweight Solution for Fast Panoptic Segmentation | Weixiang Hong, Qingpei Guo, Wei Zhang, Jingdong Chen, Wei Chu |
5077 | Progressive Semantic Segmentation | Chuong Huynh, Anh Tuan Tran, Khoa Luu, Minh Hoai |
3554 | Embedded Discriminative Attention Mechanism for Weakly Supervised Semantic Segmentation | Tong Wu, Junshi Huang, Guangyu Gao, Xiaoming Wei, Xiaolin Wei, Xuan Luo, Chi Harold Liu |
1504 | SelfSAGCN: Self-Supervised Semantic Alignment for Graph Convolution Network | Xu Yang, Cheng Deng, Zhiyuan Dang, Kun Wei, Junchi Yan |
7255 | Wasserstein Barycenter for Multi-Source Domain Adaptation | Eduardo Fernandes Montesuma, Fred Maurice Ngolè Mboula |
7574 | PiCIE: Unsupervised Semantic Segmentation Using Invariance and Equivariance in Clustering | Jang Hyun Cho, Utkarsh Mall, Kavita Bala, Bharath Hariharan |
1360 | Global2Local: Efficient Structure Search for Video Action Segmentation | Shang-Hua Gao, Qi Han, Zhong-Yu Li, Pai Peng, Liang Wang, Ming-Ming Cheng |
6468 | Introvert: Human Trajectory Prediction via Conditional 3D Attention | Nasim Shafiee, Taskin Padir, Ehsan Elhamifar |
766 | Weakly Supervised Video Salient Object Detection | Wangbo Zhao, Jing Zhang, Long Li, Nick Barnes, Nian Liu, Junwei Han |
3801 | Video Object Segmentation Using Global and Instance Embedding Learning | Wenbin Ge, Xiankai Lu, Jianbing Shen |
6940 | Human-Like Controllable Image Captioning With Verb-Specific Semantic Roles | Long Chen, Zhihong Jiang, Jun Xiao, Wei Liu |
1204 | Read and Attend: Temporal Localisation in Sign Language Videos | Gül Varol, Liliane Momeni, Samuel Albanie, Triantafyllos Afouras, Andrew Zisserman |
1283 | Localizing Visual Sounds the Hard Way | Honglie Chen, Weidi Xie, Triantafyllos Afouras, Arsha Nagrani, Andrea Vedaldi, Andrew Zisserman |
4377 | Look Before You Speak: Visually Contextualized Utterances | Paul Hongsuck Seo, Arsha Nagrani, Cordelia Schmid |
5906 | Look Before You Leap: Learning Landmark Features for One-Stage Visual Grounding | Binbin Huang, Dongze Lian, Weixin Luo, Shenghua Gao |
11061 | Visual Navigation With Spatial Attention | Bar Mayo, Tamir Hazan, Ayellet Tal |
10634 | Perception Matters: Detecting Perception Failures of VQA Models Using Metamorphic Testing | Yuanyuan Yuan, Shuai Wang, Mingyue Jiang, Tsong Yueh Chen |
5224 | Pixel-Wise Anomaly Detection in Complex Driving Scenes | Giancarlo Di Biase, Hermann Blum, Roland Siegwart, Cesar Cadena |
6703 | Disentangled Cycle Consistency for Highly-Realistic Virtual Try-On | Chongjian Ge, Yibing Song, Yuying Ge, Han Yang, Wei Liu, Ping Luo |
8750 | Hyperdimensional Computing as a Framework for Systematic Aggregation of Image Descriptors | Peer Neubert, Stefan Schubert |