MAIN CONFERENCE
All papers will be presented in the same manner. Each paper will have a five-minute pre-recorded video and a PDF of the poster. An asynchronous text chat will be available for each paper. Attendees can view the papers and videos on demand at any time. Authors will also hold individual Q&A sessions at the times posted below.
All posted times are in Eastern Daylight Time (EDT); the chart linked below gives conversions to other time zones. When the virtual site is up, you will be able to select the sessions you are interested in, and the site will populate your personal schedule.
Presentation Schedule
All times are Eastern Daylight Time
Date: Monday, June 21, 2021 11:00 – 13:30
Paper Session One:
Paper ID | Paper Title | Authors |
2364 | Single-Stage Instance Shadow Detection With Bidirectional Relation Learning | Tianyu Wang, Xiaowei Hu, Chi-Wing Fu, Pheng-Ann Heng |
5745 | DeepMetaHandles: Learning Deformation Meta-Handles of 3D Meshes With Biharmonic Coordinates | Minghua Liu, Minhyuk Sung, Radomir Mech, Hao Su |
9986 | Learning Delaunay Surface Elements for Mesh Reconstruction | Marie-Julie Rakotosaona, Paul Guerrero, Noam Aigerman, Niloy J. Mitra, Maks Ovsjanikov |
1678 | Fusing the Old With the New: Learning Relative Camera Pose With Geometry-Guided Uncertainty | Bingbing Zhuang, Manmohan Chandraker |
3609 | Skeleton Merger: An Unsupervised Aligned Keypoint Detector | Ruoxi Shi, Zhengrong Xue, Yang You, Cewu Lu |
711 | Uncertainty Guided Collaborative Training for Weakly Supervised Temporal Action Detection | Wenfei Yang, Tianzhu Zhang, Xiaoyuan Yu, Tian Qi, Yongdong Zhang, Feng Wu |
7701 | Pixel Codec Avatars | Shugao Ma, Tomas Simon, Jason Saragih, Dawei Wang, Yuecheng Li, Fernando De la Torre, Yaser Sheikh |
8522 | HOTR: End-to-End Human-Object Interaction Detection With Transformers | Bumsoo Kim, Junhyun Lee, Jaewoo Kang, Eun-Sol Kim, Hyunwoo J. Kim |
8686 | Tuning IR-Cut Filter for Illumination-Aware Spectral Reconstruction From RGB | Bo Sun, Junchi Yan, Xiao Zhou, Yinqiang Zheng |
7447 | DeFlow: Learning Complex Image Degradations From Unpaired Data With Conditional Flows | Valentin Wolf, Andreas Lugmayr, Martin Danelljan, Luc Van Gool, Radu Timofte |
2372 | AQD: Towards Accurate Quantized Object Detection | Peng Chen, Jing Liu, Bohan Zhuang, Mingkui Tan, Chunhua Shen |
3665 | Privacy-Preserving Collaborative Learning With Automatic Transformation Search | Wei Gao, Shangwei Guo, Tianwei Zhang, Han Qiu, Yonggang Wen, Yang Liu |
1300 | Rethinking and Improving the Robustness of Image Style Transfer | Pei Wang, Yijun Li, Nuno Vasconcelos |
11563 | Style-Aware Normalized Loss for Improving Arbitrary Style Transfer | Jiaxin Cheng, Ayush Jaiswal, Yue Wu, Pradeep Natarajan, Prem Natarajan |
2453 | Faster Meta Update Strategy for Noise-Robust Deep Learning | Youjiang Xu, Linchao Zhu, Lu Jiang, Yi Yang |
3302 | A Hyperbolic-to-Hyperbolic Graph Convolutional Network | Jindou Dai, Yuwei Wu, Zhi Gao, Yunde Jia |
1542 | Quasi-Dense Similarity Learning for Multiple Object Tracking | Jiangmiao Pang, Linlu Qiu, Xia Li, Haofeng Chen, Qi Li, Trevor Darrell, Fisher Yu |
7579 | MetricOpt: Learning To Optimize Black-Box Evaluation Metrics | Chen Huang, Shuangfei Zhai, Pengsheng Guo, Josh Susskind |
656 | Training Networks in Null Space of Feature Covariance for Continual Learning | Shipeng Wang, Xiaorong Li, Jian Sun, Zongben Xu |
5080 | Exponential Moving Average Normalization for Self-Supervised and Semi-Supervised Learning | Zhaowei Cai, Avinash Ravichandran, Subhransu Maji, Charless Fowlkes, Zhuowen Tu, Stefano Soatto |
3235 | Learning To Recover 3D Scene Shape From a Single Image | Wei Yin, Jianming Zhang, Oliver Wang, Simon Niklaus, Long Mai, Simon Chen, Chunhua Shen |
3790 | Fully Convolutional Networks for Panoptic Segmentation | Yanwei Li, Hengshuang Zhao, Xiaojuan Qi, Liwei Wang, Zeming Li, Jian Sun, Jiaya Jia |
8040 | Progressive Domain Expansion Network for Single Domain Generalization | Lei Li, Ke Gao, Juan Cao, Ziyao Huang, Yepeng Weng, Xiaoyue Mi, Zhengze Yu, Xiaoya Li, Boyang Xia |
3550 | Sketch, Ground, and Refine: Top-Down Dense Video Captioning | Chaorui Deng, Shizhe Chen, Da Chen, Yuan He, Qi Wu |
4325 | Shared Cross-Modal Trajectory Prediction for Autonomous Driving | Chiho Choi, Joon Hee Choi, Jiachen Li, Srikanth Malla |
2180 | Glancing at the Patch: Anomaly Localization With Global and Local Feature Comparison | Shenzhi Wang, Liwei Wu, Lei Cui, Yujun Shen |
6945 | RankDetNet: Delving Into Ranking Constraints for Object Detection | Ji Liu, Dong Li, Rongzhang Zheng, Lu Tian, Yi Shan |
2208 | GAIA: A Transfer Learning System of Object Detection That Fits Your Needs | Xingyuan Bu, Junran Peng, Junjie Yan, Tieniu Tan, Zhaoxiang Zhang |
1958 | Primitive Representation Learning for Scene Text Recognition | Ruijie Yan, Liangrui Peng, Shanyu Xiao, Gang Yao |
7161 | Keep Your Eyes on the Lane: Real-Time Attention-Guided Lane Detection | Lucas Tabelini, Rodrigo Berriel, Thiago M. Paixão, Claudine Badue, Alberto F. De Souza, Thiago Oliveira-Santos |
5021 | OTA: Optimal Transport Assignment for Object Detection | Zheng Ge, Songtao Liu, Zeming Li, Osamu Yoshie, Jian Sun |
5926 | StickyPillars: Robust and Efficient Feature Matching on Point Clouds Using Graph Neural Networks | Kai Fischer, Martin Simon, Florian Ölsner, Stefan Milz, Horst-Michael Groß, Patrick Mäder |
4293 | Semantic Scene Completion via Integrating Instances and Scene In-the-Loop | Yingjie Cai, Xuesong Chen, Chao Zhang, Kwan-Yee Lin, Xiaogang Wang, Hongsheng Li |
10588 | Holistic 3D Human and Scene Mesh Estimation From Single View Images | Zhenzhen Weng, Serena Yeung |
1120 | Point Cloud Upsampling via Disentangled Refinement | Ruihui Li, Xianzhi Li, Pheng-Ann Heng, Chi-Wing Fu |
3453 | DyCo3D: Robust Instance Segmentation of 3D Point Clouds Through Dynamic Convolution | Tong He, Chunhua Shen, Anton van den Hengel |
6746 | HCRF-Flow: Scene Flow From Point Clouds With Continuous High-Order CRFs and Position-Aware Flow Embedding | Ruibo Li, Guosheng Lin, Tong He, Fayao Liu, Chunhua Shen |
5444 | Iso-Points: Optimizing Neural Implicit Surfaces With Hybrid Representations | Wang Yifan, Shihao Wu, Cengiz Öztireli, Olga Sorkine-Hornung |
10374 | Fast Sinkhorn Filters: Using Matrix Scaling for Non-Rigid Shape Correspondence With Functional Maps | Gautam Pai, Jing Ren, Simone Melzi, Peter Wonka, Maks Ovsjanikov |
5280 | Globally Optimal Relative Pose Estimation With Gravity Prior | Yaqing Ding, Daniel Barath, Jian Yang, Hui Kong, Zuzana Kukelova |
555 | Discovering Relationships Between Object Categories via Universal Canonical Maps | Natalia Neverova, Artsiom Sanakoyeu, Patrick Labatut, David Novotny, Andrea Vedaldi |
2202 | Neural Reprojection Error: Merging Feature Learning and Camera Pose Estimation | Hugo Germain, Vincent Lepetit, Guillaume Bourmaud |
2029 | Rotation-Only Bundle Adjustment | Seong Hun Lee, Javier Civera |
10524 | PAUL: Procrustean Autoencoder for Unsupervised Lifting | Chaoyang Wang, Simon Lucey |
2179 | Robust Multimodal Vehicle Detection in Foggy Weather Using Complementary Lidar and Radar Signals | Kun Qian, Shilin Zhu, Xinyu Zhang, Li Erran Li |
943 | Depth-Conditioned Dynamic Message Propagation for Monocular 3D Object Detection | Li Wang, Liang Du, Xiaoqing Ye, Yanwei Fu, Guodong Guo, Xiangyang Xue, Jianfeng Feng, Li Zhang |
1583 | Actor-Context-Actor Relation Network for Spatio-Temporal Action Localization | Junting Pan, Siyu Chen, Mike Zheng Shou, Yu Liu, Jing Shao, Hongsheng Li |
6171 | Temporal-Relational CrossTransformers for Few-Shot Action Recognition | Toby Perrett, Alessandro Masullo, Tilo Burghardt, Majid Mirmehdi, Dima Damen |
708 | Temporal Context Aggregation Network for Temporal Action Proposal Refinement | Zhiwu Qing, Haisheng Su, Weihao Gan, Dongliang Wang, Wei Wu, Xiang Wang, Yu Qiao, Junjie Yan, Changxin Gao, Nong Sang |
8560 | Affordance Transfer Learning for Human-Object Interaction Detection | Zhi Hou, Baosheng Yu, Yu Qiao, Xiaojiang Peng, Dacheng Tao |
6792 | Achieving Robustness in Classification Using Optimal Transport With Hinge Regularization | Mathieu Serrurier, Franck Mamalet, Alberto González-Sanz, Thibaut Boissin, Jean-Michel Loubes, Eustasio del Barrio |
7308 | Over-the-Air Adversarial Flickering Attacks Against Video Recognition Networks | Roi Pony, Itay Naeh, Shie Mannor |
6101 | Deep Dual Consecutive Network for Human Pose Estimation | Zhenguang Liu, Haoming Chen, Runyang Feng, Shuang Wu, Shouling Ji, Bailin Yang, Xun Wang |
707 | StereoPIFu: Depth Aware Clothed Human Digitization via Stereo Vision | Yang Hong, Juyong Zhang, Boyi Jiang, Yudong Guo, Ligang Liu, Hujun Bao |
901 | Body Meshes as Points | Jianfeng Zhang, Dongdong Yu, Jun Hao Liew, Xuecheng Nie, Jiashi Feng |
350 | Cross-View Cross-Scene Multi-View Crowd Counting | Qi Zhang, Wei Lin, Antoni B. Chan |
7372 | GANmut: Learning Interpretable Conditional Space for Gamut of Emotions | Stefano d'Apolito, Danda Pani Paudel, Zhiwu Huang, Andrés Romero, Luc Van Gool |
5287 | Consistent Instance False Positive Improves Fairness in Face Recognition | Xingkun Xu, Yuge Huang, Pengcheng Shen, Shaoxin Li, Jilin Li, Feiyue Huang, Yong Li, Zhen Cui |
233 | Neural Feature Search for RGB-Infrared Person Re-Identification | Yehansen Chen, Lin Wan, Zhihang Li, Qianyan Jing, Zongyuan Sun |
5953 | Coarse-To-Fine Person Re-Identification With Auxiliary-Domain Classification and Second-Order Information Bottleneck | Anguo Zhang, Yueming Gao, Yuzhen Niu, Wenxi Liu, Yongcheng Zhou |
3757 | EvDistill: Asynchronous Events To End-Task Learning via Bidirectional Reconstruction-Guided Cross-Modal Knowledge Distillation | Lin Wang, Yujeong Chae, Sung-Hoon Yoon, Tae-Kyun Kim, Kuk-Jin Yoon |
7525 | iVPF: Numerical Invertible Volume Preserving Flow for Efficient Lossless Compression | Shifeng Zhang, Chen Zhang, Ning Kang, Zhenguo Li |
3703 | Deep Convolutional Dictionary Learning for Image Denoising | Hongyi Zheng, Hongwei Yong, Lei Zhang |
2429 | Semi-Supervised Video Deraining With Dynamical Rain Generator | Zongsheng Yue, Jianwen Xie, Qian Zhao, Deyu Meng |
5340 | PPR10K: A Large-Scale Portrait Photo Retouching Dataset With Human-Region Mask and Group-Level Consistency | Jie Liang, Hui Zeng, Miaomiao Cui, Xuansong Xie, Lei Zhang |
3719 | Removing Diffraction Image Artifacts in Under-Display Camera via Dynamic Skip Connection Network | Ruicheng Feng, Chongyi Li, Huaijin Chen, Shuai Li, Chen Change Loy, Jinwei Gu |
6983 | GAN Prior Embedded Network for Blind Face Restoration in the Wild | Tao Yang, Peiran Ren, Xuansong Xie, Lei Zhang |
7591 | Polarimetric Normal Stereo | Yoshiki Fukao, Ryo Kawahara, Shohei Nobuhara, Ko Nishino |
2384 | Practical Single-Image Super-Resolution Using Look-Up Table | Younghyun Jo, Seon Joo Kim |
10569 | Deep Learning in Latent Space for Video Prediction and Compression | Bowen Liu, Yu Chen, Shiyu Liu, Hun-Seok Kim |
11775 | Debiased Subjective Assessment of Real-World Image Enhancement | Peibei Cao, Zhangyang Wang, Kede Ma |
6843 | BABEL: Bodies, Action and Behavior With English Labels | Abhinanda R. Punnakkal, Arjun Chandrasekaran, Nikos Athanasiou, Alejandra Quirós-Ramírez, Michael J. Black |
5043 | Rethinking Channel Dimensions for Efficient Model Design | Dongyoon Han, Sangdoo Yun, Byeongho Heo, YoungJoon Yoo |
11193 | Automated Log-Scale Quantization for Low-Cost Deep Neural Networks | Sangyun Oh, Hyeonuk Sim, Sugil Lee, Jongeun Lee |
11437 | ProSelfLC: Progressive Self Label Correction for Training Robust Deep Neural Networks | Xinshao Wang, Yang Hua, Elyor Kodirov, David A. Clifton, Neil M. Robertson |
10479 | The Lottery Ticket Hypothesis for Object Recognition | Sharath Girish, Shishira R Maiya, Kamal Gupta, Hao Chen, Larry S. Davis, Abhinav Shrivastava |
3707 | Spatial-Phase Shallow Learning: Rethinking Face Forgery Detection in Frequency Domain | Honggu Liu, Xiaodan Li, Wenbo Zhou, Yuefeng Chen, Yuan He, Hui Xue, Weiming Zhang, Nenghai Yu |
6064 | Transformer Interpretability Beyond Attention Visualization | Hila Chefer, Shir Gur, Lior Wolf |
2988 | Mixed-Privacy Forgetting in Deep Networks | Aditya Golatkar, Alessandro Achille, Avinash Ravichandran, Marzia Polito, Stefano Soatto |
2635 | CoSMo: Content-Style Modulation for Image Retrieval With Text Feedback | Seungmin Lee, Dongwan Kim, Bohyung Han |
7836 | Discrimination-Aware Mechanism for Fine-Grained Representation Learning | Furong Xu, Meng Wang, Wei Zhang, Yuan Cheng, Wei Chu |
6303 | Dual Contradistinctive Generative Autoencoder | Gaurav Parmar, Dacheng Li, Kwonjoon Lee, Zhuowen Tu |
4981 | Cross-Modal Contrastive Learning for Text-to-Image Generation | Han Zhang, Jing Yu Koh, Jason Baldridge, Honglak Lee, Yinfei Yang |
644 | Bridging the Visual Gap: Wide-Range Image Blending | Chia-Ni Lu, Ya-Chu Chang, Wei-Chen Chiu |
11462 | Exploiting Spatial Dimensions of Latent in GAN for Real-Time Image Editing | Hyunsu Kim, Yunjey Choi, Junho Kim, Sungjoo Yoo, Youngjung Uh |
1398 | ArtFlow: Unbiased Image Style Transfer via Reversible Neural Flows | Jie An, Siyu Huang, Yibing Song, Dejing Dou, Wei Liu, Jiebo Luo |
6872 | DualAST: Dual Style-Learning Networks for Artistic Style Transfer | Haibo Chen, Lei Zhao, Zhizhong Wang, Huiming Zhang, Zhiwen Zuo, Ailin Li, Wei Xing, Dongming Lu |
2885 | Single-Shot Freestyle Dance Reenactment | Oran Gafni, Oron Ashual, Lior Wolf |
5794 | SceneGen: Learning To Generate Realistic Traffic Scenes | Shuhan Tan, Kelvin Wong, Shenlong Wang, Sivabalan Manivasagam, Mengye Ren, Raquel Urtasun |
2642 | Learning Semantic-Aware Dynamics for Video Prediction | Xinzhu Bei, Yanchao Yang, Stefano Soatto |
2367 | Reinforced Attention for Few-Shot Learning and Beyond | Jie Hong, Pengfei Fang, Weihao Li, Tong Zhang, Christian Simon, Mehrtash Harandi, Lars Petersson |
5620 | Fast and Accurate Model Scaling | Piotr Dollár, Mannat Singh, Ross Girshick |
10021 | Multi-Label Learning From Single Positive Labels | Elijah Cole, Oisin Mac Aodha, Titouan Lorieul, Pietro Perona, Dan Morris, Nebojsa Jojic |
1950 | Contrastive Learning Based Hybrid Networks for Long-Tailed Image Classification | Peng Wang, Kai Han, Xiu-Shen Wei, Lei Zhang, Lei Wang |
1098 | Learning Graph Embeddings for Compositional Zero-Shot Learning | Muhammad Ferjad Naeem, Yongqin Xian, Federico Tombari, Zeynep Akata |
4080 | Multispectral Photometric Stereo for Spatially-Varying Spectral Reflectances: A Well Posed Problem? | Heng Guo, Fumio Okura, Boxin Shi, Takuya Funatomi, Yasuhiro Mukaigawa, Yasuyuki Matsushita |
6708 | LiBRe: A Practical Bayesian Approach to Adversarial Detection | Zhijie Deng, Xiao Yang, Shizhen Xu, Hang Su, Jun Zhu |
3487 | Searching by Generating: Flexible and Efficient One-Shot NAS With Architecture Generator | Sian-Yao Huang, Wei-Ta Chu |
11676 | Densely Connected Multi-Dilated Convolutional Networks for Dense Prediction Tasks | Naoya Takahashi, Yuki Mitsufuji |
5808 | DARCNN: Domain Adaptive Region-Based Convolutional Neural Network for Unsupervised Instance Segmentation in Biomedical Images | Joy Hsu, Wah Chiu, Serena Yeung |
2534 | FedDG: Federated Domain Generalization on Medical Image Segmentation via Episodic Learning in Continuous Frequency Space | Quande Liu, Cheng Chen, Jing Qin, Qi Dou, Pheng-Ann Heng |
2349 | Distractor-Aware Fast Tracking via Dynamic Convolutions and MOT Philosophy | Zikai Zhang, Bineng Zhong, Shengping Zhang, Zhenjun Tang, Xin Liu, Zhaoxiang Zhang |
5053 | Mining Better Samples for Contrastive Learning of Temporal Correspondence | Sangryul Jeon, Dongbo Min, Seungryong Kim, Kwanghoon Sohn |
4405 | UPFlow: Upsampling Pyramid for Unsupervised Optical Flow Learning | Kunming Luo, Chuan Wang, Shuaicheng Liu, Haoqiang Fan, Jue Wang, Jian Sun |
7376 | KeepAugment: A Simple Information-Preserving Data Augmentation Approach | Chengyue Gong, Dilin Wang, Meng Li, Vikas Chandra, Qiang Liu |
6966 | Keypoint-Graph-Driven Learning Framework for Object Pose Estimation | Shaobo Zhang, Wanqing Zhao, Ziyu Guan, Xianlin Peng, Jinye Peng |
2536 | AdCo: Adversarial Contrast for Efficient Learning of Unsupervised Representations From Self-Trained Negative Adversaries | Qianjiang Hu, Xiao Wang, Wei Hu, Guo-Jun Qi |
2474 | Generalized Domain Adaptation | Yu Mitsuzumi, Go Irie, Daiki Ikami, Takashi Shibata |
4071 | FixBi: Bridging Domain Spaces for Unsupervised Domain Adaptation | Jaemin Na, Heechul Jung, Hyung Jin Chang, Wonjun Hwang |
11397 | Learning Invariant Representations and Risks for Semi-Supervised Domain Adaptation | Bo Li, Yezhen Wang, Shanghang Zhang, Dongsheng Li, Kurt Keutzer, Trevor Darrell, Han Zhao |
1906 | Continual Semantic Segmentation via Repulsion-Attraction of Sparse and Disentangled Latent Representations | Umberto Michieli, Pietro Zanuttigh |
10436 | Incremental Learning via Rate Reduction | Ziyang Wu, Christina Baek, Chong You, Yi Ma |
4186 | Partially View-Aligned Representation Learning With Noise-Robust Contrastive Loss | Mouxing Yang, Yunfan Li, Zhenyu Huang, Zitao Liu, Peng Hu, Xi Peng |
5671 | Spatially Consistent Representation Learning | Byungseok Roh, Wuhyun Shin, Ildoo Kim, Sungwoong Kim |
592 | Mask Guided Matting via Progressive Refinement Network | Qihang Yu, Jianming Zhang, He Zhang, Yilin Wang, Zhe Lin, Ning Xu, Yutong Bai, Alan Yuille |
4851 | The Temporal Opportunist: Self-Supervised Multi-Frame Monocular Depth | Jamie Watson, Oisin Mac Aodha, Victor Prisacariu, Gabriel Brostow, Michael Firman |
1361 | Exemplar-Based Open-Set Panoptic Segmentation Network | Jaedong Hwang, Seoung Wug Oh, Joon-Young Lee, Bohyung Han |
10320 | Incremental Few-Shot Instance Segmentation | Dan Andrei Ganea, Bas Boom, Ronald Poppe |
398 | DoDNet: Learning To Segment Multi-Organ and Tumors From Multiple Partially Labeled Datasets | Jianpeng Zhang, Yutong Xie, Yong Xia, Chunhua Shen |
2529 | Semi-Supervised Semantic Segmentation With Directional Context-Aware Consistency | Xin Lai, Zhuotao Tian, Li Jiang, Shu Liu, Hengshuang Zhao, Liwei Wang, Jiaya Jia |
5229 | Source-Free Domain Adaptation for Semantic Segmentation | Yuang Liu, Wei Zhang, Jun Wang |
4002 | Learning the Superpixel in a Non-Iterative and Lifelong Manner | Lei Zhu, Qi She, Bin Zhang, Yanye Lu, Zhilin Lu, Duo Li, Jie Hu |
2335 | ATSO: Asynchronous Teacher-Student Optimization for Semi-Supervised Image Segmentation | Xinyue Huo, Lingxi Xie, Jianzhong He, Zijie Yang, Wengang Zhou, Houqiang Li, Qi Tian |
7507 | Can We Characterize Tasks Without Labels or Features? | Bram Wallace, Ziyang Wu, Bharath Hariharan |
8301 | Reconsidering Representation Alignment for Multi-View Clustering | Daniel J. Trosten, Sigurd Løkse, Robert Jenssen, Michael Kampffmeyer |
5491 | Learning To Segment Rigid Motions From Two Frames | Gengshan Yang, Deva Ramanan |
692 | Self-Supervised Motion Learning From Static Images | Ziyuan Huang, Shiwei Zhang, Jianwen Jiang, Mingqian Tang, Rong Jin, Marcelo H. Ang |
1558 | Efficient Regional Memory Network for Video Object Segmentation | Haozhe Xie, Hongxun Yao, Shangchen Zhou, Shengping Zhang, Wenxiu Sun |
3953 | SwiftNet: Real-Time Video Object Segmentation | Haochen Wang, Xiaolong Jiang, Haibing Ren, Yao Hu, Song Bai |
10268 | Improving OCR-Based Image Captioning by Incorporating Geometrical Relationship | Jing Wang, Jinhui Tang, Mingkun Yang, Xiang Bai, Jiebo Luo |
1525 | Improving Sign Language Translation With Monolingual Data by Sign Back-Translation | Hao Zhou, Wengang Zhou, Weizhen Qi, Junfu Pu, Houqiang Li |
1603 | Exploring Heterogeneous Clues for Weakly-Supervised Audio-Visual Video Parsing | Yu Wu, Yi Yang |
8136 | Looking Into Your Speech: Learning Cross-Modal Affinity for Audio-Visual Speech Separation | Jiyoung Lee, Soo-Whan Chung, Sunok Kim, Hong-Goo Kang, Kwanghoon Sohn |
6406 | Co-Grounding Networks With Semantic Attention for Referring Expression Comprehension in Videos | Sijie Song, Xudong Lin, Jiaying Liu, Zongming Guo, Shih-Fu Chang |
683 | Explicit Knowledge Incorporation for Visual Reasoning | Yifeng Zhang, Ming Jiang, Qi Zhao |
978 | Deep Gradient Projection Networks for Pan-sharpening | Shuang Xu, Jiangshe Zhang, Zixiang Zhao, Kai Sun, Junmin Liu, Chunxia Zhang |
5425 | Capturing Omni-Range Context for Omnidirectional Segmentation | Kailun Yang, Jiaming Zhang, Simon Reiß, Xinxin Hu, Rainer Stiefelhagen |
1302 | Gradient-Based Algorithms for Machine Teaching | Pei Wang, Kabir Nagrecha, Nuno Vasconcelos |
1159 | Depth From Camera Motion and Object Detection | Brent A. Griffin, Jason J. Corso |
Date: Monday, June 21, 2021 22:00 – 24:30
Paper Session Two:
Paper ID | Paper Title | Authors |
1567 | Deep RGB-D Saliency Detection With Depth-Sensitive Attention and Automatic Multi-Modal Fusion | Peng Sun, Wenhu Zhang, Huanyu Wang, Songyuan Li, Xi Li |
5860 | SAIL-VOS 3D: A Synthetic Dataset and Baselines for Object Detection and 3D Mesh Reconstruction From Video Data | Yuan-Ting Hu, Jiahong Wang, Raymond A. Yeh, Alexander G. Schwing |
944 | Deep Implicit Templates for 3D Shape Representation | Zerong Zheng, Tao Yu, Qionghai Dai, Yebin Liu |
3893 | Pulsar: Efficient Sphere-Based Neural Rendering | Christoph Lassner, Michael Zollhöfer |
4796 | Neural Deformation Graphs for Globally-Consistent Non-Rigid Reconstruction | Aljaž Božič, Pablo Palafox, Michael Zollhöfer, Justus Thies, Angela Dai, Matthias Nießner |
6299 | Modeling Multi-Label Action Dependencies for Temporal Action Localization | Praveen Tirupattur, Kevin Duarte, Yogesh S Rawat, Mubarak Shah |
3479 | ContactOpt: Optimizing Contact To Improve Grasps | Patrick Grady, Chengcheng Tang, Christopher D. Twigg, Minh Vo, Samarth Brahmbhatt, Charles C. Kemp |
6014 | From Synthetic to Real: Unsupervised Domain Adaptation for Animal Pose Estimation | Chen Li, Gim Hee Lee |
6772 | Deep Homography for Efficient Stereo Image Compression | Xin Deng, Wenzhe Yang, Ren Yang, Mai Xu, Enpeng Liu, Qianhan Feng, Radu Timofte |
3985 | FVC: A New Framework Towards Deep Video Compression in Feature Space | Zhihao Hu, Guo Lu, Dong Xu |
5222 | Zero-Shot Adversarial Quantization | Yuang Liu, Wei Zhang, Jun Wang |
1456 | Farewell to Mutual Information: Variational Distillation for Cross-Modal Person Re-Identification | Xudong Tian, Zhizhong Zhang, Shaohui Lin, Yanyun Qu, Yuan Xie, Lizhuang Ma |
1478 | Closed-Form Factorization of Latent Semantics in GANs | Yujun Shen, Bolei Zhou |
1640 | High-Fidelity Neural Human Motion Transfer From Monocular Video | Moritz Kappel, Vladislav Golyanik, Mohamed Elgharib, Jann-Ole Henningson, Hans-Peter Seidel, Susana Castillo, Christian Theobalt, Marcus Magnor |
8309 | Correlated Input-Dependent Label Noise in Large-Scale Image Classification | Mark Collier, Basil Mustafa, Efi Kokiopoulou, Rodolphe Jenatton, Jesse Berent |
4210 | Bi-GCN: Binary Graph Convolutional Network | Junfu Wang, Yunhong Wang, Zhen Yang, Liang Yang, Yuanfang Guo |
2141 | Transformer Meets Tracker: Exploiting Temporal Context for Robust Visual Tracking | Ning Wang, Wengang Zhou, Jie Wang, Houqiang Li |
3966 | FS-Net: Fast Shape-Based Network for Category-Level 6D Object Pose Estimation With Decoupled Rotation Mechanism | Wei Chen, Xi Jia, Hyung Jin Chang, Jinming Duan, Linlin Shen, Aleš Leonardis |
2634 | On Learning the Geodesic Path for Incremental Learning | Christian Simon, Piotr Koniusz, Mehrtash Harandi |
5513 | UP-DETR: Unsupervised Pre-Training for Object Detection With Transformers | Zhigang Dai, Bolun Cai, Yugeng Lin, Junying Chen |
5075 | Robust Consistent Video Depth Estimation | Johannes Kopf, Xuejian Rong, Jia-Bin Huang |
6630 | Differentiable Multi-Granularity Human Representation Learning for Instance-Aware Human Semantic Parsing | Tianfei Zhou, Wenguan Wang, Si Liu, Yi Yang, Luc Van Gool |
5298 | Global Transport for Fluid Reconstruction With Learned Self-Supervision | Erik Franz, Barbara Solenthaler, Nils Thuerey |
784 | VLN BERT: A Recurrent Vision-and-Language BERT for Navigation | Yicong Hong, Qi Wu, Yuankai Qi, Cristian Rodriguez-Opazo, Stephen Gould |
4361 | Single-View Robot Pose and Joint Angle Estimation via Render & Compare | Yann Labbé, Justin Carpentier, Mathieu Aubry, Josef Sivic |
3735 | Learning Deep Classifiers Consistent With Fine-Grained Novelty Detection | Jiacheng Cheng, Nuno Vasconcelos |
2983 | CRFace: Confidence Ranker for Model-Agnostic Face Detection Refinement | Noranart Vesdapunt, Baoyuan Wang |
2222 | Equalization Loss v2: A New Gradient Balance Approach for Long-Tailed Object Detection | Jingru Tan, Xin Lu, Gang Zhang, Changqing Yin, Quanquan Li |
3057 | Semantic-Aware Video Text Detection | Wei Feng, Fei Yin, Xu-Yao Zhang, Cheng-Lin Liu |
8441 | Improved Handling of Motion Blur in Online Object Detection | Mohamed Sayed, Gabriel Brostow |
5030 | IQDet: Instance-Wise Quality Distribution Sampling for Object Detection | Yuchen Ma, Songtao Liu, Zeming Li, Jian Sun |
6084 | One Thing One Click: A Self-Training Approach for Weakly Supervised 3D Semantic Segmentation | Zhengzhe Liu, Xiaojuan Qi, Chi-Wing Fu |
4316 | Learning Monocular 3D Reconstruction of Articulated Categories From Motion | Filippos Kokkinos, Iasonas Kokkinos |
490 | SPSG: Self-Supervised Photometric Scene Generation From RGB-D Scans | Angela Dai, Yawar Siddiqui, Justus Thies, Julien Valentin, Matthias Nießner |
1506 | Semantic Segmentation for Real Point Cloud Scenes via Bilateral Augmentation and Adaptive Fusion | Shi Qiu, Saeed Anwar, Nick Barnes |
3516 | Unsupervised 3D Shape Completion Through GAN Inversion | Junzhe Zhang, Xinyi Chen, Zhongang Cai, Liang Pan, Haiyu Zhao, Shuai Yi, Chai Kiat Yeo, Bo Dai, Chen Change Loy |
6913 | 3D AffordanceNet: A Benchmark for Visual Object Affordance Understanding | Shengheng Deng, Xun Xu, Chaozheng Wu, Ke Chen, Kui Jia |
5873 | Deep Implicit Moving Least-Squares Functions for 3D Reconstruction | Shi-Lin Liu, Hao-Xiang Guo, Hao Pan, Peng-Shuai Wang, Xin Tong, Yang Liu |
1233 | Using Shape To Categorize: Low-Shot Learning With an Explicit Shape Bias | Stefan Stojanov, Anh Thai, James M. Rehg |
7160 | Privacy Preserving Localization and Mapping From Uncalibrated Cameras | Marcel Geppert, Viktor Larsson, Pablo Speciale, Johannes L. Schönberger, Marc Pollefeys |
2187 | HumanGPS: Geodesic PreServing Feature for Dense Human Correspondences | Feitong Tan, Danhang Tang, Mingsong Dou, Kaiwen Guo, Rohit Pandey, Cem Keskin, Ruofei Du, Deqing Sun, Sofien Bouaziz, Sean Fanello, Ping Tan, Yinda Zhang |
2631 | Learning Camera Localization via Dense Scene Matching | Shitao Tang, Chengzhou Tang, Rui Huang, Siyu Zhu, Ping Tan |
3096 | PlückerNet: Learn To Register 3D Line Reconstructions | Liu Liu, Hongdong Li, Haodong Yao, Ruyi Zha |
6766 | MultiLink: Multi-Class Structure Recovery via Agglomerative Clustering and Model Selection | Luca Magri, Filippo Leveni, Giacomo Boracchi |
2540 | 3D-MAN: 3D Multi-Frame Attention Network for Object Detection | Zetong Yang, Yin Zhou, Zhifeng Chen, Jiquan Ngiam |
1796 | Exploring Intermediate Representation for Monocular Vehicle Pose Estimation | Shichao Li, Zengqiang Yan, Hongyang Li, Kwang-Ting Cheng |
433 | Towards Long-Form Video Understanding | Chao-Yuan Wu, Philipp Krähenbühl |
8068 | TDN: Temporal Difference Networks for Efficient Action Recognition | Limin Wang, Zhan Tong, Bin Ji, Gangshan Wu |
759 | Self-Supervised Learning for Semi-Supervised Temporal Action Proposal | Xiang Wang, Shiwei Zhang, Zhiwu Qing, Yuanjie Shao, Changxin Gao, Nong Sang |
3889 | WOAD: Weakly Supervised Online Action Detection in Untrimmed Videos | Mingfei Gao, Yingbo Zhou, Ran Xu, Richard Socher, Caiming Xiong |
6930 | Enhancing the Transferability of Adversarial Attacks Through Variance Tuning | Xiaosen Wang, Kun He |
10158 | You See What I Want You To See: Exploring Targeted Black-Box Transferability Attack for Hash-Based Image Retrieval Systems | Yanru Xiao, Cong Wang |
9906 | Pose Recognition With Cascade Transformers | Ke Li, Shijie Wang, Xiang Zhang, Yifan Xu, Weijian Xu, Zhuowen Tu |
1740 | End-to-End Human Pose and Mesh Reconstruction With Transformers | Kevin Lin, Lijuan Wang, Zicheng Liu |
1013 | Beyond Static Features for Temporally Consistent 3D Human Pose and Shape From a Video | Hongsuk Choi, Gyeongsik Moon, Ju Yong Chang, Kyoung Mu Lee |
3209 | A Generalized Loss Function for Crowd Counting and Localization | Jia Wan, Ziquan Liu, Antoni B. Chan |
10252 | LOHO: Latent Optimization of Hairstyles via Orthogonalization | Rohit Saha, Brendan Duke, Florian Shkurti, Graham W. Taylor, Parham Aarabi |
8057 | Pseudo Facial Generation With Extreme Poses for Face Recognition | Guoli Wang, Jiaqi Ma, Qian Zhang, Jiwen Lu, Jie Zhou |
261 | Joint Generative and Contrastive Learning for Unsupervised Person Re-Identification | Hao Chen, Yaohui Wang, Benoit Lagadec, Antitza Dantcheva, Francois Bremond |
6862 | BiCnet-TKS: Learning Efficient Spatial-Temporal Representation for Video Person Re-Identification | Ruibing Hou, Hong Chang, Bingpeng Ma, Rui Huang, Shiguang Shan |
8051 | Learning To Reconstruct High Speed and High Dynamic Range Videos From Events | Yunhao Zou, Yinqiang Zheng, Tsuyoshi Takatani, Ying Fu |
208 | Iterative Filter Adaptive Network for Single Image Defocus Deblurring | Junyong Lee, Hyeongseok Son, Jaesung Rim, Sunghyun Cho, Seungyong Lee |
5129 | Recorrupted-to-Recorrupted: Unsupervised Deep Learning for Image Denoising | Tongyao Pang, Huan Zheng, Yuhui Quan, Hui Ji |
4108 | Closing the Loop: Joint Rain Generation and Removal via Disentangled Image Translation | Yuntong Ye, Yi Chang, Hanyu Zhou, Luxin Yan |
5493 | Deep Denoising of Flash and No-Flash Pairs for Photography in Low-Light Environments | Zhihao Xia, Michaël Gharbi, Federico Perazzi, Kalyan Sunkavalli, Ayan Chakrabarti |
3932 | Controllable Image Restoration for Under-Display Camera in Smartphones | Kinam Kwon, Eunhee Kang, Sangwon Lee, Su-Jin Lee, Hyong-Euk Lee, ByungIn Yoo, Jae-Joon Han |
7697 | MetaSCI: Scalable and Adaptive Reconstruction for Video Compressive Sensing | Zhengjue Wang, Hao Zhang, Ziheng Cheng, Bo Chen, Xin Yuan |
224 | Learning the Non-Differentiable Optimization for Blind Super-Resolution | Zheng Hui, Jie Li, Xiumei Wang, Xinbo Gao |
2840 | Robust Reference-Based Super-Resolution via C2-Matching | Yuming Jiang, Kelvin C.K. Chan, Xintao Wang, Chen Change Loy, Ziwei Liu |
1463 | Space-Time Distillation for Video Super-Resolution | Zeyu Xiao, Xueyang Fu, Jie Huang, Zhen Cheng, Zhiwei Xiong |
1090 | Person30K: A Dual-Meta Generalization Network for Person Re-Identification | Yan Bai, Jile Jiao, Wang Ce, Jun Liu, Yihang Lou, Xuetao Feng, Ling-Yu Duan |
7260 | Zillow Indoor Dataset: Annotated Floor Plans With 360° Panoramas and 3D Room Layouts | Steve Cruz, Will Hutchcroft, Yuguang Li, Naji Khosravan, Ivaylo Boyadzhiev, Sing Bing Kang |
6195 | The Heterogeneity Hypothesis: Finding Layer-Wise Differentiated Network Architectures | Yawei Li, Wen Li, Martin Danelljan, Kai Zhang, Shuhang Gu, Luc Van Gool, Radu Timofte |
376 | Distilling Object Detectors via Decoupled Features | Jianyuan Guo, Kai Han, Yunhe Wang, Han Wu, Xinghao Chen, Chunjing Xu, Chang Xu |
562 | S2-BNN: Bridging the Gap Between Self-Supervised Real and 1-Bit Neural Networks via Guided Distribution Calibration | Zhiqiang Shen, Zechun Liu, Jie Qin, Lei Huang, Kwang-Ting Cheng, Marios Savvides |
10832 | BCNet: Searching for Network Width With Bilaterally Coupled Network | Xiu Su, Shan You, Fei Wang, Chen Qian, Changshui Zhang, Chang Xu |
4819 | Multi-Attentional Deepfake Detection | Hanqing Zhao, Wenbo Zhou, Dongdong Chen, Tianyi Wei, Weiming Zhang, Nenghai Yu |
7128 | A Peek Into the Reasoning of Neural Networks: Interpreting With Structural Visual Concepts | Yunhao Ge, Yao Xiao, Zhi Xu, Meng Zheng, Srikrishna Karanam, Terrence Chen, Laurent Itti, Ziyan Wu |
3239 | Probabilistic Selective Encryption of Convolutional Neural Networks for Hierarchical Services | Jinyu Tian, Jiantao Zhou, Jia Duan |
3620 | Multi-Modal Relational Graph for Cross-Modal Video Moment Retrieval | Yawen Zeng, Da Cao, Xiaochi Wei, Meng Liu, Zhou Zhao, Zheng Qin |
4009 | PhD Learning: Learning With Pompeiu-Hausdorff Distances for Video-Based Vehicle Re-Identification | Jianan Zhao, Fengliang Qi, Guangyu Ren, Lin Xu |
819 | Pareidolia Face Reenactment | Linsen Song, Wayne Wu, Chaoyou Fu, Chen Qian, Chen Change Loy, Ran He |
5476 | Hyper-LifelongGAN: Scalable Lifelong Learning for Image Conditioned Generation | Mengyao Zhai, Lei Chen, Greg Mori |
743 | TediGAN: Text-Guided Diverse Face Image Generation and Manipulation | Weihao Xia, Yujiu Yang, Jing-Hao Xue, Baoyuan Wu |
1133 | TransFill: Reference-Guided Image Inpainting by Merging Multiple Color and Spatial Transformations | Yuqian Zhou, Connelly Barnes, Eli Shechtman, Sohrab Amirghodsi |
1437 | ArtCoder: An End-to-End Method for Generating Scanning-Robust Stylized QR Codes | Hao Su, Jianwei Niu, Xuefeng Liu, Qingfeng Li, Ji Wan, Mingliang Xu, Tao Ren |
7078 | Encoding in Style: A StyleGAN Encoder for Image-to-Image Translation | Elad Richardson, Yuval Alaluf, Or Patashnik, Yotam Nitzan, Yaniv Azar, Stav Shapiro, Daniel Cohen-Or |
4560 | Few-Shot Human Motion Transfer by Personalized Geometry and Texture Modeling | Zhichao Huang, Xintong Han, Jia Xu, Tong Zhang |
7586 | OCONet: Image Extrapolation by Object Completion | Richard Strong Bowen, Huiwen Chang, Charles Herrmann, Piotr Teterwak, Ce Liu, Ramin Zabih |
6394 | Greedy Hierarchical Variational Autoencoders for Large-Scale Video Prediction | Bohan Wu, Suraj Nair, Roberto Martín-Martín, Li Fei-Fei, Chelsea Finn |
3984 | Mutual CRF-GNN for Few-Shot Learning | Shixiang Tang, Dapeng Chen, Lei Bai, Kaijian Liu, Yixiao Ge, Wanli Ouyang |
4535 | Re-Labeling ImageNet: From Single to Multi-Labels, From Global to Localized Labels | Sangdoo Yun, Seong Joon Oh, Byeongho Heo, Dongyoon Han, Junsuk Choe, Sanghyuk Chun |
8886 | Differentiable Patch Selection for Image Recognition | Jean-Baptiste Cordonnier, Aravindh Mahendran, Alexey Dosovitskiy, Dirk Weissenborn, Jakob Uszkoreit, Thomas Unterthiner |
2836 | Distribution Alignment: A Unified Framework for Long-Tail Visual Recognition | Songyang Zhang, Zeming Li, Shipeng Yan, Xuming He, Jian Sun |
3177 | Contrastive Embedding for Generalized Zero-Shot Learning | Zongyan Han, Zhenyong Fu, Shuo Chen, Jian Yang |
4268 | Normal Integration via Inverse Plane Fitting With Minimum Point-to-Plane Distance | Xu Cao, Boxin Shi, Fumio Okura, Yasuyuki Matsushita |
7985 | Bayesian Nested Neural Networks for Uncertainty Calibration and Adaptive Compression | Yufei Cui, Ziquan Liu, Qiao Li, Antoni B. Chan, Chun Jason Xue |
4614 | NetAdaptV2: Efficient Neural Architecture Search With Fast Super-Network Training and Architecture Optimization | Tien-Ju Yang, Yi-Lun Liao, Vivienne Sze |
2167 | MIST: Multiple Instance Spatial Transformer | Baptiste Angles, Yuhe Jin, Simon Kornblith, Andrea Tagliasacchi, Kwang Moo Yi |
2067 | Multi-Institutional Collaborations for Improving Deep Learning-Based Magnetic Resonance Image Reconstruction Using Federated Learning | Pengfei Guo, Puyang Wang, Jinyuan Zhou, Shanshan Jiang, Vishal M. Patel |
4271 | A Self-Boosting Framework for Automated Radiographic Report Generation | Zhanyu Wang, Luping Zhou, Lei Wang, Xiu Li |
2427 | Learning a Proposal Classifier for Multiple Object Tracking | Peng Dai, Renliang Weng, Wongun Choi, Changshui Zhang, Zhangping He, Wei Ding |
234 | Improving Multiple Object Tracking With Single Object Tracking | Linyu Zheng, Ming Tang, Yingying Chen, Guibo Zhu, Jinqiao Wang, Hanqing Lu |
5850 | Feature-Level Collaboration: Joint Unsupervised Learning of Optical Flow, Stereo Depth and Camera Motion | Cheng Chi, Qingjie Wang, Tianyu Hao, Peng Guo, Xin Yang |
7377 | MaxUp: Lightweight Adversarial Training With Data Augmentation Improves Neural Network Training | Chengyue Gong, Tongzheng Ren, Mao Ye, Qiang Liu |
6748 | Unsupervised Human Pose Estimation Through Transforming Shape Templates | Luca Schmidtke, Athanasios Vlontzos, Simon Ellershaw, Anna Lukens, Tomoki Arichi, Bernhard Kainz |
8484 | Understanding the Behaviour of Contrastive Loss | Feng Wang, Huaping Liu |
2728 | Cross-Domain Adaptive Clustering for Semi-Supervised Domain Adaptation | Jichang Li, Guanbin Li, Yemin Shi, Yizhou Yu |
4528 | Divergence Optimization for Noisy Universal Domain Adaptation | Qing Yu, Atsushi Hashimoto, Yoshitaka Ushiku |
11434 | Limitations of Post-Hoc Feature Alignment for Robustness | Collin Burns, Jacob Steinhardt |
1954 | Semantic-Aware Knowledge Distillation for Few-Shot Class-Incremental Learning | Ali Cheraghian, Shafin Rahman, Pengfei Fang, Soumava Kumar Roy, Lars Petersson, Mehrtash Harandi |
11197 | Adaptive Aggregation Networks for Class-Incremental Learning | Yaoyao Liu, Bernt Schiele, Qianru Sun |
6113 | Progressive Modality Reinforcement for Human Multimodal Emotion Recognition From Unaligned Multimodal Sequences | Fengmao Lv, Xiang Chen, Yanyong Huang, Lixin Duan, Guosheng Lin |
5916 | Unsupervised Visual Representation Learning by Tracking Patches in Video | Guangting Wang, Yizhou Zhou, Chong Luo, Wenxuan Xie, Wenjun Zeng, Zhiwei Xiong |
983 | HoHoNet: 360 Indoor Holistic Understanding With Latent Horizontal Features | Cheng Sun, Min Sun, Hwann-Tzong Chen |
5458 | Depth Completion With Twin Surface Extrapolation at Occlusion Boundaries | Saif Imran, Xiaoming Liu, Daniel Morris |
1395 | Zero-Shot Instance Segmentation | Ye Zheng, Jiahong Wu, Yongqiang Qin, Faen Zhang, Li Cui |
10636 | Unsupervised Discovery of the Long-Tail in Instance Segmentation Using Hierarchical Self-Supervision | Zhenzhen Weng, Mehmet Giray Ogut, Shai Limonchik, Serena Yeung |
446 | Semi-Supervised Semantic Segmentation With Cross Pseudo Supervision | Xiaokang Chen, Yuhui Yuan, Gang Zeng, Jingdong Wang |
2532 | Non-Salient Region Object Mining for Weakly Supervised Semantic Segmentation | Yazhou Yao, Tao Chen, Guo-Sen Xie, Chuanyi Zhang, Fumin Shen, Qi Wu, Zhenmin Tang, Jian Zhang |
5925 | ABMDRNet: Adaptive-Weighted Bi-Directional Modality Difference Reduction Network for RGB-T Semantic Segmentation | Qiang Zhang, Shenlu Zhao, Yongjiang Luo, Dingwen Zhang, Nianchang Huang, Jungong Han |
4014 | BBAM: Bounding Box Attribution Map for Weakly Supervised Semantic and Instance Segmentation | Jungbeom Lee, Jihun Yi, Chaehun Shin, Sungroh Yoon |
4558 | Positive-Unlabeled Data Purification in the Wild for Object Detection | Jianyuan Guo, Kai Han, Han Wu, Chao Zhang, Xinghao Chen, Chunjing Xu, Chang Xu, Yunhe Wang |
7652 | Ranking Neural Checkpoints | Yandong Li, Xuhui Jia, Ruoxin Sang, Yukun Zhu, Bradley Green, Liqiang Wang, Boqing Gong |
10682 | SelfAugment: Automatic Augmentation Policies for Self-Supervised Learning | Colorado J Reed, Sean Metzger, Aravind Srinivas, Trevor Darrell, Kurt Keutzer |
2077 | Self-Supervised Multi-Frame Monocular Scene Flow | Junhwa Hur, Stefan Roth |
1549 | Skip-Convolutions for Efficient Video Processing | Amirhossein Habibian, Davide Abati, Taco S. Cohen, Babak Ehteshami Bejnordi |
1609 | Learning To Associate Every Segment for Video Panoptic Segmentation | Sanghyun Woo, Dahun Kim, Joon-Young Lee, In So Kweon |
4598 | Triple-Cooperative Video Shadow Detection | Zhihao Chen, Liang Wan, Lei Zhu, Jia Shen, Huazhu Fu, Wennan Liu, Jing Qin |
10358 | Image Change Captioning by Learning From an Auxiliary Task | Mehrdad Hosseinzadeh, Yang Wang |
4172 | How2Sign: A Large-Scale Multimodal Dataset for Continuous American Sign Language | Amanda Duarte, Shruti Palaskar, Lucas Ventura, Deepti Ghadiyaram, Kenneth DeHaan, Florian Metze, Jordi Torres, Xavier Giro-i-Nieto |
1731 | Cyclic Co-Learning of Sounding Object Visual Grounding and Sound Separation | Yapeng Tian, Di Hu, Chenliang Xu |
8850 | LipSync3D: Data-Efficient Learning of Personalized 3D Talking Faces From Video Using Pose and Lighting Normalization | Avisek Lahiri, Vivek Kwatra, Christian Frueh, John Lewis, Chris Bregler |
7458 | Interventional Video Grounding With Dual Contrastive Learning | Guoshun Nan, Rui Qiao, Yao Xiao, Jun Liu, Sicong Leng, Hao Zhang, Wei Lu |
987 | Roses Are Red, Violets Are Blue… but Should VQA Expect Them To? | Corentin Kervadec, Grigory Antipov, Moez Baccouche, Christian Wolf |
2862 | ReDet: A Rotation-Equivariant Detector for Aerial Object Detection | Jiaming Han, Jian Ding, Nan Xue, Gui-Song Xia |
2191 | Roof-GAN: Learning To Generate Roof Geometry and Relations for Residential Houses | Yiming Qian, Hao Zhang, Yasutaka Furukawa |
6778 | PANDA: Adapting Pretrained Features for Anomaly Detection and Segmentation | Tal Reiss, Niv Cohen, Liron Bergman, Yedid Hoshen |
7658 | Differentiable SLAM-Net: Learning Particle SLAM for Visual Navigation | Peter Karkus, Shaojun Cai, David Hsu |
Date: Tuesday, June 22, 2021 6:00 – 8:30
Paper Session Three:
Paper ID | Paper Title | Authors |
2557 | DyStaB: Unsupervised Object Segmentation via Dynamic-Static Bootstrapping | Yanchao Yang, Brian Lai, Stefano Soatto |
456 | Diffusion Probabilistic Models for 3D Point Cloud Generation | Shitong Luo, Wei Hu |
1292 | Learned Initializations for Optimizing Coordinate-Based Neural Representations | Matthew Tancik, Ben Mildenhall, Terrance Wang, Divi Schmidt, Pratul P. Srinivasan, Jonathan T. Barron, Ren Ng |
4729 | Neural Scene Graphs for Dynamic Scenes | Julian Ost, Fahim Mannan, Nils Thuerey, Julian Knodt, Felix Heide |
6513 | Consensus Maximisation Using Influences of Monotone Boolean Functions | Ruwan Tennakoon, David Suter, Erchuan Zhang, Tat-Jun Chin, Alireza Bab-Hadiashar |
566 | Task Programming: Learning Data Efficient Behavior Representations | Jennifer J. Sun, Ann Kennedy, Eric Zhan, David J. Anderson, Yisong Yue, Pietro Perona |
1058 | SCANimate: Weakly Supervised Learning of Skinned Clothed Avatar Networks | Shunsuke Saito, Jinlong Yang, Qianli Ma, Michael J. Black |
1342 | Diverse Part Discovery: Occluded Person Re-Identification With Part-Aware Transformer | Yulin Li, Jianfeng He, Tianzhu Zhang, Xiang Liu, Yongdong Zhang, Feng Wu |
8456 | What’s in the Image? Explorable Decoding of Compressed Images | Yuval Bahat, Tomer Michaeli |
7798 | Simple Copy-Paste Is a Strong Data Augmentation Method for Instance Segmentation | Golnaz Ghiasi, Yin Cui, Aravind Srinivas, Rui Qian, Tsung-Yi Lin, Ekin D. Cubuk, Quoc V. Le, Barret Zoph |
732 | Face Forgery Detection by 3D Decomposition | Xiangyu Zhu, Hao Wang, Hongyan Fei, Zhen Lei, Stan Z. Li |
1902 | Convolutional Hough Matching Networks | Juhong Min, Minsu Cho |
2997 | L2M-GAN: Learning To Manipulate Latent Space Semantics for Facial Attribute Editing | Guoxing Yang, Nanyi Fei, Mingyu Ding, Guangzhen Liu, Zhiwu Lu, Tao Xiang |
4865 | Patchwise Generative ConvNet: Training Energy-Based Models From a Single Natural Image for Internal Learning | Zilong Zheng, Jianwen Xie, Ping Li |
10084 | Generative Classifiers as a Basis for Trustworthy Image Classification | Radek Mackowiak, Lynton Ardizzone, Ullrich Köthe, Carsten Rother |
543 | HR-NAS: Searching Efficient High-Resolution Neural Architectures With Lightweight Transformers | Mingyu Ding, Xiaochen Lian, Linjie Yang, Peng Wang, Xiaojie Jin, Zhiwu Lu, Ping Luo |
2273 | Progressive Unsupervised Learning for Visual Object Tracking | Qiangqiang Wu, Jia Wan, Antoni B. Chan |
5934 | FFB6D: A Full Flow Bidirectional Fusion Network for 6D Pose Estimation | Yisheng He, Haibin Huang, Haoqiang Fan, Qifeng Chen, Jian Sun |
3270 | DER: Dynamically Expandable Representation for Class Incremental Learning | Shipeng Yan, Jiangwei Xie, Xuming He |
6831 | Dense Contrastive Learning for Self-Supervised Visual Pre-Training | Xinlong Wang, Rufeng Zhang, Chunhua Shen, Tao Kong, Lei Li |
7928 | S2R-DepthNet: Learning a Generalizable Depth-Specific Structural Representation | Xiaotian Chen, Yuwang Wang, Xuejin Chen, Wenjun Zeng |
7444 | Depth-Aware Mirror Segmentation | Haiyang Mei, Bo Dong, Wen Dong, Pieter Peers, Xin Yang, Qiang Zhang, Xiaopeng Wei |
7876 | Video Prediction Recalling Long-Term Motion Context via Memory Alignment Learning | Sangmin Lee, Hak Gu Kim, Dae Hwi Choi, Hyung-Il Kim, Yong Man Ro |
1637 | Room-and-Object Aware Knowledge Reasoning for Remote Embodied Referring Expression | Chen Gao, Jinyu Chen, Si Liu, Luting Wang, Qiong Zhang, Qi Wu |
7655 | GATSBI: Generative Agent-Centric Spatio-Temporal Object Interaction | Cheol-Hui Min, Jinseok Bae, Junho Lee, Young Min Kim |
6262 | Crossing Cuts Polygonal Puzzles: Models and Solvers | Peleg Harel, Ohad Ben-Shahar |
1769 | Transformation Invariant Few-Shot Object Detection | Aoxue Li, Zhenguo Li |
5174 | Adaptive Class Suppression Loss for Long-Tail Object Detection | Tong Wang, Yousong Zhu, Chaoyang Zhao, Wei Zeng, Jinqiao Wang, Ming Tang |
4180 | What if We Only Use Real Datasets for Scene Text Recognition? Toward Scene Text Recognition With Fewer Labels | Jeonghun Baek, Yusuke Matsui, Kiyoharu Aizawa |
2714 | Fourier Contour Embedding for Arbitrary-Shaped Text Detection | Yiqin Zhu, Jianyong Chen, Lingyu Liang, Zhanghui Kuang, Lianwen Jin, Wayne Zhang |
10709 | Humble Teachers Teach Better Students for Semi-Supervised Object Detection | Yihe Tang, Weifeng Chen, Yijun Luo, Yuting Zhang |
10513 | Cross-Modal Center Loss for 3D Cross-Modal Retrieval | Longlong Jing, Elahe Vahdani, Jiaxing Tan, Yingli Tian |
4554 | Single-View 3D Object Reconstruction From Shape Priors in Memory | Shuo Yang, Min Xu, Haozhe Xie, Stuart Perry, Jiahao Xia |
1125 | NeuralFusion: Online Depth Fusion in Latent Space | Silvan Weder, Johannes L. Schönberger, Marc Pollefeys, Martin R. Oswald |
2252 | PAConv: Position Adaptive Convolution With Dynamic Kernel Assembling on Point Clouds | Mutian Xu, Runyu Ding, Hengshuang Zhao, Xiaojuan Qi |
3524 | Self-Supervised Pillar Motion Learning for Autonomous Driving | Chenxu Luo, Xiaodong Yang, Alan Yuille |
6946 | Scan2Cap: Context-Aware Dense Captioning in RGB-D Scans | Zhenyu Chen, Ali Gholami, Matthias Nießner, Angel X. Chang |
7082 | Neural Parts: Learning Expressive 3D Shape Abstractions With Invertible Neural Networks | Despoina Paschalidou, Angelos Katharopoulos, Andreas Geiger, Sanja Fidler |
5581 | Universal Spectral Adversarial Attacks for Deformable Shapes | Arianna Rampini, Franco Pestarini, Luca Cosmo, Simone Melzi, Emanuele Rodolà |
7781 | Large-Scale Localization Datasets in Crowded Indoor Spaces | Donghwan Lee, Soohyun Ryu, Suyong Yeon, Yonghan Lee, Deokhwa Kim, Cheolho Han, Yohann Cabon, Philippe Weinzaepfel, Nicolas Guérin, Gabriela Csurka, Martin Humenberger |
2430 | Learnable Motion Coherence for Correspondence Pruning | Yuan Liu, Lingjie Liu, Cheng Lin, Zhen Dong, Wenping Wang |
2989 | Back to the Feature: Learning Robust Camera Localization From Pixels To Pose | Paul-Edouard Sarlin, Ajaykumar Unagar, Måns Larsson, Hugo Germain, Carl Toft, Viktor Larsson, Marc Pollefeys, Vincent Lepetit, Lars Hammarstrand, Fredrik Kahl, Torsten Sattler |
3629 | Wide-Baseline Relative Camera Pose Estimation With Directional Learning | Kefan Chen, Noah Snavely, Ameesh Makadia |
6536 | Deep Optimized Priors for 3D Shape Modeling and Reconstruction | Mingyue Yang, Yuxin Wen, Weikai Chen, Yongwei Chen, Kui Jia |
2912 | PVGNet: A Bottom-Up One-Stage 3D Object Detector With Integrated Multi-Level Features | Zhenwei Miao, Jikai Chen, Hongyu Pan, Ruiwen Zhang, Kaixuan Liu, Peihan Hao, Jun Zhu, Yang Wang, Xin Zhan |
2471 | Objects Are Different: Flexible Monocular 3D Object Detection | Yunpeng Zhang, Jiwen Lu, Jie Zhou |
466 | A Large-Scale Study on Unsupervised Spatiotemporal Representation Learning | Christoph Feichtenhofer, Haoqi Fan, Bo Xiong, Ross Girshick, Kaiming He |
8549 | Representing Videos As Discriminative Sub-Graphs for Action Recognition | Dong Li, Zhaofan Qiu, Yingwei Pan, Ting Yao, Houqiang Li, Tao Mei |
880 | Learning Salient Boundary Feature for Anchor-Free Temporal Action Localization | Chuming Lin, Chengming Xu, Donghao Luo, Yabiao Wang, Ying Tai, Chengjie Wang, Jilin Li, Feiyue Huang, Yanwei Fu |
679 | QAIR: Practical Query-Efficient Black-Box Attacks for Image Retrieval | Xiaodan Li, Jinfeng Li, Yuefeng Chen, Shaokai Ye, Yuan He, Shuhui Wang, Hang Su, Hui Xue |
7333 | Defending Multimodal Fusion Models Against Single-Source Adversaries | Karren Yang, Wan-Yi Lin, Manash Barman, Filipe Condessa, Zico Kolter |
844 | Training Generative Adversarial Networks in One Stage | Chengchao Shen, Youtan Yin, Xinchao Wang, Xubin Li, Jie Song, Mingli Song |
1895 | Learning Complete 3D Morphable Face Models From Images and Videos | Mallikarjun B R, Ayush Tewari, Hans-Peter Seidel, Mohamed Elgharib, Christian Theobalt |
2253 | We Are More Than Our Joints: Predicting How 3D Bodies Move | Yan Zhang, Michael J. Black, Siyu Tang |
1017 | HybrIK: A Hybrid Analytical-Neural Inverse Kinematics Solution for 3D Human Pose and Shape Estimation | Jiefeng Li, Chao Xu, Zhicun Chen, Siyuan Bian, Lixin Yang, Cewu Lu |
3902 | Learning To Count Everything | Viresh Ranjan, Udbhav Sharma, Thu Nguyen, Minh Hoai |
1650 | Information Bottleneck Disentanglement for Identity Swapping | Gege Gao, Huaibo Huang, Chaoyou Fu, Zhaoyang Li, Ran He |
9944 | Mitigating Face Recognition Bias via Group Adaptive Classifier | Sixue Gong, Xiaoming Liu, Anil K. Jain |
277 | Meta Batch-Instance Normalization for Generalizable Person Re-Identification | Seokeon Choi, Taekyung Kim, Minki Jeong, Hyoungseob Park, Changick Kim |
10117 | Refining Pseudo Labels With Clustering Consensus Over Generations for Unsupervised Object Re-Identification | Xiao Zhang, Yixiao Ge, Yu Qiao, Hongsheng Li |
8305 | Back to Event Basics: Self-Supervised Learning of Image Reconstruction for Event Cameras via Photometric Constancy | Federico Paredes-Vallés, Guido C. H. E. de Croon |
548 | DeFMO: Deblurring and Shape Recovery of Fast Moving Objects | Denys Rozumnyi, Martin R. Oswald, Vittorio Ferrari, Jiří Matas, Marc Pollefeys |
5416 | Efficient Multi-Stage Video Denoising With Recurrent Spatio-Temporal Fusion | Matteo Maggioni, Yibin Huang, Cheng Li, Shuai Xiao, Zhongqian Fu, Fenglong Song |
5436 | ZeroScatter: Domain Transfer for Long Distance Imaging and Vision Through Scattering Media | Zheng Shi, Ethan Tseng, Mario Bijelic, Werner Ritter, Felix Heide |
6037 | Restoring Extremely Dark Images in Real Time | Mohit Lamba, Kaushik Mitra |
3964 | Practical Wide-Angle Portraits Correction With Deep Structured Models | Jing Tan, Shan Zhao, Pengfei Xiong, Jiangyu Liu, Haoqiang Fan, Shuaicheng Liu |
9951 | End-to-End Learning for Joint Image Demosaicing, Denoising and Super-Resolution | Wenzhu Xing, Karen Egiazarian |
267 | Image Super-Resolution With Non-Local Sparse Attention | Yiqun Mei, Yuchen Fan, Yuqian Zhou |
3036 | Video Rescaling Networks With Joint Optimization Strategies for Downscaling and Upscaling | Yan-Cheng Huang, Yi-Hsin Chen, Cheng-You Lu, Hui-Po Wang, Wen-Hsiao Peng, Ching-Chun Huang |
2170 | Restore From Restored: Video Restoration With Pseudo Clean Video | Seunghwan Lee, Donghyeon Cho, Jiwon Kim, Tae Hyun Kim |
1092 | Enriching ImageNet With Human Similarity Judgments and Psychological Embeddings | Brett D. Roads, Bradley C. Love |
8131 | Conceptual 12M: Pushing Web-Scale Image-Text Pre-Training To Recognize Long-Tail Visual Concepts | Soravit Changpinyo, Piyush Sharma, Nan Ding, Radu Soricut |
7919 | CondenseNet V2: Sparse Feature Reactivation for Deep Networks | Le Yang, Haojun Jiang, Ruojin Cai, Yulin Wang, Shiji Song, Gao Huang, Qi Tian |
941 | Revisiting Knowledge Distillation: An Inheritance and Exploration Framework | Zhen Huang, Xu Shen, Jun Xing, Tongliang Liu, Xinmei Tian, Houqiang Li, Bing Deng, Jianqiang Huang, Xian-Sheng Hua |
946 | Minimally Invasive Surgery for Sparse Neural Networks in Contrastive Manner | Chong Yu |
11586 | Effective Sparsification of Neural Networks With Global Sparsity Constraint | Xiao Zhou, Weizhong Zhang, Hang Xu, Tong Zhang |
7211 | Improving the Efficiency and Robustness of Deepfakes Detection Through Precise Geometric Features | Zekun Sun, Yujie Han, Zeyu Hua, Na Ruan, Weijia Jia |
8409 | Right for the Right Concept: Revising Neuro-Symbolic Concepts by Interacting With Their Explanations | Wolfgang Stammer, Patrick Schramowski, Kristian Kersting |
3722 | Protecting Intellectual Property of Generative Adversarial Networks From Ambiguity Attacks | Ding Sheng Ong, Chee Seng Chan, Kam Woh Ng, Lixin Fan, Qiang Yang |
572 | VIGOR: Cross-View Image Geo-Localization Beyond One-to-One Retrieval | Sijie Zhu, Taojiannan Yang, Chen Chen |
4736 | On Semantic Similarity in Video Retrieval | Michael Wray, Hazel Doughty, Dima Damen |
3760 | Flow-Guided One-Shot Talking Face Generation With a High-Resolution Audio-Visual Dataset | Zhimeng Zhang, Lincheng Li, Yu Ding, Changjie Fan |
5478 | Navigating the GAN Parameter Space for Semantic Image Editing | Anton Cherepkov, Andrey Voynov, Artem Babenko |
1299 | IMAGINE: Image Synthesis by Image-Guided Model Inversion | Pei Wang, Yijun Li, Krishna Kumar Singh, Jingwan Lu, Nuno Vasconcelos |
1550 | Human De-Occlusion: Invisible Perception and Recovery for Humans | Qiang Zhou, Shiyin Wang, Yitong Wang, Zilong Huang, Xinggang Wang |
2573 | Learning To Warp for Style Transfer | Xiao-Chang Liu, Yong-Liang Yang, Peter Hall |
7828 | StEP: Style-Based Encoder Pre-Training for Multi-Modal Image Synthesis | Moustafa Meshry, Yixuan Ren, Larry S. Davis, Abhinav Shrivastava |
5132 | ANR: Articulated Neural Rendering for Virtual Avatars | Amit Raj, Julian Tanke, James Hays, Minh Vo, Carsten Stoll, Christoph Lassner |
8365 | LayoutTransformer: Scene Layout Generation With Conceptual and Spatial Diversity | Cheng-Fu Yang, Wan-Cyuan Fan, Fu-En Yang, Yu-Chiang Frank Wang |
222 | Stochastic Image-to-Video Synthesis Using cINNs | Michael Dorkenwald, Timo Milbich, Andreas Blattmann, Robin Rombach, Konstantinos G. Derpanis, Björn Ommer |
5343 | Prototype Completion With Primitive Knowledge for Few-Shot Learning | Baoquan Zhang, Xutao Li, Yunming Ye, Zhichao Huang, Lisai Zhang |
11705 | Dynamic Class Queue for Large Scale Face Recognition in the Wild | Bi Li, Teng Xi, Gang Zhang, Haocheng Feng, Junyu Han, Jingtuo Liu, Errui Ding, Wenyu Liu |
10159 | Truly Shift-Invariant Convolutional Neural Networks | Anadi Chaman, Ivan Dokmanić |
3293 | RSG: A Simple but Effective Module for Learning Imbalanced Datasets | Jianfeng Wang, Thomas Lukasiewicz, Xiaolin Hu, Jianfei Cai, Zhenghua Xu |
6754 | Goal-Oriented Gaze Estimation for Zero-Shot Learning | Yang Liu, Lei Zhou, Xiao Bai, Yifei Huang, Lin Gu, Jun Zhou, Tatsuya Harada |
4294 | Uncalibrated Neural Inverse Rendering for Photometric Stereo of General Surfaces | Berk Kaya, Suryansh Kumar, Carlos Oliveira, Vittorio Ferrari, Luc Van Gool |
10421 | Robust Bayesian Neural Networks by Spectral Expectation Bound Regularization | Jiaru Zhang, Yang Hua, Zhengui Xue, Tao Song, Chengyu Zheng, Ruhui Ma, Haibing Guan |
6542 | MobileDets: Searching for Object Detection Architectures for Mobile Accelerators | Yunyang Xiong, Hanxiao Liu, Suyog Gupta, Berkin Akin, Gabriel Bender, Yongzhe Wang, Pieter-Jan Kindermans, Mingxing Tan, Vikas Singh, Bo Chen |
2698 | Hilbert Sinkhorn Divergence for Optimal Transport | Qian Li, Zhichao Wang, Gang Li, Jun Pang, Guandong Xu |
2785 | Object Classification From Randomized EEG Trials | Hamad Ahmed, Ronnie B. Wilbur, Hari M. Bharadwaj, Jeffrey Mark Siskind |
4924 | Leveraging Large-Scale Weakly Labeled Data for Semi-Supervised Mass Detection in Mammograms | Yuxing Tang, Zhenjie Cao, Yanbo Zhang, Zhicheng Yang, Zongcheng Ji, Yiwei Wang, Mei Han, Jie Ma, Jing Xiao, Peng Chang |
3400 | Tracking Pedestrian Heads in Dense Crowd | Ramana Sundararaman, Cédric De Almeida Braga, Eric Marchand, Julien Pettré |
1310 | Multiple Object Tracking With Correlation Learning | Qiang Wang, Yun Zheng, Pan Pan, Yinghui Xu |
8823 | SMURF: Self-Teaching Multi-Frame Unsupervised RAFT With Full-Image Warping | Austin Stone, Daniel Maurer, Alper Ayvaci, Anelia Angelova, Rico Jonschkowski |
2448 | Bilinear Parameterization for Non-Separable Singular Value Penalties | Marcus Valtonen Örnhag, José Pedro Iglesias, Carl Olsson |
420 | DSC-PoseNet: Learning 6DoF Object Pose Estimation via Dual-Scale Consistency | Zongxin Yang, Xin Yu, Yi Yang |
1223 | Unsupervised Disentanglement of Linear-Encoded Facial Semantics | Yutong Zheng, Yu-Kai Huang, Ran Tao, Zhiqiang Shen, Marios Savvides |
2813 | MetaCorrection: Domain-Aware Meta Loss Correction for Unsupervised Domain Adaptation in Semantic Segmentation | Xiaoqing Guo, Chen Yang, Baopu Li, Yixuan Yuan |
4620 | Cross-Domain Gradient Discrepancy Minimization for Unsupervised Domain Adaptation | Zhekai Du, Jingjing Li, Hongzu Su, Lei Zhu, Ke Lu |
1262 | Generative Interventions for Causal Learning | Chengzhi Mao, Augustine Cha, Amogh Gupta, Hao Wang, Junfeng Yang, Carl Vondrick |
2806 | Distilling Causal Effect of Data in Class-Incremental Learning | Xinting Hu, Kaihua Tang, Chunyan Miao, Xian-Sheng Hua, Hanwang Zhang |
2428 | Embedding Transfer With Label Relaxation for Improved Metric Learning | Sungyeon Kim, Dongwon Kim, Minsu Cho, Suha Kwak |
8012 | M3P: Learning Universal Representations via Multitask Multilingual Multimodal Pre-Training | Minheng Ni, Haoyang Huang, Lin Su, Edward Cui, Taroon Bharti, Lijuan Wang, Dongdong Zhang, Nan Duan |
6397 | Instance Localization for Self-Supervised Detection Pretraining | Ceyuan Yang, Zhirong Wu, Bolei Zhou, Stephen Lin |
1314 | ViP-DeepLab: Learning Visual Perception With Depth-Aware Video Panoptic Segmentation | Siyuan Qiao, Yukun Zhu, Hartwig Adam, Alan Yuille, Liang-Chieh Chen |
6985 | AdaBins: Depth Estimation Using Adaptive Bins | Shariq Farooq Bhat, Ibraheem Alhashim, Peter Wonka |
1418 | Deep Occlusion-Aware Instance Segmentation With Overlapping BiLayers | Lei Ke, Yu-Wing Tai, Chi-Keung Tang |
10761 | Information-Theoretic Segmentation by Inpainting Error Maximization | Pedro Savarese, Sunnie S. Y. Kim, Michael Maire, Greg Shakhnarovich, David McAllester |
448 | PLOP: Learning Without Forgetting for Continual Semantic Segmentation | Arthur Douillard, Yifu Chen, Arnaud Dapogny, Matthieu Cord |
2873 | Coarse-To-Fine Domain Adaptive Semantic Segmentation With Photometric Alignment and Category-Center Regularization | Haoyu Ma, Xiangru Lin, Zifeng Wu, Yizhou Yu |
6053 | HyperSeg: Patch-Wise Hypernetwork for Real-Time Semantic Segmentation | Yuval Nirkin, Lior Wolf, Tal Hassner |
4016 | Anti-Adversarially Manipulated Attributions for Weakly and Semi-Supervised Semantic Segmentation | Jungbeom Lee, Eunji Kim, Sungroh Yoon |
4572 | Instant-Teaching: An End-to-End Semi-Supervised Object Detection Framework | Qiang Zhou, Chaohui Yu, Zhibin Wang, Qi Qian, Hao Li |
11326 | Unbiased Mean Teacher for Cross-Domain Object Detection | Jinhong Deng, Wen Li, Yuhua Chen, Lixin Duan |
10922 | MeanShift++: Extremely Fast Mode-Seeking With Applications to Segmentation and Object Tracking | Jennifer Jang, Heinrich Jiang |
2940 | FlowStep3D: Model Unrolling for Self-Supervised Scene Flow Estimation | Yair Kittenplon, Yonina C. Eldar, Dan Raviv |
2034 | Recognizing Actions in Videos From Unseen Viewpoints | AJ Piergiovanni, Michael S. Ryoo |
1665 | VSPW: A Large-Scale Dataset for Video Scene Parsing in the Wild | Jiaxu Miao, Yunchao Wei, Yu Wu, Chen Liang, Guangrui Li, Yi Yang |
4755 | Learning Position and Target Consistency for Memory-Based Video Object Segmentation | Li Hu, Peng Zhang, Bang Zhang, Pan Pan, Yinghui Xu, Rong Jin |
1719 | UC2: Universal Cross-Lingual Cross-Modal Vision-and-Language Pre-Training | Mingyang Zhou, Luowei Zhou, Shuohang Wang, Yu Cheng, Linjie Li, Zhou Yu, Jingjing Liu |
10239 | Fingerspelling Detection in American Sign Language | Bowen Shi, Diane Brentari, Greg Shakhnarovich, Karen Livescu |
1845 | Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation | Hang Zhou, Yasheng Sun, Wayne Wu, Chen Change Loy, Xiaogang Wang, Ziwei Liu |
1631 | Collaborative Spatial-Temporal Modeling for Language-Queried Video Actor Segmentation | Tianrui Hui, Shaofei Huang, Si Liu, Zihan Ding, Guanbin Li, Wenguan Wang, Jizhong Han, Fei Wang |
7642 | Cascaded Prediction Network via Segment Tree for Temporal Video Grounding | Yang Zhao, Zhou Zhao, Zhu Zhang, Zhijie Lin |
990 | How Transferable Are Reasoning Patterns in VQA? | Corentin Kervadec, Théo Jaunet, Grigory Antipov, Moez Baccouche, Romain Vuillemot, Christian Wolf |
3065 | PointFlow: Flowing Semantics Through Points for Aerial Image Segmentation | Xiangtai Li, Hao He, Xia Li, Duo Li, Guangliang Cheng, Jianping Shi, Lubin Weng, Yunhai Tong, Zhouchen Lin |
2979 | HDMapGen: A Hierarchical Graph Generative Model of High Definition Maps | Lu Mi, Hang Zhao, Charlie Nash, Xiaohan Jin, Jiyang Gao, Chen Sun, Cordelia Schmid, Nir Shavit, Yuning Chai, Dragomir Anguelov |
3362 | A Circular-Structured Representation for Visual Emotion Distribution Learning | Jingyuan Yang, Jie Li, Leida Li, Xiumei Wang, Xinbo Gao |
2304 | More Photos Are All You Need: Semi-Supervised Learning for Fine-Grained Sketch Based Image Retrieval | Ayan Kumar Bhunia, Pinaki Nath Chowdhury, Aneeshan Sain, Yongxin Yang, Tao Xiang, Yi-Zhe Song |
Date: Tuesday, June 22, 2021 11:00 – 13:30
Paper Session Four:
Paper ID | Paper Title | Authors |
8890 | Line Segment Detection Using Transformers Without Edges | Yifan Xu, Weijian Xu, David Cheung, Zhuowen Tu |
1139 | Predator: Registration of 3D Point Clouds With Low Overlap | Shengyu Huang, Zan Gojcic, Mikhail Usvyatsov, Andreas Wieser, Konrad Schindler |
1824 | Point2Skeleton: Learning Skeletal Representations From Point Clouds | Cheng Lin, Changjian Li, Yuan Liu, Nenglun Chen, Yi-King Choi, Wenping Wang |
4945 | Neural Lumigraph Rendering | Petr Kellnhofer, Lars C. Jebe, Andrew Jones, Ryan Spicer, Kari Pulli, Gordon Wetzstein |
2419 | Rotation Coordinate Descent for Fast Globally Optimal Rotation Averaging | Alvaro Parra, Shin-Fang Chng, Tat-Jun Chin, Anders Eriksson, Ian Reid |
3659 | Towards Evaluating and Training Verifiably Robust Neural Networks | Zhaoyang Lyu, Minghao Guo, Tong Wu, Guodong Xu, Kehuan Zhang, Dahua Lin |
1929 | Human POSEitioning System (HPS): 3D Human Pose Estimation and Self-Localization in Large Scenes From Body-Mounted Sensors | Vladimir Guzov, Aymen Mir, Torsten Sattler, Gerard Pons-Moll |
3358 | Discover Cross-Modality Nuances for Visible-Infrared Person Re-Identification | Qiong Wu, Pingyang Dai, Jie Chen, Chia-Wen Lin, Yongjian Wu, Feiyue Huang, Bineng Zhong, Rongrong Ji |
1427 | Dual Pixel Exploration: Simultaneous Depth Estimation and Image Restoration | Liyuan Pan, Shah Chowdhury, Richard Hartley, Miaomiao Liu, Hongguang Zhang, Hongdong Li |
3586 | Towards Good Practices for Efficiently Annotating Large-Scale Image Classification Datasets | Yuan-Hong Liao, Amlan Kar, Sanja Fidler |
1619 | ForgeryNet: A Versatile Benchmark for Comprehensive Forgery Analysis | Yinan He, Bei Gan, Siyu Chen, Yichun Zhou, Guojun Yin, Luchuan Song, Lu Sheng, Jing Shao, Ziwei Liu |
5085 | Spatial-Temporal Correlation and Topology Learning for Person Re-Identification in Videos | Jiawei Liu, Zheng-Jun Zha, Wei Wu, Kecheng Zheng, Qibin Sun |
5822 | SSN: Soft Shadow Network for Image Compositing | Yichen Sheng, Jianming Zhang, Bedrich Benes |
11788 | Soft-IntroVAE: Analyzing and Improving the Introspective Variational Autoencoder | Tal Daniel, Aviv Tamar |
4081 | Learning Placeholders for Open-Set Recognition | Da-Wei Zhou, Han-Jia Ye, De-Chuan Zhan |
1896 | ReNAS: Relativistic Evaluation of Neural Architecture Search | Yixing Xu, Yunhe Wang, Kai Han, Yehui Tang, Shangling Jui, Chunjing Xu, Chang Xu |
4117 | Learning To Filter: Siamese Relation Network for Robust Tracking | Siyuan Cheng, Bineng Zhong, Guorong Li, Xin Liu, Zhenjun Tang, Xianxian Li, Jing Wang |
2644 | Generative Hierarchical Features From Synthesizing Images | Yinghao Xu, Yujun Shen, Jiapeng Zhu, Ceyuan Yang, Bolei Zhou |
8086 | Continual Adaptation of Visual Representations via Domain Randomization and Meta-Learning | Riccardo Volpi, Diane Larlus, Grégory Rogez |
8866 | NewtonianVAE: Proportional Control and Goal Identification From Pixels via Physical Latent Spaces | Miguel Jaques, Michael Burke, Timothy M. Hospedales |
6073 | 3D-to-2D Distillation for Indoor Scene Parsing | Zhengzhe Liu, Xiaojuan Qi, Chi-Wing Fu |
10030 | Repurposing GANs for One-Shot Semantic Part Segmentation | Nontawat Tritrong, Pitchaporn Rewatbowornwong, Supasorn Suwajanakorn |
2958 | Temporal Query Networks for Fine-Grained Video Understanding | Chuhan Zhang, Ankush Gupta, Andrew Zisserman |
3887 | ManipulaTHOR: A Framework for Visual Object Manipulation | Kiana Ehsani, Winson Han, Alvaro Herrasti, Eli VanderBilt, Luca Weihs, Eric Kolve, Aniruddha Kembhavi, Roozbeh Mottaghi |
344 | Omnimatte: Associating Objects and Their Effects in Video | Erika Lu, Forrester Cole, Tali Dekel, Andrew Zisserman, William T. Freeman, Michael Rubinstein |
582 | MeGA-CDA: Memory Guided Attention for Category-Aware Unsupervised Domain Adaptive Object Detection | Vibashan VS, Vikram Gupta, Poojan Oza, Vishwanath A. Sindagi, Vishal M. Patel |
2564 | Generalized Few-Shot Object Detection Without Forgetting | Zhibo Fan, Yuchen Ma, Zeming Li, Jian Sun |
6392 | DAP: Detection-Aware Pre-Training With Weak Supervision | Yuanyi Zhong, Jianfeng Wang, Lijuan Wang, Jian Peng, Yu-Xiong Wang, Lei Zhang |
5538 | A Multiplexed Network for End-to-End, Multilingual OCR | Jing Huang, Guan Pang, Rama Kovvuri, Mandy Toh, Kevin J Liang, Praveen Krishnan, Xi Yin, Tal Hassner |
6276 | Scene Text Retrieval via Joint Text Detection and Similarity Learning | Hao Wang, Xiang Bai, Mingkun Yang, Shenggao Zhu, Jing Wang, Wenyu Liu |
1987 | Data-Uncertainty Guided Multi-Phase Learning for Semi-Supervised Object Detection | Zhenyu Wang, Yali Li, Ye Guo, Lu Fang, Shengjin Wang |
1291 | pixelNeRF: Neural Radiance Fields From One or Few Images | Alex Yu, Vickie Ye, Matthew Tancik, Angjoo Kanazawa |
4891 | From Points to Multi-Object 3D Reconstruction | Francis Engelmann, Konstantinos Rematas, Bastian Leibe, Vittorio Ferrari |
3949 | 4D Hyperspectral Photoacoustic Data Restoration With Reliability Analysis | Weihang Liao, Art Subpa-asa, Yinqiang Zheng, Imari Sato |
2306 | RfD-Net: Point Scene Understanding by Semantic Instance Reconstruction | Yinyu Nie, Ji Hou, Xiaoguang Han, Matthias Nießner |
4083 | Style-Based Point Generator With Adversarial Rendering for Point Cloud Completion | Chulin Xie, Chuxin Wang, Bo Zhang, Hao Yang, Dong Chen, Fang Wen |
7033 | Denoise and Contrast for Category Agnostic Shape Completion | Antonio Alliegro, Diego Valsesia, Giulia Fracastoro, Enrico Magli, Tatiana Tommasi |
8562 | Neural Surface Maps | Luca Morreale, Noam Aigerman, Vladimir G. Kim, Niloy J. Mitra |
3438 | RGB-D Local Implicit Function for Depth Completion of Transparent Objects | Luyang Zhu, Arsalan Mousavian, Yu Xiang, Hammad Mazhar, Jozef van Eenbergen, Shoubhik Debnath, Dieter Fox |
10157 | Uncertainty-Aware Camera Pose Estimation From Points and Lines | Alexander Vakhitov, Luis Ferraz, Antonio Agudo, Francesc Moreno-Noguer |
2491 | Patch2Pix: Epipolar-Guided Pixel-Level Correspondences | Qunjie Zhou, Torsten Sattler, Laura Leal-Taixé |
7332 | Deep Multi-Task Learning for Joint Localization, Perception, and Prediction | John Phillips, Julieta Martinez, Ioan Andrei Bârsan, Sergio Casas, Abbas Sadat, Raquel Urtasun |
4114 | IBRNet: Learning Multi-View Image-Based Rendering | Qianqian Wang, Zhicheng Wang, Kyle Genova, Pratul P. Srinivasan, Howard Zhou, Jonathan T. Barron, Ricardo Martin-Brualla, Noah Snavely, Thomas Funkhouser |
7304 | Unsupervised Learning of 3D Object Categories From Videos in the Wild | Philipp Henzler, Jeremy Reizenstein, Patrick Labatut, Roman Shapovalov, Tobias Ritschel, Andrea Vedaldi, David Novotny |
754 | LiDAR-Aug: A General Rendering-Based Augmentation Framework for 3D Object Detection | Jin Fang, Xinxin Zuo, Dingfu Zhou, Shengze Jin, Sen Wang, Liangjun Zhang |
3226 | Delving Into Localization Errors for Monocular 3D Object Detection | Xinzhu Ma, Yinmin Zhang, Dan Xu, Dongzhan Zhou, Shuai Yi, Haojie Li, Wanli Ouyang |
1652 | 3D CNNs With Adaptive Temporal Feature Resolutions | Mohsen Fayyaz, Emad Bahrami, Ali Diba, Mehdi Noroozi, Ehsan Adeli, Luc Van Gool, Jürgen Gall |
313 | 3D Human Action Representation Learning via Cross-View Consistency Pursuit | Linguo Li, Minsi Wang, Bingbing Ni, Hang Wang, Jiancheng Yang, Wenjun Zhang |
1723 | Three Birds With One Stone: Multi-Task Temporal Action Detection via Recycling Temporal Annotations | Zhihui Li, Lina Yao |
960 | Delving Into Data: Effectively Substitute Training for Black-Box Attack | Wenxuan Wang, Bangjie Yin, Taiping Yao, Li Zhang, Yanwei Fu, Shouhong Ding, Jilin Li, Feiyue Huang, Xiangyang Xue |
7686 | Data-Free Model Extraction | Jean-Baptiste Truong, Pratyush Maini, Robert J. Walls, Nicolas Papernot |
7269 | Adaptive Weighted Discriminator for Training Generative Adversarial Networks | Vasily Zadorozhnyy, Qiang Cheng, Qiang Ye |
1898 | Monocular Reconstruction of Neural Face Reflectance Fields | Mallikarjun B R, Ayush Tewari, Tae-Hyun Oh, Tim Weyrich, Bernd Bickel, Hans-Peter Seidel, Hanspeter Pfister, Wojciech Matusik, Mohamed Elgharib, Christian Theobalt |
2267 | Towards Accurate 3D Human Motion Prediction From Incomplete Observations | Qiongjie Cui, Huaijiang Sun |
1429 | Monocular Real-Time Full Body Capture With Inter-Part Correlations | Yuxiao Zhou, Marc Habermann, Ikhsanul Habibie, Ayush Tewari, Christian Theobalt, Feng Xu |
6178 | Cross-Modal Collaborative Representation Learning and a Large-Scale RGBT Benchmark for Crowd Counting | Lingbo Liu, Jiaqi Chen, Hefeng Wu, Guanbin Li, Chenglong Li, Liang Lin |
2827 | One Shot Face Swapping on Megapixels | Yuhao Zhu, Qi Li, Jian Wang, Cheng-Zhong Xu, Zhenan Sun |
2974 | Dynamic Probabilistic Graph Convolution for Facial Action Unit Intensity Estimation | Tengfei Song, Zijun Cui, Yuru Wang, Wenming Zheng, Qiang Ji |
815 | Joint Noise-Tolerant Learning and Meta Camera Shift Adaptation for Unsupervised Person Re-Identification | Fengxiang Yang, Zhun Zhong, Zhiming Luo, Yuanzheng Cai, Yaojin Lin, Shaozi Li, Nicu Sebe |
10905 | Prototype-Guided Saliency Feature Learning for Person Search | Hanjae Kim, Sunghun Joung, Ig-Jae Kim, Kwanghoon Sohn |
3944 | Labeled From Unlabeled: Exploiting Unlabeled Data for Few-Shot Deep HDR Deghosting | K. Ram Prabhakar, Gowtham Senthil, Susmit Agrawal, R. Venkatesh Babu, Rama Krishna Sai S Gorthi |
2053 | Learning Spatially-Variant MAP Models for Non-Blind Image Deblurring | Jiangxin Dong, Stefan Roth, Bernt Schiele |
7780 | NBNet: Noise Basis Learning for Image Denoising With Subspace Projection | Shen Cheng, Yuzhi Wang, Haibin Huang, Donghao Liu, Haoqiang Fan, Shuaicheng Liu |
6177 | Image De-Raining via Continual Learning | Man Zhou, Jie Xiao, Yifan Chang, Xueyang Fu, Aiping Liu, Jinshan Pan, Zheng-Jun Zha |
6156 | Exploring Sparsity in Image Super-Resolution for Efficient Inference | Longguang Wang, Xiaoyu Dong, Yingqian Wang, Xinyi Ying, Zaiping Lin, Wei An, Yulan Guo |
4280 | From Shadow Generation To Shadow Removal | Zhihao Liu, Hui Yin, Xinyi Wu, Zhenyao Wu, Yang Mi, Song Wang |
1962 | Spatiotemporal Registration for Event-Based Visual Odometry | Daqi Liu, Alvaro Parra, Tat-Jun Chin |
302 | BasicVSR: The Search for Essential Components in Video Super-Resolution and Beyond | Kelvin C.K. Chan, Xintao Wang, Ke Yu, Chao Dong, Chen Change Loy |
3343 | Fast Bayesian Uncertainty Estimation and Reduction of Batch Normalized Single Image Super-Resolution Network | Aupendu Kar, Prabir Kumar Biswas |
2552 | Learning Temporal Consistency for Low Light Video Enhancement From Single Images | Fan Zhang, Yu Li, Shaodi You, Ying Fu |
2107 | Towards Semantic Segmentation of Urban-Scale 3D Point Clouds: A Dataset, Benchmarks and Challenges | Qingyong Hu, Bo Yang, Sheikh Khalid, Wen Xiao, Niki Trigoni, Andrew Markham |
8839 | Neural Side-by-Side: Predicting Human Preferences for No-Reference Super-Resolution Evaluation | Valentin Khrulkov, Artem Babenko |
5369 | Slimmable Compressive Autoencoders for Practical Neural Image Compression | Fei Yang, Luis Herranz, Yongmei Cheng, Mikhail G. Mozerov |
1390 | Distilling Knowledge via Knowledge Review | Pengguang Chen, Shu Liu, Hengshuang Zhao, Jiaya Jia |
1043 | Manifold Regularized Dynamic Network Pruning | Yehui Tang, Yunhe Wang, Yixing Xu, Yiping Deng, Chao Xu, Dacheng Tao, Chang Xu |
2224 | Learnable Companding Quantization for Accurate Low-Bit Neural Networks | Kohei Yamamoto |
10100 | Lips Don’t Lie: A Generalisable and Robust Approach To Face Forgery Detection | Alexandros Haliassos, Konstantinos Vougioukas, Stavros Petridis, Maja Pantic |
10338 | Guided Integrated Gradients: An Adaptive Path Method for Removing Noise | Andrei Kapishnikov, Subhashini Venugopalan, Besim Avci, Ben Wedin, Michael Terry, Tolga Bolukbasi |
3934 | Scalable Differential Privacy With Sparse Network Finetuning | Zelun Luo, Daniel J. Wu, Ehsan Adeli, Li Fei-Fei |
2115 | Deep Graph Matching Under Quadratic Constraint | Quankai Gao, Fudong Wang, Nan Xue, Jin-Gang Yu, Gui-Song Xia |
8225 | T2VLAD: Global-Local Sequence Alignment for Text-Video Retrieval | Xiaohan Wang, Linchao Zhu, Yi Yang |
5722 | FaceInpainter: High Fidelity Face Adaptation to Heterogeneous Domains | Jia Li, Zhaoyang Li, Jie Cao, Xingguang Song, Ran He |
5597 | Partition-Guided GANs | Mohammadreza Armandpour, Ali Sadeghian, Chunyuan Li, Mingyuan Zhou |
1451 | Repopulating Street Scenes | Yifan Wang, Andrew Liu, Richard Tucker, Jiajun Wu, Brian L. Curless, Steven M. Seitz, Noah Snavely |
2855 | Image Inpainting With External-Internal Learning and Monochromic Bottleneck | Tengfei Wang, Hao Ouyang, Qifeng Chen |
2878 | DG-Font: Deformable Generative Networks for Unsupervised Font Generation | Yangchen Xie, Xinyuan Chen, Li Sun, Yue Lu |
7839 | Drafting and Revision: Laplacian Pyramid Network for Fast High-Quality Artistic Style Transfer | Tianwei Lin, Zhuoqi Ma, Fu Li, Dongliang He, Xin Li, Errui Ding, Nannan Wang, Jie Li, Xinbo Gao |
5214 | StylePeople: A Generative Model of Fullbody Human Avatars | Artur Grigorev, Karim Iskakov, Anastasia Ianina, Renat Bashirov, Ilya Zakharkin, Alexander Vakhitov, Victor Lempitsky |
11062 | Synthesize-It-Classifier: Learning a Generative Classifier Through Recurrent Self-Analysis | Arghya Pal, Raphaël C.-W. Phan, KokSheik Wong |
244 | Understanding Object Dynamics for Interactive Image-to-Video Synthesis | Andreas Blattmann, Timo Milbich, Michael Dorkenwald, Björn Ommer |
5839 | Learning Dynamic Alignment via Meta-Filter for Few-Shot Learning | Chengming Xu, Yanwei Fu, Chen Liu, Chengjie Wang, Jilin Li, Feiyue Huang, Li Zhang, Xiangyang Xue |
2175 | Jo-SRC: A Contrastive Approach for Combating Noisy Labels | Yazhou Yao, Zeren Sun, Chuanyi Zhang, Fumin Shen, Qi Wu, Jian Zhang, Zhenmin Tang |
4599 | On Focal Loss for Class-Posterior Probability Estimation: A Theoretical Perspective | Nontawat Charoenphakdee, Jayakorn Vongkulbhisal, Nuttapong Chairatanakul, Masashi Sugiyama |
6434 | MetaSAug: Meta Semantic Augmentation for Long-Tailed Visual Recognition | Shuang Li, Kaixiong Gong, Chi Harold Liu, Yulin Wang, Feng Qiao, Xinjing Cheng |
7256 | Open World Compositional Zero-Shot Learning | Massimiliano Mancini, Muhammad Ferjad Naeem, Yongqin Xian, Zeynep Akata |
6107 | Deep Texture Recognition via Exploiting Cross-Layer Statistical Self-Similarity | Zhile Chen, Feng Li, Yuhui Quan, Yong Xu, Hui Ji |
3260 | Combinatorial Learning of Graph Edit Distance via Dynamic Embedding | Runzhong Wang, Tianqi Zhang, Tianshu Yu, Junchi Yan, Xiaokang Yang |
7448 | TransNAS-Bench-101: Improving Transferability and Generalizability of Cross-Task Neural Architecture Search | Yawen Duan, Xin Chen, Hang Xu, Zewei Chen, Xiaodan Liang, Tong Zhang, Zhenguo Li |
10123 | An Alternative Probabilistic Interpretation of the Huber Loss | Gregory P. Meyer |
7795 | Joint Deep Model-Based MR Image and Coil Sensitivity Reconstruction Network (Joint-ICNet) for Fast MRI | Yohan Jun, Hyungseob Shin, Taejoon Eo, Dosik Hwang |
6230 | Automatic Vertebra Localization and Identification in CT by Spine Rectification and Anatomically-Constrained Optimization | Fakai Wang, Kang Zheng, Le Lu, Jing Xiao, Min Wu, Shun Miao |
3472 | Alpha-Refine: Boosting Tracking Performance by Precise Bounding Box Estimation | Bin Yan, Xinyu Zhang, Dong Wang, Huchuan Lu, Xiaoyun Yang |
1576 | Learnable Graph Matching: Incorporating Graph Partitioning With Deep Feature Learning for Multiple Object Tracking | Jiawei He, Zehao Huang, Naiyan Wang, Zhaoxiang Zhang |
1821 | Group-Aware Label Transfer for Domain Adaptive Person Re-Identification | Kecheng Zheng, Wu Liu, Lingxiao He, Tao Mei, Jiebo Luo, Zheng-Jun Zha |
8382 | Double Low-Rank Representation With Projection Distance Penalty for Clustering | Zhiqiang Fu, Yao Zhao, Dongxia Chang, Xingxing Zhang, Yiming Wang |
453 | Multiple Instance Active Learning for Object Detection | Tianning Yuan, Fang Wan, Mengying Fu, Jianzhuang Liu, Songcen Xu, Xiangyang Ji, Qixiang Ye |
2199 | Learning Compositional Representation for 4D Captures With Neural ODE | Boyan Jiang, Yinda Zhang, Xingkui Wei, Xiangyang Xue, Yanwei Fu |
2879 | Curriculum Graph Co-Teaching for Multi-Target Domain Adaptation | Subhankar Roy, Evgeny Krivosheev, Zhun Zhong, Nicu Sebe, Elisa Ricci |
4872 | Instance Level Affinity-Based Transfer for Unsupervised Domain Adaptation | Astuti Sharma, Tarun Kalluri, Manmohan Chandraker |
3798 | Deep Stable Learning for Out-of-Distribution Generalization | Xingxuan Zhang, Peng Cui, Renzhe Xu, Linjun Zhou, Yue He, Zheyan Shen |
3143 | ORDisCo: Effective and Efficient Usage of Incremental Unlabeled Data for Semi-Supervised Continual Learning | Liyuan Wang, Kuo Yang, Chongxuan Li, Lanqing Hong, Zhenguo Li, Jun Zhu |
2689 | Dynamic Metric Learning: Towards a Scalable Metric Space To Accommodate Multiple Semantic Scales | Yifan Sun, Yuke Zhu, Yuhan Zhang, Pengkun Zheng, Xi Qiu, Chi Zhang, Yichen Wei |
8722 | Learning Cross-Modal Retrieval With Noisy Labels | Peng Hu, Xi Peng, Hongyuan Zhu, Liangli Zhen, Jie Lin |
7087 | How Well Do Self-Supervised Models Transfer? | Linus Ericsson, Henry Gouk, Timothy M. Hospedales |
2643 | Generic Perceptual Loss for Modeling Structured Output Dependencies | Yifan Liu, Hao Chen, Yu Chen, Wei Yin, Chunhua Shen |
7564 | EDNet: Efficient Disparity Estimation With Cost Volume Combination and Attention-Based Spatial Residual | Songyan Zhang, Zhicheng Wang, Qiang Wang, Jinshuo Zhang, Gang Wei, Xiaowen Chu |
1909 | BoxInst: High-Performance Instance Segmentation With Box Annotations | Zhi Tian, Chunhua Shen, Xinlong Wang, Hao Chen |
2060 | PhySG: Inverse Rendering With Spherical Gaussians for Physics-Based Material Editing and Relighting | Kai Zhang, Fujun Luan, Qianqian Wang, Kavita Bala, Noah Snavely |
458 | MaX-DeepLab: End-to-End Panoptic Segmentation With Mask Transformers | Huiyu Wang, Yukun Zhu, Hartwig Adam, Alan Yuille, Liang-Chieh Chen |
3092 | Scale-Aware Graph Neural Network for Few-Shot Semantic Segmentation | Guo-Sen Xie, Jie Liu, Huan Xiong, Ling Shao |
8742 | Part-Aware Panoptic Segmentation | Daan de Geus, Panagiotis Meletis, Chenyang Lu, Xiaoxiao Wen, Gijs Dubbelman |
4107 | Railroad Is Not a Train: Saliency As Pseudo-Pixel Supervision for Weakly Supervised Semantic Segmentation | Seungho Lee, Minhyun Lee, Jongwuk Lee, Hyunjung Shim |
6035 | Mask-Embedded Discriminator With Region-Based Semantic Regularization for Semi-Supervised Class-Conditional Image Synthesis | Yi Liu, Xiaoyang Huo, Tianyi Chen, Xiangping Zeng, Si Wu, Zhiwen Yu, Hau-San Wong |
1953 | Unsupervised Hyperbolic Representation Learning via Message Passing Auto-Encoders | Jiwoong Park, Junho Cho, Hyung Jin Chang, Jin Young Choi |
3346 | 4D Panoptic LiDAR Segmentation | Mehmet Aygün, Aljoša Ošep, Mark Weber, Maxim Maximov, Cyrill Stachniss, Jens Behley, Laura Leal-Taixé |
3147 | EffiScene: Efficient Per-Pixel Rigidity Inference for Unsupervised Joint Learning of Optical Flow, Depth, Camera Pose and Motion Segmentation | Yang Jiao, Trac D. Tran, Guangming Shi |
2976 | Learning by Aligning Videos in Time | Sanjay Haresh, Sateesh Kumar, Huseyin Coskun, Shahram N. Syed, Andrey Konin, Zeeshan Zia, Quoc-Huy Tran |
1786 | Modular Interactive Video Object Segmentation: Interaction-to-Mask, Propagation and Difference-Aware Fusion | Ho Kei Cheng, Yu-Wing Tai, Chi-Keung Tang |
5785 | Polygonal Point Set Tracking | Gunhee Nam, Miran Heo, Seoung Wug Oh, Joon-Young Lee, Seon Joo Kim |
7322 | VinVL: Revisiting Visual Representations in Vision-Language Models | Pengchuan Zhang, Xiujun Li, Xiaowei Hu, Jianwei Yang, Lei Zhang, Lijuan Wang, Yejin Choi, Jianfeng Gao |
3268 | Visual Semantic Role Labeling for Video Understanding | Arka Sadhu, Tanmay Gupta, Mark Yatskar, Ram Nevatia, Aniruddha Kembhavi |
2308 | Can Audio-Visual Integration Strengthen Robustness Under Multimodal Attacks? | Yapeng Tian, Chenliang Xu |
1835 | Relation-Aware Instance Refinement for Weakly Supervised Visual Grounding | Yongfei Liu, Bo Wan, Lin Ma, Xuming He |
10687 | Learning Better Visual Dialog Agents With Pretrained Visual-Linguistic Representation | Tao Tu, Qing Ping, Govindarajan Thattai, Gokhan Tur, Prem Natarajan |
1154 | Separating Skills and Concepts for Novel Visual Question Answering | Spencer Whitehead, Hui Wu, Heng Ji, Rogerio Feris, Kate Saenko |
2281 | Generating Manga From Illustrations via Mimicking Manga Creation Workflow | Lvmin Zhang, Xinrui Wang, Qingnan Fan, Yi Ji, Chunping Liu |
3476 | SelfDoc: Self-Supervised Document Representation Learning | Peizhao Li, Jiuxiang Gu, Jason Kuen, Vlad I. Morariu, Handong Zhao, Rajiv Jain, Varun Manjunatha, Hongfu Liu |
11090 | Affect2MM: Affective Analysis of Multimedia Content Using Emotion Causality | Trisha Mittal, Puneet Mathur, Aniket Bera, Dinesh Manocha |
2542 | Vectorization and Rasterization: Self-Supervised Learning for Sketch and Handwriting | Ayan Kumar Bhunia, Pinaki Nath Chowdhury, Yongxin Yang, Timothy M. Hospedales, Tao Xiang, Yi-Zhe Song |
Date: Tuesday, June 22, 2021 22:00 – 24:30
Paper Session Five:
Paper ID | Paper Title | Authors |
2677 | Layer-Wise Searching for 1-Bit Detectors | Sheng Xu, Junhe Zhao, Jinhu Lü, Baochang Zhang, Shumin Han, David Doermann |
1285 | Weakly Supervised Learning of Rigid 3D Scene Flow | Zan Gojcic, Or Litany, Andreas Wieser, Leonidas J. Guibas, Tolga Birdal |
3494 | Learning Compositional Radiance Fields of Dynamic Human Heads | Ziyan Wang, Timur Bagautdinov, Stephen Lombardi, Tomas Simon, Jason Saragih, Jessica Hodgins, Michael Zollhöfer |
2703 | Learning Accurate Dense Correspondences and When To Trust Them | Prune Truong, Martin Danelljan, Luc Van Gool, Radu Timofte |
5094 | RSN: Range Sparse Net for Efficient, Accurate LiDAR 3D Object Detection | Pei Sun, Weiyue Wang, Yuning Chai, Gamaleldin Elsayed, Alex Bewley, Xiao Zhang, Cristian Sminchisescu, Dragomir Anguelov |
8619 | LAFEAT: Piercing Through Adversarial Defenses With Latent Features | Yunrui Yu, Xitong Gao, Cheng-Zhong Xu |
2715 | Function4D: Real-Time Human Volumetric Capture From Very Sparse Consumer RGBD Sensors | Tao Yu, Zerong Zheng, Kaiwen Guo, Pengpeng Liu, Qionghai Dai, Yebin Liu |
5093 | Polka Lines: Learning Structured Illumination and Reconstruction for Active Stereo | Seung-Hwan Baek, Felix Heide |
5040 | FBI-Denoiser: Fast Blind Image Denoiser for Poisson-Gaussian Noise | Jaeseok Byun, Sungmin Cha, Taesup Moon |
6136 | Face Forensics in the Wild | Tianfei Zhou, Wenguan Wang, Zhiyuan Liang, Jianbing Shen |
1664 | Exploring Adversarial Fake Images on Face Manifold | Dongze Li, Wei Wang, Hongxing Fan, Jing Dong |
3471 | Pi-GAN: Periodic Implicit Generative Adversarial Networks for 3D-Aware Image Synthesis | Eric R. Chan, Marco Monteiro, Petr Kellnhofer, Jiajun Wu, Gordon Wetzstein |
5563 | Animating Pictures With Eulerian Motion Fields | Aleksander Holynski, Brian L. Curless, Steven M. Seitz, Richard Szeliski |
4843 | DriveGAN: Towards a Controllable High-Quality Neural Simulation | Seung Wook Kim, Jonah Philion, Antonio Torralba, Sanja Fidler |
8142 | Towards Open World Object Detection | K J Joseph, Salman Khan, Fahad Shahbaz Khan, Vineeth N Balasubramanian |
8778 | DiNTS: Differentiable Neural Network Topology Search for 3D Medical Image Segmentation | Yufan He, Dong Yang, Holger Roth, Can Zhao, Daguang Xu |
4887 | Siamese Natural Language Tracker: Tracking by Natural Language Descriptions With Siamese Trackers | Qi Feng, Vitaly Ablavsky, Qinxun Bai, Stan Sclaroff |
2551 | Where and What? Examining Interpretable Disentangled Representations | Xinqi Zhu, Chang Xu, Dacheng Tao |
8346 | Prototype Augmentation and Self-Supervision for Incremental Learning | Fei Zhu, Xu-Yao Zhang, Chuang Wang, Fei Yin, Cheng-Lin Liu |
11578 | Brain Image Synthesis With Unsupervised Multivariate Canonical CSCl4Net | Yawen Huang, Feng Zheng, Danyang Wang, Weilin Huang, Matthew R. Scott, Ling Shao |
3386 | Polygonal Building Extraction by Frame Field Learning | Nicolas Girard, Dmitriy Smirnov, Justin Solomon, Yuliya Tarabalka |
10853 | InverseForm: A Loss Function for Structured Boundary-Aware Segmentation | Shubhankar Borse, Ying Wang, Yizhe Zhang, Fatih Porikli |
3473 | SSTVOS: Sparse Spatiotemporal Transformers for Video Object Segmentation | Brendan Duke, Abdalla Ahmed, Christian Wolf, Parham Aarabi, Graham W. Taylor |
5618 | Visual Room Rearrangement | Luca Weihs, Matt Deitke, Aniruddha Kembhavi, Roozbeh Mottaghi |
3931 | A Deep Emulator for Secondary Motion of 3D Characters | Mianlun Zheng, Yi Zhou, Duygu Ceylan, Jernej Barbič |
1582 | Interactive Self-Training With Mean Teachers for Semi-Supervised Object Detection | Qize Yang, Xihan Wei, Biao Wang, Xian-Sheng Hua, Lei Zhang |
4174 | UniT: Unified Knowledge Transfer for Any-Shot Object Detection and Segmentation | Siddhesh Khandelwal, Raghav Goyal, Leonid Sigal |
7446 | Unsupervised Object Detection With LIDAR Clues | Hao Tian, Yuntao Chen, Jifeng Dai, Zhaoxiang Zhang, Xizhou Zhu |
6919 | Implicit Feature Alignment: Learn To Convert Text Recognizer to Text Spotter | Tianwei Wang, Yuanzhi Zhu, Lianwen Jin, Dezhi Peng, Zhe Li, Mengchao He, Yongpan Wang, Canjie Luo |
6806 | Self-Attention Based Text Knowledge Mining for Text Detection | Qi Wan, Haoqin Ji, Linlin Shen |
1441 | Shallow Feature Matters for Weakly Supervised Object Localization | Jun Wei, Qin Wang, Zhen Li, Sheng Wang, S. Kevin Zhou, Shuguang Cui |
1493 | Self-Supervised 3D Mesh Reconstruction From Single Images | Tao Hu, Liwei Wang, Xiaogang Xu, Shu Liu, Jiaya Jia |
5189 | Sketch2Model: View-Aware 3D Modeling From Single Free-Hand Sketches | Song-Hai Zhang, Yuan-Chen Guo, Qing-Wen Gu |
4646 | Learning Parallel Dense Correspondence From Spatio-Temporal Descriptors for Efficient and Robust 4D Reconstruction | Jiapeng Tang, Dan Xu, Kui Jia, Lei Zhang |
2466 | Refer-It-in-RGBD: A Bottom-Up Approach for 3D Visual Grounding in RGBD Images | Haolin Liu, Anran Lin, Xiaoguang Han, Lei Yang, Yizhou Yu, Shuguang Cui |
4139 | VoxelContext-Net: An Octree Based Framework for Point Cloud Compression | Zizheng Que, Guo Lu, Dong Xu |
8215 | CorrNet3D: Unsupervised End-to-End Learning of Dense Correspondence for 3D Point Clouds | Yiming Zeng, Yue Qian, Zhiyu Zhu, Junhui Hou, Hui Yuan, Ying He |
10371 | Inferring CAD Modeling Sequences Using Zone Graphs | Xianghao Xu, Wenzhe Peng, Chin-Yi Cheng, Karl D.D. Willis, Daniel Ritchie |
4307 | Seeing Behind Objects for 3D Multi-Object Tracking in RGB-D Sequences | Norman Müller, Yu-Shiang Wong, Niloy J. Mitra, Angela Dai, Matthias Nießner |
1377 | View Generalization for Single Image Textured 3D Models | Anand Bhattad, Aysegul Dundar, Guilin Liu, Andrew Tao, Bryan Catanzaro |
2543 | A Decomposition Model for Stereo Matching | Chengtang Yao, Yunde Jia, Huijun Di, Pengxiang Li, Yuwei Wu |
10955 | VS-Net: Voting With Segmentation for Visual Localization | Zhaoyang Huang, Han Zhou, Yijin Li, Bangbang Yang, Yan Xu, Xiaowei Zhou, Hujun Bao, Guofeng Zhang, Hongsheng Li |
4615 | MonoRec: Semi-Supervised Dense Reconstruction in Dynamic Environments From a Single Moving Camera | Felix Wimbauer, Nan Yang, Lukas von Stumberg, Niclas Zeller, Daniel Cremers |
10304 | Shape and Material Capture at Home | Daniel Lichy, Jiaye Wu, Soumyadip Sengupta, David W. Jacobs |
1212 | Offboard 3D Object Detection From Point Cloud Sequences | Charles R. Qi, Yin Zhou, Mahyar Najibi, Pei Sun, Khoa Vo, Boyang Deng, Dragomir Anguelov |
4205 | M3DSSD: Monocular 3D Single Stage Object Detector | Shujie Luo, Hang Dai, Ling Shao, Yong Ding |
1812 | 2D or Not 2D? Adaptive 3D Convolution Selection for Efficient Video Recognition | Hengduo Li, Zuxuan Wu, Abhinav Shrivastava, Larry S. Davis |
3850 | Deep Analysis of CNN-Based Spatio-Temporal Representations for Action Recognition | Chun-Fu Richard Chen, Rameswar Panda, Kandan Ramakrishnan, Rogerio Feris, John Cohn, Aude Oliva, Quanfu Fan |
2266 | The Blessings of Unlabeled Background in Untrimmed Videos | Yuan Liu, Jingyuan Chen, Zhenfang Chen, Bing Deng, Jianqiang Huang, Hanwang Zhang |
1767 | PointGuard: Provably Robust 3D Point Cloud Classification | Hongbin Liu, Jinyuan Jia, Neil Zhenqiang Gong |
7898 | DSRNA: Differentiable Search of Robust Neural Architectures | Ramtin Hosseini, Xingyi Yang, Pengtao Xie |
4844 | Backdoor Attacks Against Deep Learning Systems in the Physical World | Emily Wenger, Josephine Passananti, Arjun Nitin Bhagoji, Yuanshun Yao, Haitao Zheng, Ben Y. Zhao |
4971 | Riggable 3D Face Reconstruction via In-Network Optimization | Ziqian Bai, Zhaopeng Cui, Xiaoming Liu, Ping Tan |
2678 | NeuralHumanFVV: Real-Time Neural Volumetric Human Performance Rendering Using RGB Cameras | Xin Suo, Yuheng Jiang, Pei Lin, Yingliang Zhang, Minye Wu, Kaiwen Guo, Lan Xu |
1439 | Context Modeling in 3D Human Pose Estimation: A Unified Perspective | Xiaoxuan Ma, Jiajun Su, Chunyu Wang, Hai Ci, Yizhou Wang |
1162 | Dive Into Ambiguity: Latent Distribution Mining and Pairwise Uncertainty Estimation for Facial Expression Recognition | Jiahui She, Yibo Hu, Hailin Shi, Jun Wang, Qiu Shen, Tao Mei |
10601 | Lifting 2D StyleGAN for 3D-Aware Face Generation | Yichun Shi, Divyansh Aggarwal, Anil K. Jain |
3006 | Hybrid Message Passing With Performance-Driven Structures for Facial Action Unit Detection | Tengfei Song, Zijun Cui, Wenming Zheng, Qiang Ji |
837 | Learning To Generalize Unseen Domains via Memory-Based Multi-Source Meta-Learning for Person Re-Identification | Yuyang Zhao, Zhun Zhong, Fengxiang Yang, Zhiming Luo, Yaojin Lin, Shaozi Li, Nicu Sebe |
3736 | Invertible Image Signal Processing | Yazhou Xing, Zian Qian, Qifeng Chen |
5579 | End-to-End High Dynamic Range Camera Pipeline Optimization | Nicolas Robidoux, Luis E. García Capel, Dong-eun Seo, Avinash Sharma, Federico Ariza, Felix Heide |
3087 | Blind Deblurring for Saturated Images | Liang Chen, Jiawei Zhang, Songnan Lin, Faming Fang, Jimmy S. Ren |
11535 | Extreme Low-Light Environment-Driven Image Denoising Over Permanently Shadowed Lunar Regions With a Physical Noise Model | Ben Moseley, Valentin Bickel, Ignacio G. López-Francos, Loveneesh Rana |
8642 | Controlling the Rain: From Removal to Rendering | Siqi Ni, Xueyun Cao, Tao Yue, Xuemei Hu |
340 | De-Rendering the World’s Revolutionary Artefacts | Shangzhe Wu, Ameesh Makadia, Jiajun Wu, Noah Snavely, Richard Tucker, Angjoo Kanazawa |
4703 | Progressively Complementary Network for Fisheye Image Rectification Using Appearance Flow | Shangrong Yang, Chunyu Lin, Kang Liao, Chunjie Zhang, Yao Zhao |
4442 | High-Speed Image Reconstruction Through Short-Term Plasticity for Spiking Cameras | Yajing Zheng, Lingxiao Zheng, Zhaofei Yu, Boxin Shi, Yonghong Tian, Tiejun Huang |
886 | MASA-SR: Matching Acceleration and Spatial Adaptation for Reference-Based Image Super-Resolution | Liying Lu, Wenbo Li, Xin Tao, Jiangbo Lu, Jiaya Jia |
6315 | Single Pair Cross-Modality Super Resolution | Guy Shacht, Dov Danon, Sharon Fogel, Daniel Cohen-Or |
3475 | Temporal Modulation Network for Controllable Space-Time Video Super-Resolution | Gang Xu, Jun Xu, Zhen Li, Liang Wang, Xing Sun, Ming-Ming Cheng |
3446 | The Multi-Temporal Urban Development SpaceNet Dataset | Adam Van Etten, Daniel Hogan, Jesus Martinez Manso, Jacob Shermeyer, Nicholas Weir, Ryan Lewis |
9937 | Euro-PVI: Pedestrian Vehicle Interactions in Dense Urban Centers | Apratim Bhattacharyya, Daniel Olmeda Reino, Mario Fritz, Bernt Schiele |
9933 | AttentiveNAS: Improving Neural Architecture Search via Attentive Sampling | Dilin Wang, Meng Li, Chengyue Gong, Vikas Chandra |
1462 | Learning Student Networks in the Wild | Hanting Chen, Tianyu Guo, Chang Xu, Wenshuo Li, Chunjing Xu, Chao Xu, Yunhe Wang |
2617 | Towards Compact CNNs via Collaborative Compression | Yuchao Li, Shaohui Lin, Jianzhuang Liu, Qixiang Ye, Mengdi Wang, Fei Chao, Fan Yang, Jincheng Ma, Qi Tian, Rongrong Ji |
4631 | Network Quantization With Element-Wise Gradient Scaling | Junghyup Lee, Dohyung Kim, Bumsub Ham |
10966 | Frequency-Aware Discriminative Feature Learning Supervised by Single-Center Loss for Face Forgery Detection | Jiaming Li, Hongtao Xie, Jiahong Li, Zhongyuan Wang, Yongdong Zhang |
11528 | Building Reliable Explanations of Unreliable Neural Networks: Locally Smoothing Perspective of Model Interpretation | Dohun Lim, Hyeonseok Lee, Sungchan Kim |
5056 | Perceptual Indistinguishability-Net (PI-Net): Facial Image Obfuscation With Manipulable Semantics | Jia-Wei Chen, Li-Ju Chen, Chia-Mu Yu, Chun-Shien Lu |
3915 | Coming Down to Earth: Satellite-to-Street View Synthesis for Geo-Localization | Aysim Toker, Qunjie Zhou, Maxim Maximov, Laura Leal-Taixé |
1264 | Neural Scene Flow Fields for Space-Time View Synthesis of Dynamic Scenes | Zhengqi Li, Simon Niklaus, Noah Snavely, Oliver Wang |
7778 | Not Just Compete, but Collaborate: Local Image-to-Image Translation via Cooperative Mask Prediction | Daejin Kim, Mohammad Azam Khan, Jaegul Choo |
6651 | Posterior Promoted GAN With Distribution Discriminator for Unsupervised Image Synthesis | Xianchao Zhang, Ziyang Cheng, Xiaotong Zhang, Han Liu |
1519 | Surrogate Gradient Field for Latent Space Manipulation | Minjun Li, Yanghua Jin, Huachun Zhu |
4279 | Image Inpainting Guided by Coherence Priors of Semantics and Textures | Liang Liao, Jing Xiao, Zheng Wang, Chia-Wen Lin, Shin'ichi Satoh |
3233 | Spatially-Invariant Style-Codes Controlled Makeup Transfer | Han Deng, Chu Han, Hongmin Cai, Guoqiang Han, Shengfeng He |
7856 | Memory-Guided Unsupervised Image-to-Image Translation | Somi Jeong, Youngjung Kim, Eungbean Lee, Kwanghoon Sohn |
5568 | Hierarchical Motion Understanding via Motion Programs | Sumith Kulal, Jiayuan Mao, Alex Aiken, Jiajun Wu |
11574 | Adaptive Rank Estimate in Robust Principal Component Analysis | Zhengqin Xu, Rui He, Shoulie Xie, Shiqian Wu |
1500 | Deep Animation Video Interpolation in the Wild | Li Siyao, Shiyu Zhao, Weijiang Yu, Wenxiu Sun, Dimitris Metaxas, Chen Change Loy, Ziwei Liu |
6889 | ECKPN: Explicit Class Knowledge Propagation Network for Transductive Few-Shot Learning | Chaofan Chen, Xiaoshan Yang, Changsheng Xu, Xuhui Huang, Zhe Ma |
4337 | Multi-Objective Interpolation Training for Robustness To Label Noise | Diego Ortego, Eric Arazo, Paul Albert, Noel E. O'Connor, Kevin McGuinness |
8295 | T-vMF Similarity for Regularizing Intra-Class Feature Distribution | Takumi Kobayashi |
9891 | Disentangling Label Distribution for Long-Tailed Visual Recognition | Youngkyu Hong, Seungju Han, Kwanghee Choi, Seokjun Seo, Beomsu Kim, Buru Chang |
2124 | Leveraging the Availability of Two Cameras for Illuminant Estimation | Abdelrahman Abdelhamed, Abhijith Punnappurath, Michael S. Brown |
665 | Decoupled Dynamic Filter Networks | Jingkai Zhou, Varun Jampani, Zhixiong Pi, Qiong Liu, Ming-Hsuan Yang |
4042 | Rethinking Graph Neural Architecture Search From Message-Passing | Shaofei Cai, Liang Li, Jincan Deng, Beichen Zhang, Zheng-Jun Zha, Li Su, Qingming Huang |
7833 | Towards Improving the Consistency, Efficiency, and Flexibility of Differentiable Neural Architecture Search | Yibo Yang, Shan You, Hongyang Li, Fei Wang, Chen Qian, Zhouchen Lin |
3445 | Unsupervised Visual Attention and Invariance for Reinforcement Learning | Xudong Wang, Long Lian, Stella X. Yu |
7331 | Mol2Image: Improved Conditional Flow Models for Molecule to Image Synthesis | Karren Yang, Samuel Goldman, Wengong Jin, Alex X. Lu, Regina Barzilay, Tommi Jaakkola, Caroline Uhler |
11819 | TSGCNet: Discriminative Geometric Feature Learning With Two-Stream Graph Convolutional Network for 3D Dental Model Segmentation | Lingming Zhang, Yue Zhao, Deyu Meng, Zhiming Cui, Chenqiang Gao, Xinbo Gao, Chunfeng Lian, Dinggang Shen |
4215 | IoU Attack: Towards Temporally Coherent Black-Box Adversarial Attack for Visual Object Tracking | Shuai Jia, Yibing Song, Chao Ma, Xiaokang Yang |
5594 | GMOT-40: A Benchmark for Generic Multiple Object Tracking | Hexin Bai, Wensheng Cheng, Peng Chu, Juehuan Liu, Kai Zhang, Haibin Ling |
5806 | Combined Depth Space Based Architecture Search for Person Re-Identification | Hanjun Li, Gaojie Wu, Wei-Shi Zheng |
11627 | Learning an Explicit Weighting Scheme for Adapting Complex HSI Noise | Xiangyu Rui, Xiangyong Cao, Qi Xie, Zongsheng Yue, Qian Zhao, Deyu Meng |
2504 | VaB-AL: Incorporating Class Imbalance and Difficulty With Variational Bayes for Active Learning | Jongwon Choi, Kwang Moo Yi, Jihoon Kim, Jinho Choo, Byoungjip Kim, Jinyeop Chang, Youngjune Gwon, Hyung Jin Chang |
8156 | Learning a Facial Expression Embedding Disentangled From Identity | Wei Zhang, Xianpeng Ji, Keyu Chen, Yu Ding, Changjie Fan |
2939 | SRDAN: Scale-Aware and Range-Aware Domain Adaptation Network for Cross-Dataset 3D Object Detection | Weichen Zhang, Wen Li, Dong Xu |
5079 | Regressive Domain Adaptation for Unsupervised Keypoint Detection | Junguang Jiang, Yifei Ji, Ximei Wang, Yufeng Liu, Jianmin Wang, Mingsheng Long |
4340 | Uncertainty-Guided Model Generalization to Unseen Domains | Fengchun Qiao, Xi Peng |
3342 | Self-Promoted Prototype Refinement for Few-Shot Class-Incremental Learning | Kai Zhu, Yang Cao, Wei Zhai, Jie Cheng, Zheng-Jun Zha |
3079 | Noise-Resistant Deep Metric Learning With Ranking-Based Instance Selection | Chang Liu, Han Yu, Boyang Li, Zhiqi Shen, Zhanning Gao, Peiran Ren, Xuansong Xie, Lizhen Cui, Chunyan Miao |
5729 | DAT: Training Deep Networks Robust To Label-Noise by Matching the Feature Distributions | Yuntao Qu, Shasha Mo, Jianwei Niu |
7159 | OBoW: Online Bag-of-Visual-Words Generation for Self-Supervised Learning | Spyros Gidaris, Andrei Bursuc, Gilles Puy, Nikos Komodakis, Matthieu Cord, Patrick Pérez |
5169 | Learning Affinity-Aware Upsampling for Deep Image Matting | Yutong Dai, Hao Lu, Chunhua Shen |
8373 | PLADE-Net: Towards Pixel-Level Accuracy for Self-Supervised Single-View Depth Estimation With Neural Positional Encoding and Distilled Matting Loss | Juan Luis Gonzalez, Munchurl Kim |
2649 | RefineMask: Towards High-Quality Instance Segmentation With Fine-Grained Features | Gang Zhang, Xin Lu, Jingru Tan, Jianmin Li, Zhaoxiang Zhang, Quanquan Li, Xiaolin Hu |
6116 | CompositeTasking: Understanding Images by Spatial Composition of Tasks | Nikola Popović, Danda Pani Paudel, Thomas Probst, Guolei Sun, Luc Van Gool |
1035 | Rethinking Semantic Segmentation From a Sequence-to-Sequence Perspective With Transformers | Sixiao Zheng, Jiachen Lu, Hengshuang Zhao, Xiatian Zhu, Zekun Luo, Yabiao Wang, Yanwei Fu, Jianfeng Feng, Tao Xiang, Philip H.S. Torr, Li Zhang |
3168 | FSDR: Frequency Space Domain Randomization for Domain Generalization | Jiaxing Huang, Dayan Guan, Aoran Xiao, Shijian Lu |
2838 | Transformation Driven Visual Reasoning | Xin Hong, Yanyan Lan, Liang Pang, Jiafeng Guo, Xueqi Cheng |
4630 | Background-Aware Pooling and Noise-Aware Loss for Weakly-Supervised Semantic Segmentation | Youngmin Oh, Beomjun Kim, Bumsub Ham |
8453 | Adaptive Consistency Regularization for Semi-Supervised Transfer Learning | Abulikemu Abuduweili, Xingjian Li, Humphrey Shi, Cheng-Zhong Xu, Dejing Dou |
2332 | Self-Generated Defocus Blur Detection via Dual Adversarial Discriminators | Wenda Zhao, Cai Shang, Huchuan Lu |
1397 | Ego-Exo: Transferring Visual Representations From Third-Person to First-Person Videos | Yanghao Li, Tushar Nagarajan, Bo Xiong, Kristen Grauman |
3191 | PV-RAFT: Point-Voxel Correlation Fields for Scene Flow Estimation of Point Clouds | Yi Wei, Ziyi Wang, Yongming Rao, Jiwen Lu, Jie Zhou |
3876 | Spatiotemporal Contrastive Video Representation Learning | Rui Qian, Tianjian Meng, Boqing Gong, Ming-Hsuan Yang, Huisheng Wang, Serge Belongie, Yin Cui |
1866 | Deep Video Matting via Spatio-Temporal Alignment and Aggregation | Yanan Sun, Guanzhi Wang, Qiao Gu, Chi-Keung Tang, Yu-Wing Tai |
5988 | Target-Aware Object Discovery and Association for Unsupervised Video Multi-Object Segmentation | Tianfei Zhou, Jianwu Li, Xueyi Li, Ling Shao |
10384 | Multimodal Contrastive Training for Visual Representation Learning | Xin Yuan, Zhe Lin, Jason Kuen, Jianming Zhang, Yilin Wang, Michael Maire, Ajinkya Kale, Baldo Faieta |
5443 | Vx2Text: End-to-End Learning of Video-Based Text Generation From Multimodal Inputs | Xudong Lin, Gedas Bertasius, Jue Wang, Shih-Fu Chang, Devi Parikh, Lorenzo Torresani |
3368 | Distilling Audio-Visual Knowledge by Compositional Contrastive Learning | Yanbei Chen, Yongqin Xian, A. Sophia Koepke, Ying Shan, Zeynep Akata |
2292 | Structured Multi-Level Interaction Network for Video Moment Localization via Language Query | Hao Wang, Zheng-Jun Zha, Liang Li, Dong Liu, Jiebo Luo |
2590 | Scene-Intuitive Agent for Remote Embodied Visual Grounding | Xiangru Lin, Guanbin Li, Yizhou Yu |
1226 | Domain-Robust VQA With Diverse Datasets and Methods but No Target Labels | Mingda Zhang, Tristan Maidment, Ahmad Diab, Adriana Kovashka, Rebecca Hwa |
3662 | Composing Photos Like a Photographer | Chaoyi Hong, Shuaiyuan Du, Ke Xian, Hao Lu, Zhiguo Cao, Weicai Zhong |
8331 | Dogfight: Detecting Drones From Drones Videos | Muhammad Waseem Ashraf, Waqas Sultani, Mubarak Shah |
427 | Multi-Modal Fusion Transformer for End-to-End Autonomous Driving | Aditya Prakash, Kashyap Chitta, Andreas Geiger |
4018 | Cloud2Curve: Generation and Vectorization of Parametric Sketches | Ayan Das, Yongxin Yang, Timothy M. Hospedales, Tao Xiang, Yi-Zhe Song |
Date: Wednesday, June 23, 2021 6:00 – 8:30
Paper Session Six:
Paper ID | Paper Title | Authors |
5142 | Read Like Humans: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Recognition | Shancheng Fang, Hongtao Xie, Yuxin Wang, Zhendong Mao, Yongdong Zhang |
2270 | MultiBodySync: Multi-Body Segmentation and Motion Estimation via 3D Scan Synchronization | Jiahui Huang, He Wang, Tolga Birdal, Minhyuk Sung, Federica Arrigoni, Shi-Min Hu, Leonidas J. Guibas |
5041 | NeuTex: Neural Texture Mapping for Volumetric Neural Rendering | Fanbo Xiang, Zexiang Xu, Miloš Hašan, Yannick Hold-Geoffroy, Kalyan Sunkavalli, Hao Su |
414 | UnsupervisedR&R: Unsupervised Point Cloud Registration via Differentiable Rendering | Mohamed El Banani, Luya Gao, Justin Johnson |
6098 | RangeIoUDet: Range Image Based Real-Time 3D Object Detector Optimized by Intersection Over Union | Zhidong Liang, Zehan Zhang, Ming Zhang, Xian Zhao, Shiliang Pu |
10357 | Architectural Adversarial Robustness: The Case for Deep Pursuit | George Cazenavette, Calvin Murdock, Simon Lucey |
457 | SimPoE: Simulated Character Control for 3D Human Pose Estimation | Ye Yuan, Shih-En Wei, Tomas Simon, Kris Kitani, Jason Saragih |
5450 | CodedStereo: Learned Phase Masks for Large Depth-of-Field Stereo | Shiyu Tan, Yicheng Wu, Shoou-I Yu, Ashok Veeraraghavan |
7815 | PSD: Principled Synthetic-to-Real Dehazing Guided by Physical Priors | Zeyuan Chen, Yangchao Wang, Yang Yang, Dong Liu |
10082 | OpenRooms: An Open Framework for Photorealistic Indoor Scene Datasets | Zhengqin Li, Ting-Wei Yu, Shen Sang, Sarah Wang, Meng Song, Yuhan Liu, Yu-Ying Yeh, Rui Zhu, Nitesh Gundavarapu, Jia Shi, Sai Bi, Hong-Xing Yu, Zexiang Xu, Kalyan Sunkavalli, Miloš Hašan, Ravi Ramamoorthi, Manmohan Chandraker |
7067 | A Closer Look at Fourier Spectrum Discrepancies for CNN-Generated Images Detection | Keshigeyan Chandrasegaran, Ngoc-Trung Tran, Ngai-Man Cheung |
3702 | NeRF in the Wild: Neural Radiance Fields for Unconstrained Photo Collections | Ricardo Martin-Brualla, Noha Radwan, Mehdi S. M. Sajjadi, Jonathan T. Barron, Alexey Dosovitskiy, Daniel Duckworth |
7596 | ID-Unet: Iterative Soft and Hard Deformation for View Synthesis | Mingyu Yin, Li Sun, Qingli Li |
4877 | GeoSim: Realistic Video Simulation via Geometry-Aware Composition for Self-Driving | Yun Chen, Frieda Rong, Shivam Duggal, Shenlong Wang, Xinchen Yan, Sivabalan Manivasagam, Shangjie Xue, Ersin Yumer, Raquel Urtasun |
4070 | All Labels Are Not Created Equal: Enhancing Semi-Supervision via Label Grouping and Co-Training | Islam Nassar, Samitha Herath, Ehsan Abbasnejad, Wray Buntine, Gholamreza Haffari |
11399 | Orthogonal Over-Parameterized Training | Weiyang Liu, Rongmei Lin, Zhen Liu, James M. Rehg, Liam Paull, Li Xiong, Le Song, Adrian Weller |
11659 | DeepTag: An Unsupervised Deep Learning Method for Motion Tracking on Cardiac Tagging Magnetic Resonance Images | Meng Ye, Mikael Kanski, Dong Yang, Qi Chang, Zhennan Yan, Qiaoying Huang, Leon Axel, Dimitris Metaxas |
2819 | Transferable Query Selection for Active Domain Adaptation | Bo Fu, Zhangjie Cao, Jianmin Wang, Mingsheng Long |
5388 | When Age-Invariant Face Recognition Meets Face Age Synthesis: A Multi-Task Learning Framework | Zhizhong Huang, Junping Zhang, Hongming Shan |
3899 | Simpler Certified Radius Maximization by Propagating Covariances | Xingjian Zhen, Rudrasis Chakraborty, Vikas Singh |
3741 | Improving Panoptic Segmentation at All Scales | Lorenzo Porzi, Samuel Rota Bulò, Peter Kontschieder |
4411 | Learning Triadic Belief Dynamics in Nonverbal Communication From Videos | Lifeng Fan, Shuwen Qiu, Zilong Zheng, Tao Gao, Song-Chun Zhu, Yixin Zhu |
4551 | Guided Interactive Video Object Segmentation Using Reliability-Based Attention Maps | Yuk Heo, Yeong Jun Koh, Chang-Su Kim |
4263 | Less Is More: ClipBERT for Video-and-Language Learning via Sparse Sampling | Jie Lei, Linjie Li, Luowei Zhou, Zhe Gan, Tamara L. Berg, Mohit Bansal, Jingjing Liu |
5359 | Im2Vec: Synthesizing Vector Graphics Without Vector Supervision | Pradyumna Reddy, Michaël Gharbi, Michal Lukáč, Niloy J. Mitra |
4163 | FSCE: Few-Shot Object Detection via Contrastive Proposal Encoding | Bo Sun, Banghuai Li, Shengcai Cai, Ye Yuan, Chi Zhang |
4621 | Beyond Max-Margin: Class Margin Equilibrium for Few-Shot Object Detection | Bohao Li, Boyu Yang, Chang Liu, Feng Liu, Rongrong Ji, Qixiang Ye |
10441 | Dynamic Head: Unifying Object Detection Heads With Attentions | Xiyang Dai, Yinpeng Chen, Bin Xiao, Dongdong Chen, Mengchen Liu, Lu Yuan, Lei Zhang |
7270 | Dictionary-Guided Scene Text Recognition | Nguyen Nguyen, Thu Nguyen, Vinh Tran, Minh-Triet Tran, Thanh Duc Ngo, Thien Huu Nguyen, Minh Hoai |
6857 | Progressive Contour Regression for Arbitrary-Shape Scene Text Detection | Pengwen Dai, Sanyi Zhang, Hua Zhang, Xiaochun Cao |
1920 | Strengthen Learning Tolerance for Weakly Supervised Object Localization | Guangyu Guo, Junwei Han, Fang Wan, Dingwen Zhang |
2132 | StruMonoNet: Structure-Aware Monocular 3D Prediction | Zhenpei Yang, Li Erran Li, Qixing Huang |
5382 | Fully Understanding Generic Objects: Modeling, Segmentation, and Reconstruction | Feng Liu, Luan Tran, Xiaoming Liu |
7225 | Exploiting & Refining Depth Distributions With Triangulation Light Curtains | Yaadhav Raaj, Siddharth Ancha, Robert Tamburo, David Held, Srinivasa G. Narasimhan |
2662 | PMP-Net: Point Cloud Completion by Learning Multi-Step Point Moving Paths | Xin Wen, Peng Xiang, Zhizhong Han, Yan-Pei Cao, Pengfei Wan, Wen Zheng, Yu-Shen Liu |
4464 | TearingNet: Point Cloud Autoencoder To Learn Topology-Friendly Representations | Jiahao Pang, Duanshun Li, Dong Tian |
10827 | 3D Object Detection With Pointformer | Xuran Pan, Zhuofan Xia, Shiji Song, Li Erran Li, Gao Huang |
552 | NeuroMorph: Unsupervised Shape Interpolation and Correspondence in One Go | Marvin Eisenberger, David Novotny, Gael Kerchenbaum, Patrick Labatut, Natalia Neverova, Daniel Cremers, Andrea Vedaldi |
5459 | Towards Part-Based Understanding of RGB-D Scans | Alexey Bokhovkin, Vladislav Ishimtsev, Emil Bogomolov, Denis Zorin, Alexey Artemov, Evgeny Burnaev, Angela Dai |
2454 | NeRV: Neural Reflectance and Visibility Fields for Relighting and View Synthesis | Pratul P. Srinivasan, Boyang Deng, Xiuming Zhang, Matthew Tancik, Ben Mildenhall, Jonathan T. Barron |
2709 | Probabilistic Model Distillation for Semantic Correspondence | Xin Li, Deng-Ping Fan, Fan Yang, Ao Luo, Hong Cheng, Zicheng Liu |
1180 | SceneGraphFusion: Incremental 3D Scene Graph Prediction From RGB-D Sequences | Shun-Cheng Wu, Johanna Wald, Keisuke Tateno, Nassir Navab, Federico Tombari |
5815 | Self-Supervised Learning of Depth Inference for Multi-View Stereo | Jiayu Yang, Jose M. Alvarez, Miaomiao Liu |
2995 | Mesoscopic Photogrammetry With an Unstabilized Phone Camera | Kevin C. Zhou, Colin Cooke, Jaehee Park, Ruobing Qian, Roarke Horstmeyer, Joseph A. Izatt, Sina Farsiu |
1579 | LiDAR R-CNN: An Efficient and Universal 3D Object Detector | Zhichao Li, Feng Wang, Naiyan Wang |
4987 | Monocular 3D Object Detection: An Extrinsic Parameter Free Approach | Yunsong Zhou, Yuan He, Hongzi Zhu, Cheng Wang, Hongyang Li, Qinhong Jiang |
2066 | Beyond Short Clips: End-to-End Video-Level Learning With Collaborative Memories | Xitong Yang, Haoqi Fan, Lorenzo Torresani, Larry S. Davis, Heng Wang |
2514 | Multimodal Motion Prediction With Stacked Transformers | Yicheng Liu, Jinghuai Zhang, Liangji Fang, Qinhong Jiang, Bolei Zhou |
3822 | Weakly Supervised Action Selection Learning in Video | Junwei Ma, Satya Krishna Gorti, Maksims Volkovs, Guangwei Yu |
2236 | BASAR: Black-Box Attack on Skeletal Action Recognition | Yunfeng Diao, Tianjia Shao, Yong-Liang Yang, Kun Zhou, He Wang |
10059 | Adversarial Robustness Across Representation Spaces | Pranjal Awasthi, George Yu, Chun-Sung Ferng, Andrew Tomkins, Da-Cheng Juan |
7277 | img2pose: Face Alignment and Detection via 6DoF, Face Pose Estimation | Vítor Albiero, Xingyu Chen, Xi Yin, Guan Pang, Tal Hassner |
7375 | OSTeC: One-Shot Texture Completion | Baris Gecer, Jiankang Deng, Stefanos Zafeiriou |
2691 | Locally Aware Piecewise Transformation Fields for 3D Human Mesh Registration | Shaofei Wang, Andreas Geiger, Siyu Tang |
2831 | Monocular 3D Multi-Person Pose Estimation by Integrating Top-Down and Bottom-Up Networks | Yu Cheng, Bo Wang, Bo Yang, Robby T. Tan |
3651 | Feature Decomposition and Reconstruction Learning for Effective Facial Expression Recognition | Delian Ruan, Yan Yan, Shenqi Lai, Zhenhua Chai, Chunhua Shen, Hanzi Wang |
227 | SDD-FIQA: Unsupervised Face Image Quality Assessment With Similarity Distribution Distance | Fu-Zhao Ou, Xingyu Chen, Ruixin Zhang, Yuge Huang, Shaoxin Li, Jilin Li, Yong Li, Liujuan Cao, Yuan-Gen Wang |
7423 | Facial Action Unit Detection With Transformers | Geethu Miriam Jacob, Björn Stenger |
2426 | Anchor-Free Person Search | Yichao Yan, Jinpeng Li, Jie Qin, Song Bai, Shengcai Liao, Li Liu, Fan Zhu, Ling Shao |
4173 | Neural Camera Simulators | Hao Ouyang, Zifan Shi, Chenyang Lei, Ka Lung Law, Qifeng Chen |
6987 | Neural Auto-Exposure for High-Dynamic Range Object Detection | Emmanuel Onzon, Fahim Mannan, Felix Heide |
3264 | ARVo: Learning All-Range Volumetric Correspondence for Video Deblurring | Dongxu Li, Chenchen Xu, Kaihao Zhang, Xin Yu, Yiran Zhong, Wenqi Ren, Hanna Suominen, Hongdong Li |
288 | Memory Oriented Transfer Learning for Semi-Supervised Image Deraining | Huaibo Huang, Aijing Yu, Ran He |
11692 | Robust Representation Learning With Feedback for Single Image Deraining | Chenghao Chen, Hao Li |
4408 | A Multi-Task Network for Joint Specular Highlight Detection and Removal | Gang Fu, Qing Zhang, Lei Zhu, Ping Li, Chunxia Xiao |
4827 | Panoramic Image Reflection Removal | Yuchen Hong, Qian Zheng, Lingran Zhao, Xudong Jiang, Alex C. Kot, Boxin Shi |
3051 | Turning Frequency to Resolution: Video Super-Resolution via Event Cameras | Yongcheng Jing, Yiding Yang, Xinchao Wang, Mingli Song, Dacheng Tao |
951 | SRWarp: Generalized Image Super-Resolution Under Arbitrary Transformation | Sanghyun Son, Kyoung Mu Lee |
6495 | Learning Scene Structure Guidance via Cross-Task Knowledge Transfer for Single Depth Super-Resolution | Baoli Sun, Xinchen Ye, Baopu Li, Haojie Li, Zhihui Wang, Rui Xu |
7173 | Gated Spatio-Temporal Attention-Guided Video Deblurring | Maitreya Suin, A. N. Rajagopalan |
3908 | Detection, Tracking, and Counting Meets Drones in Crowds: A Benchmark | Longyin Wen, Dawei Du, Pengfei Zhu, Qinghua Hu, Qilong Wang, Liefeng Bo, Siwei Lyu |
10684 | Objectron: A Large Scale Dataset of Object-Centric Videos in the Wild With Pose Annotations | Adel Ahmadyan, Liangkai Zhang, Artsiom Ablavatski, Jianing Wei, Matthias Grundmann |
11470 | Dynamic Domain Adaptation for Efficient Inference | Shuang Li, JinMing Zhang, Wenxuan Ma, Chi Harold Liu, Wei Li |
3225 | General Instance Distillation for Object Detection | Xing Dai, Zeren Jiang, Zhao Wu, Yiping Bao, Zhicheng Wang, Si Liu, Erjin Zhou |
3109 | Data-Free Knowledge Distillation for Image Super-Resolution | Yiman Zhang, Hanting Chen, Xinghao Chen, Yiping Deng, Chunjing Xu, Yunhe Wang |
8145 | Improving Accuracy of Binary Neural Networks Using Unbalanced Activation Distribution | Hyungjun Kim, Jihoon Park, Changhun Lee, Jae-Joon Kim |
7214 | Hijack-GAN: Unintended-Use of Pretrained, Black-Box GANs | Hui-Po Wang, Ning Yu, Mario Fritz |
8263 | Cross Modal Focal Loss for RGBD Face Anti-Spoofing | Anjith George, Sébastien Marcel |
6288 | On the Difficulty of Membership Inference Attacks | Shahbaz Rezaei, Xin Liu |
6705 | Lifelong Person Re-Identification via Adaptive Knowledge Accumulation | Nan Pu, Wei Chen, Yu Liu, Erwin M. Bakker, Michael S. Lew |
1970 | Stereo Radiance Fields (SRF): Learning View Synthesis for Sparse Views of Novel Scenes | Julian Chibane, Aayush Bansal, Verica Lazova, Gerard Pons-Moll |
514 | Regularizing Generative Adversarial Networks Under Limited Data | Hung-Yu Tseng, Lu Jiang, Ce Liu, Ming-Hsuan Yang, Weilong Yang |
7418 | Automatic Correction of Internal Units in Generative Neural Networks | Ali Tousi, Haedong Jeong, Jiyeon Han, Hwanil Choi, Jaesik Choi |
1523 | HistoGAN: Controlling Colors of GAN-Generated and Real Images via Color Histograms | Mahmoud Afifi, Marcus A. Brubaker, Michael S. Brown |
6085 | Prior Based Human Completion | Zibo Zhao, Wen Liu, Yanyu Xu, Xianing Chen, Weixin Luo, Lei Jin, Bohui Zhu, Tong Liu, Binqiang Zhao, Shenghua Gao |
3618 | Diverse Semantic Image Synthesis via Probability Distribution Modeling | Zhentao Tan, Menglei Chai, Dongdong Chen, Jing Liao, Qi Chu, Bin Liu, Gang Hua, Nenghai Yu |
8297 | Adaptive Convolutions for Structure-Aware Style Transfer | Prashanth Chandran, Gaspard Zoss, Paulo Gotardo, Markus Gross, Derek Bradley |
6677 | PISE: Person Image Synthesis and Editing With Decoupled GAN | Jinsong Zhang, Kun Li, Yu-Kun Lai, Jingyu Yang |
4642 | Semi-Supervised Synthesis of High-Resolution Editable Textures for 3D Humans | Bindita Chaudhuri, Nikolaos Sarafianos, Linda Shapiro, Tony Tung |
3444 | CDFI: Compression-Driven Network Design for Frame Interpolation | Tianyu Ding, Luming Liang, Zhihui Zhu, Ilya Zharkov |
7537 | Few-Shot Classification With Feature Map Reconstruction Networks | Davis Wertheimer, Luming Tang, Bharath Hariharan |
5931 | Augmentation Strategies for Learning With Noisy Labels | Kento Nishi, Yi Ding, Alex Rich, Tobias Höllerer |
1454 | Activate or Not: Learning Customized Activation | Ningning Ma, Xiangyu Zhang, Ming Liu, Jian Sun |
10160 | Background Splitting: Finding Rare Classes in a Sea of Background | Ravi Teja Mullapudi, Fait Poms, William R. Mark, Deva Ramanan, Kayvon Fatahalian |
2915 | CLCC: Contrastive Learning for Color Constancy | Yi-Chen Lo, Chia-Che Chang, Hsuan-Chao Chiu, Yu-Hao Huang, Chia-Ping Chen, Yu-Lin Chang, Kevin Jou |
3115 | Dynamic Region-Aware Convolution | Jin Chen, Xijun Wang, Zichao Guo, Xiangyu Zhang, Jian Sun |
5480 | Learning Dynamics via Graph Neural Networks for Human Pose Estimation and Tracking | Yiding Yang, Zhou Ren, Haoxiang Li, Chunluan Zhou, Xinchao Wang, Gang Hua |
10624 | Searching for Fast Model Families on Datacenter Accelerators | Sheng Li, Mingxing Tan, Ruoming Pang, Andrew Li, Liqun Cheng, Quoc V. Le, Norman P. Jouppi |
6563 | Discrete-Continuous Action Space Policy Gradient-Based Attention for Image-Text Matching | Shiyang Yan, Li Yu, Yuan Xie |
8266 | Quantifying Explainers of Graph Neural Networks in Computational Pathology | Guillaume Jaume, Pushpak Pati, Behzad Bozorgtabar, Antonio Foncubierta, Anna Maria Anniciello, Florinda Feroce, Tilman Rau, Jean-Philippe Thiran, Maria Gabrani, Orcun Goksel |
2486 | Forecasting Irreversible Disease via Progression Learning | Botong Wu, Sijie Ren, Jing Li, Xinwei Sun, Shi-Ming Li, Yizhou Wang |
6901 | Transformer Tracking | Xin Chen, Bin Yan, Jiawen Zhu, Dong Wang, Xiaoyun Yang, Huchuan Lu |
6124 | Online Multiple Object Tracking With Cross-Task Synergy | Song Guo, Jingya Wang, Xinchao Wang, Dacheng Tao |
6219 | Learning 3D Shape Feature for Texture-Insensitive Person Re-Identification | Jiaxing Chen, Xinyang Jiang, Fudong Wang, Jun Zhang, Feng Zheng, Xing Sun, Wei-Shi Zheng |
2479 | Regularizing Neural Networks via Adversarial Model Perturbation | Yaowei Zheng, Richong Zhang, Yongyi Mao |
2835 | Task-Aware Variational Adversarial Active Learning | Kwanyoung Kim, Dongwon Park, Kwang In Kim, Se Young Chun |
8179 | VDSM: Unsupervised Video Disentanglement With State-Space Modeling and Deep Mixtures of Experts | Matthew J. Vowels, Necati Cihan Camgoz, Richard Bowden |
2960 | Multi-Target Domain Adaptation With Collaborative Consistency Learning | Takashi Isobe, Xu Jia, Shuaijun Chen, Jianzhong He, Yongjie Shi, Jianzhuang Liu, Huchuan Lu, Shengjin Wang |
6105 | Learning To Relate Depth and Semantics for Unsupervised Domain Adaptation | Suman Saha, Anton Obukhov, Danda Pani Paudel, Menelaos Kanakis, Yuhua Chen, Stamatios Georgoulis, Luc Van Gool |
4346 | Adversarially Adaptive Normalization for Single Domain Generalization | Xinjie Fan, Qifei Wang, Junjie Ke, Feng Yang, Boqing Gong, Mingyuan Zhou |
4189 | Rainbow Memory: Continual Learning With a Memory of Diverse Samples | Jihwan Bang, Heesu Kim, YoungJoon Yoo, Jung-Woo Ha, Jonghyun Choi |
3378 | Asymmetric Metric Learning for Knowledge Transfer | Mateusz Budnik, Yannis Avrithis |
10164 | Scalability vs. Utility: Do We Have To Sacrifice One for the Other in Data Importance Quantification? | Ruoxi Jia, Fan Wu, Xuehui Sun, Jiacen Xu, David Dao, Bhavya Kailkhura, Ce Zhang, Bo Li, Dawn Song |
8027 | Self-Supervised Learning on 3D Point Clouds by Learning Discrete Generative Models | Benjamin Eckart, Wentao Yuan, Chao Liu, Jan Kautz |
843 | Multi-View Depth Estimation Using Epipolar Spatio-Temporal Networks | Xiaoxiao Long, Lingjie Liu, Wei Li, Christian Theobalt, Wenping Wang |
9958 | Beyond Image to Depth: Improving Depth Prediction Using Echoes | Kranti Kumar Parida, Siddharth Srivastava, Gaurav Sharma |
3042 | Deeply Shape-Guided Cascade for Instance Segmentation | Hao Ding, Siyuan Qiao, Alan Yuille, Wei Shen |
1130 | Linguistic Structures As Weak Supervision for Visual Scene Graph Generation | Keren Ye, Adriana Kovashka |
1210 | Semantic Segmentation With Generative Models: Semi-Supervised Learning and Strong Out-of-Domain Generalization | Daiqing Li, Junlin Yang, Karsten Kreis, Antonio Torralba, Sanja Fidler |
3575 | Self-Guided and Cross-Guided Learning for Few-Shot Segmentation | Bingfeng Zhang, Jimin Xiao, Terry Qin |
2934 | Scene Essence | Jiayan Qiu, Yiding Yang, Xinchao Wang, Dacheng Tao |
578 | Adaptive Prototype Learning and Allocation for Few-Shot Segmentation | Gen Li, Varun Jampani, Laura Sevilla-Lara, Deqing Sun, Jonghyun Kim, Joongkyu Kim |
305 | Cluster, Split, Fuse, and Update: Meta-Learning for Open Compound Domain Adaptive Semantic Segmentation | Rui Gong, Yuhua Chen, Danda Pani Paudel, Yawei Li, Ajad Chhatkuli, Wen Li, Dengxin Dai, Luc Van Gool |
3383 | Unsupervised Part Segmentation Through Disentangling Appearance and Shape | Shilong Liu, Lei Zhang, Xiao Yang, Hang Su, Jun Zhu |
3751 | Temporal Action Segmentation From Timestamp Supervision | Zhe Li, Yazan Abu Farha, Jürgen Gall |
7251 | RAFT-3D: Scene Flow Using Rigid-Motion Embeddings | Zachary Teed, Jia Deng |
4949 | Coarse-Fine Networks for Temporal Activity Detection in Videos | Kumara Kahatapitiya, Michael S. Ryoo |
2082 | Learning Discriminative Prototypes With Dynamic Time Warping | Xiaobin Chang, Frederick Tung, Greg Mori |
6302 | Learning Dynamic Network Using a Reuse Gate Function in Semi-Supervised Video Object Segmentation | Hyojin Park, Jayeon Yoo, Seohyeong Jeong, Ganesh Venkatesh, Nojun Kwak |
274 | Probabilistic Embeddings for Cross-Modal Retrieval | Sanghyuk Chun, Seong Joon Oh, Rafael Sampaio de Rezende, Yannis Kalantidis, Diane Larlus |
6167 | Towards Bridging Event Captioner and Sentence Localizer for Weakly Supervised Dense Event Captioning | Shaoxiang Chen, Yu-Gang Jiang |
11068 | Positive Sample Propagation Along the Audio-Visual Event Line | Jinxing Zhou, Liang Zheng, Yiran Zhong, Shijie Hao, Meng Wang |
2404 | Embracing Uncertainty: Decoupling and De-Bias for Robust Temporal Grounding | Hao Zhou, Chongyang Zhang, Yan Luo, Yanjun Chen, Chuanping Hu |
2791 | Structured Scene Memory for Vision-Language Navigation | Hanqing Wang, Wenguan Wang, Wei Liang, Caiming Xiong, Jianbing Shen |
1720 | Found a Reason for Me? Weakly-Supervised Grounded Visual Question Answering Using Capsules | Aisha Urooj, Hilde Kuehne, Kevin Duarte, Chuang Gan, Niels Lobo, Mubarak Shah |
8500 | Hierarchical Layout-Aware Graph Convolutional Network for Unified Aesthetics Assessment | Dongyu She, Yu-Kun Lai, Gaoxiong Yi, Kun Xu |
1993 | Parser-Free Virtual Try-On via Distilling Appearance Flows | Yuying Ge, Yibing Song, Ruimao Zhang, Chongjian Ge, Wei Liu, Ping Luo |
7368 | Self-Supervised Simultaneous Multi-Step Prediction of Road Dynamics and Cost Map | Elmira Amirloo, Mohsen Rohani, Ershad Banijamali, Jun Luo, Pascal Poupart |
5096 | StyleMeUp: Towards Style-Agnostic Sketch-Based Image Retrieval | Aneeshan Sain, Ayan Kumar Bhunia, Yongxin Yang, Tao Xiang, Yi-Zhe Song |
Date: Wednesday, June 23, 2021 11:00 – 13:30
Paper Session Seven:
Paper ID | Paper Title | Authors |
5050 | VarifocalNet: An IoU-Aware Dense Object Detector | Haoyang Zhang, Ying Wang, Feras Dayoub, Niko Sünderhauf |
2755 | Variational Relational Point Completion Network | Liang Pan, Xinyi Chen, Zhongang Cai, Junzhe Zhang, Haiyu Zhao, Shuai Yi, Ziwei Liu |
8458 | NeX: Real-Time View Synthesis With Neural Basis Expansion | Suttisak Wizadwongsa, Pakkapon Phongthawee, Jiraphon Yenphraphai, Supasorn Suwajanakorn |
2610 | Robust Neural Routing Through Space Partitions for Camera Relocalization in Dynamic Indoor Environments | Siyan Dong, Qingnan Fan, He Wang, Ji Shi, Li Yi, Thomas Funkhouser, Baoquan Chen, Leonidas J. Guibas |
3916 | Categorical Depth Distribution Network for Monocular 3D Object Detection | Cody Reading, Ali Harakeh, Julia Chae, Steven L. Waslander |
1610 | Dual Attention Suppression Attack: Generate Adversarial Camouflage in Physical World | Jiakai Wang, Aishan Liu, Zixin Yin, Shunchang Liu, Shiyu Tang, Xianglong Liu |
902 | PoseAug: A Differentiable Pose Augmentation Framework for 3D Human Pose Estimation | Kehong Gong, Jianfeng Zhang, Jiashi Feng |
5465 | Passive Inter-Photon Imaging | Atul Ingle, Trevor Seets, Mauro Buttafava, Shantanu Gupta, Alberto Tosi, Mohit Gupta, Andreas Velten |
2663 | Adaptive Consistency Prior Based Deep Network for Image Denoising | Chao Ren, Xiaohai He, Chuncheng Wang, Zhibo Zhao |
4547 | Dynamic Slimmable Network | Changlin Li, Guangrun Wang, Bing Wang, Xiaodan Liang, Zhihui Li, Xiaojun Chang |
7174 | The Neural Tangent Link Between CNN Denoisers and Non-Local Filters | Julián Tachella, Junqi Tang, Mike Davies |
6581 | Learning Continuous Image Representation With Local Implicit Image Function | Yinbo Chen, Sifei Liu, Xiaolong Wang |
327 | Image-to-Image Translation via Hierarchical Style Disentanglement | Xinyang Li, Shengchuan Zhang, Jie Hu, Liujuan Cao, Xiaopeng Hong, Xudong Mao, Feiyue Huang, Yongjian Wu, Rongrong Ji |
5483 | Dynamic Neural Radiance Fields for Monocular 4D Facial Avatar Reconstruction | Guy Gafni, Justus Thies, Michael Zollhöfer, Matthias Nießner |
3633 | Adversarial Robustness Under Long-Tailed Distribution | Tong Wu, Ziwei Liu, Qingqiu Huang, Yu Wang, Dahua Lin |
1364 | Representative Batch Normalization With Feature Calibration | Shang-Hua Gao, Qi Han, Duo Li, Ming-Ming Cheng, Pai Peng |
829 | Learning To Track Instances Without Video Annotations | Yang Fu, Sifei Liu, Umar Iqbal, Shalini De Mello, Humphrey Shi, Jan Kautz |
7002 | Reducing Domain Gap by Reducing Style Bias | Hyeonseob Nam, HyunJae Lee, Jongchan Park, Wonjun Yoon, Donggeun Yoo |
5588 | Taskology: Utilizing Task Relations at Scale | Yao Lu, Sören Pirk, Jan Dlabal, Anthony Brohan, Ankita Pasad, Zhao Chen, Vincent Casser, Anelia Angelova, Ariel Gordon |
6415 | MOS: Towards Scaling Out-of-Distribution Detection for Large Semantic Space | Rui Huang, Yixuan Li |
8088 | DCT-Mask: Discrete Cosine Transform Mask Representation for Instance Segmentation | Xing Shen, Jirui Yang, Chunbo Wei, Bing Deng, Jianqiang Huang, Xian-Sheng Hua, Xiaoliang Cheng, Kewei Liang |
3872 | Fine-Grained Angular Contrastive Learning With Coarse Labels | Guy Bukchin, Eli Schwartz, Kate Saenko, Ori Shahar, Rogerio Feris, Raja Giryes, Leonid Karlinsky |
5858 | End-to-End Video Instance Segmentation With Transformers | Yuqing Wang, Zhaoliang Xu, Xinlong Wang, Chunhua Shen, Baoshan Cheng, Hao Shen, Huaxia Xia |
5007 | TAP: Text-Aware Pre-Training for Text-VQA and Text-Caption | Zhengyuan Yang, Yijuan Lu, Jianfeng Wang, Xi Yin, Dinei Florencio, Lijuan Wang, Cha Zhang, Lei Zhang, Jiebo Luo |
10509 | Real-Time High-Resolution Background Matting | Shanchuan Lin, Andrey Ryabtsev, Soumyadip Sengupta, Brian L. Curless, Steven M. Seitz, Ira Kemelmacher-Shlizerman |
7869 | Camouflaged Object Segmentation With Distraction Mining | Haiyang Mei, Ge-Peng Ji, Ziqi Wei, Xin Yang, Xiaopeng Wei, Deng-Ping Fan |
5445 | Semantic Relation Reasoning for Shot-Stable Few-Shot Object Detection | Chenchen Zhu, Fangyi Chen, Uzair Ahmed, Zhiqiang Shen, Marios Savvides |
1616 | Beyond Bounding-Box: Convex-Hull Feature Adaptation for Oriented and Densely Packed Object Detection | Zonghao Guo, Chang Liu, Xiaosong Zhang, Jianbin Jiao, Xiangyang Ji, Qixiang Ye |
10173 | TextOCR: Towards Large-Scale End-to-End Reasoning for Arbitrary-Shaped Scene Text | Amanpreet Singh, Guan Pang, Mandy Toh, Jing Huang, Wojciech Galuba, Tal Hassner |
11134 | MOST: A Multi-Oriented Scene Text Detector With Localization Refinement | Minghang He, Minghui Liao, Zhibo Yang, Humen Zhong, Jun Tang, Wenqing Cheng, Cong Yao, Yongpan Wang, Xiang Bai |
3082 | Points As Queries: Weakly Semi-Supervised Object Detection by Points | Liangyu Chen, Tong Yang, Xiangyu Zhang, Wei Zhang, Jian Sun |
2192 | Holistic 3D Scene Understanding From a Single Image With Implicit Representation | Cheng Zhang, Zhaopeng Cui, Yinda Zhang, Bing Zeng, Marc Pollefeys, Shuaicheng Liu |
5540 | Shelf-Supervised Mesh Prediction in the Wild | Yufei Ye, Shubham Tulsiani, Abhinav Gupta |
2868 | Mesh Saliency: An Independent Perceptual Measure or a Derivative of Image Saliency? | Ran Song, Wei Zhang, Yitian Zhao, Yonghuai Liu, Paul L. Rosin |
2771 | MetaSets: Meta-Learning on Point Sets for Generalizable Representations | Chao Huang, Zhangjie Cao, Yunbo Wang, Jianmin Wang, Mingsheng Long |
4761 | Few-Shot 3D Point Cloud Semantic Segmentation | Na Zhao, Tat-Seng Chua, Gim Hee Lee |
11241 | Point Cloud Instance Segmentation Using Probabilistic Embeddings | Biao Zhang, Peter Wonka |
772 | Robust Point Cloud Registration Framework Based on Deep Graph Matching | Kexue Fu, Shaolei Liu, Xiaoyuan Luo, Manning Wang |
10511 | Nutrition5k: Towards Automatic Nutritional Understanding of Generic Food | Quin Thames, Arjun Karpur, Wade Norris, Fangting Xia, Liviu Panait, Tobias Weyand, Jack Sim |
4401 | Differentiable Diffusion for Dense Depth Estimation From Multi-View Images | Numair Khan, Min H. Kim, James Tompkin |
3434 | LoFTR: Detector-Free Local Feature Matching With Transformers | Jiaming Sun, Zehong Shen, Yuang Wang, Hujun Bao, Xiaowei Zhou |
1972 | DI-Fusion: Online Implicit 3D Reconstruction With Deep Priors | Jiahui Huang, Shi-Sheng Huang, Haoxuan Song, Shi-Min Hu |
6130 | SMD-Nets: Stereo Mixture Density Networks | Fabio Tosi, Yiyi Liao, Carolin Schmitt, Andreas Geiger |
3297 | Deep Two-View Structure-From-Motion Revisited | Jianyuan Wang, Yiran Zhong, Yuchao Dai, Stan Birchfield, Kaihao Zhang, Nikolai Smolyanskiy, Hongdong Li |
2149 | Back-Tracing Representative Points for Voting-Based 3D Object Detection in Point Clouds | Bowen Cheng, Lu Sheng, Shaoshuai Shi, Ming Yang, Dong Xu |
5545 | GrooMeD-NMS: Grouped Mathematically Differentiable NMS for Monocular 3D Object Detection | Abhinav Kumar, Garrick Brazil, Xiaoming Liu |
2472 | Graph-Based High-Order Relation Modeling for Long-Term Action Recognition | Jiaming Zhou, Kun-Yu Lin, Haoxin Li, Wei-Shi Zheng |
4028 | SGCN: Sparse Graph Convolution Network for Pedestrian Trajectory Prediction | Liushuai Shi, Le Wang, Chengjiang Long, Sanping Zhou, Mo Zhou, Zhenxing Niu, Gang Hua |
1453 | Reformulating HOI Detection As Adaptive Set Prediction | Mingfei Chen, Yue Liao, Si Liu, Zhiyuan Chen, Fei Wang, Chen Qian |
3072 | MagDR: Mask-Guided Detection and Reconstruction for Defending Deepfakes | Zhikai Chen, Lingxi Xie, Shanmin Pang, Yong He, Bo Zhang |
11041 | Improving the Transferability of Adversarial Samples With Adversarial Transformations | Weibin Wu, Yuxin Su, Michael R. Lyu, Irwin King |
2440 | FCPose: Fully Convolutional Multi-Person Pose Estimation With Dynamic Instance-Aware Convolutions | Weian Mao, Zhi Tian, Xinlong Wang, Chunhua Shen |
387 | DexYCB: A Benchmark for Capturing Hand Grasping of Objects | Yu-Wei Chao, Wei Yang, Yu Xiang, Pavlo Molchanov, Ankur Handa, Jonathan Tremblay, Yashraj S. Narang, Karl Van Wyk, Umar Iqbal, Stan Birchfield, Jan Kautz, Dieter Fox |
4286 | Neural Body: Implicit Neural Representations With Structured Latent Codes for Novel View Synthesis of Dynamic Humans | Sida Peng, Yuanqing Zhang, Yinghao Xu, Qianqian Wang, Qing Shuai, Hujun Bao, Xiaowei Zhou |
3459 | PCLs: Geometry-Aware Neural Reconstruction of 3D Pose With Perspective Crop Layers | Frank Yu, Mathieu Salzmann, Pascal Fua, Helge Rhodin |
4259 | Affective Processes: Stochastic Modelling of Temporal Context for Emotion and Facial Expression Recognition | Enrique Sanchez, Mani Kumar Tellamekala, Michel Valstar, Georgios Tzimiropoulos |
685 | Structure-Aware Face Clustering on a Large-Scale Graph With 10^7 Nodes | Shuai Shen, Wanhua Li, Zheng Zhu, Guan Huang, Dalong Du, Jiwen Lu, Jie Zhou |
7274 | Cross-View Gait Recognition With Deep Universal Linear Embeddings | Shaoxiong Zhang, Yunhong Wang, Annan Li |
2457 | Partial Person Re-Identification With Part-Part Correspondence Learning | Tianyu He, Xu Shen, Jianqiang Huang, Zhibo Chen, Xian-Sheng Hua |
5463 | Mask-ToF: Learning Microlens Masks for Flying Pixel Correction in Time-of-Flight Imaging | Ilya Chugunov, Seung-Hwan Baek, Qiang Fu, Wolfgang Heidrich, Felix Heide |
4269 | Effective Snapshot Compressive-Spectral Imaging via Deep Denoising and Total Variation Priors | Haiquan Qiu, Yao Wang, Deyu Meng |
4914 | Test-Time Fast Adaptation for Dynamic Scene Deblurring via Meta-Auxiliary Learning | Zhixiang Chi, Yang Wang, Yuanhao Yu, Jin Tang |
412 | Removing Raindrops and Rain Streaks in One Go | Ruijie Quan, Xin Yu, Yuanzhi Liang, Yi Yang |
1532 | Learning Multi-Scale Photo Exposure Correction | Mahmoud Afifi, Konstantinos G. Derpanis, Björn Ommer, Michael S. Brown |
1193 | Towards Real-World Blind Face Restoration With Generative Facial Prior | Xintao Wang, Yu Li, Honglun Zhang, Ying Shan |
4882 | Image Restoration for Under-Display Camera | Yuqian Zhou, David Ren, Neil Emerton, Sehoon Lim, Timothy Large |
3062 | LAU-Net: Latitude Adaptive Upscaling Network for Omnidirectional Image Super-Resolution | Xin Deng, Hao Wang, Mai Xu, Yichen Guo, Yuhang Song, Li Yang |
1089 | Interpreting Super-Resolution Networks With Local Attribution Maps | Jinjin Gu, Chao Dong |
7145 | Deep Burst Super-Resolution | Goutam Bhat, Martin Danelljan, Luc Van Gool, Radu Timofte |
7674 | Towards Rolling Shutter Correction and Deblurring in Dynamic Scenes | Zhihang Zhong, Yinqiang Zheng, Imari Sato |
3977 | Towards Fast and Accurate Real-World Depth Super-Resolution: Benchmark Dataset and Baseline | Lingzhi He, Hongguang Zhu, Feng Li, Huihui Bai, Runmin Cong, Chunjie Zhang, Chunyu Lin, Meiqin Liu, Yao Zhao |
10750 | Learning To Restore Hazy Video: A New Real-World Dataset and a New Method | Xinyi Zhang, Hang Dong, Jinshan Pan, Chao Zhu, Ying Tai, Chengjie Wang, Jilin Li, Feiyue Huang, Fei Wang |
5272 | Gradient Forward-Propagation for Large-Scale Temporal Video Modelling | Mateusz Malinowski, Dimitrios Vytiniotis, Grzegorz Świrszcz, Viorica Pătrăucean, João Carreira |
3978 | Complementary Relation Contrastive Distillation | Jinguo Zhu, Shixiang Tang, Dapeng Chen, Shijie Yu, Yakun Liu, Mingzhe Rong, Aijun Yang, Xiaohua Wang |
5524 | Network Pruning via Performance Maximization | Shangqian Gao, Feihu Huang, Weidong Cai, Heng Huang |
8376 | Distribution-Aware Adaptive Multi-Bit Quantization | Sijie Zhao, Tao Yue, Xuemei Hu |
10749 | The Affective Growth of Computer Vision | Norman Makoto Su, David J. Crandall |
1263 | Fair Attribute Classification Through Latent Space De-Biasing | Vikram V. Ramaswamy, Sunnie S. Y. Kim, Olga Russakovsky |
7885 | Soteria: Provable Defense Against Privacy Leakage in Federated Learning From Representation Perspective | Jingwei Sun, Ang Li, Binghui Wang, Huanrui Yang, Hai Li, Yiran Chen |
1957 | Deep Compositional Metric Learning | Wenzhao Zheng, Chengkun Wang, Jiwen Lu, Jie Zhou |
5411 | Physically-Aware Generative Network for 3D Shape Modeling | Mariem Mezghanni, Malika Boulkenafed, André Lieutier, Maks Ovsjanikov |
1179 | Semantic Palette: Guiding Scene Generation With Class Proportions | Guillaume Le Moing, Tuan-Hung Vu, Himalaya Jain, Patrick Pérez, Matthieu Cord |
7612 | Linear Semantics in Generative Adversarial Networks | Jianjin Xu, Changxi Zheng |
2910 | Region-Aware Adaptive Instance Normalization for Image Harmonization | Jun Ling, Han Xue, Li Song, Rong Xie, Xiao Gu |
6187 | PD-GAN: Probabilistic Diverse GAN for Image Inpainting | Hongyu Liu, Ziyu Wan, Wei Huang, Yibing Song, Xintong Han, Jing Liao |
4675 | In the Light of Feature Distributions: Moment Matching for Neural Style Transfer | Nikolai Kalischek, Jan D. Wegner, Konrad Schindler |
11816 | High-Resolution Photorealistic Image Translation in Real-Time: A Laplacian Pyramid Translation Network | Jie Liang, Hui Zeng, Lei Zhang |
6691 | Synthesizing Long-Term 3D Human Motion and Interaction in 3D Scenes | Jiashun Wang, Huazhe Xu, Jingwei Xu, Sifei Liu, Xiaolong Wang |
5246 | A Sliced Wasserstein Loss for Neural Texture Synthesis | Eric Heitz, Kenneth Vanhoey, Thomas Chambon, Laurent Belcour |
3892 | Space-Time Neural Irradiance Fields for Free-Viewpoint Video | Wenqi Xian, Jia-Bin Huang, Johannes Kopf, Changil Kim |
8703 | Rethinking Class Relations: Absolute-Relative Supervised and Unsupervised Few-Shot Learning | Hongguang Zhang, Piotr Koniusz, Songlei Jian, Hongdong Li, Philip H. S. Torr |
7719 | Joint Negative and Positive Learning for Noisy Labels | Youngdong Kim, Juseung Yun, Hyounguk Shon, Junmo Kim |
6322 | Out-of-Distribution Detection Using Union of 1-Dimensional Subspaces | Alireza Zaeemzadeh, Niccolò Bisagno, Zeno Sambugaro, Nicola Conci, Nazanin Rahnavard, Mubarak Shah |
866 | OpenMix: Reviving Known Knowledge for Discovering Novel Visual Categories in an Open World | Zhun Zhong, Linchao Zhu, Zhiming Luo, Shaozi Li, Yi Yang, Nicu Sebe |
419 | Calibrated RGB-D Salient Object Detection | Wei Ji, Jingjing Li, Shuang Yu, Miao Zhang, Yongri Piao, Shunyu Yao, Qi Bi, Kai Ma, Yefeng Zheng, Huchuan Lu, Li Cheng |
6016 | Permuted AdaIN: Reducing the Bias Towards Global Statistics in Image Classification | Oren Nuriel, Sagie Benaim, Lior Wolf |
7354 | Binary Graph Neural Networks | Mehdi Bahri, Gaétan Bahl, Stefanos Zafeiriou |
11671 | Contrastive Neural Architecture Search With Neural Architecture Comparators | Yaofo Chen, Yong Guo, Qi Chen, Minli Li, Wei Zeng, Yaowei Wang, Mingkui Tan |
2687 | Group Whitening: Balancing Learning Efficiency and Representational Capacity | Lei Huang, Yi Zhou, Li Liu, Fan Zhu, Ling Shao |
2227 | Towards Unified Surgical Skill Assessment | Daochang Liu, Qiyue Li, Tingting Jiang, Yizhou Wang, Rulin Miao, Fei Shan, Ziyu Li |
4326 | Every Annotation Counts: Multi-Label Deep Supervision for Medical Image Segmentation | Simon Reiß, Constantin Seibold, Alexander Freytag, Erik Rodner, Rainer Stiefelhagen |
6911 | Graph Attention Tracking | Dongyan Guo, Yanyan Shao, Ying Cui, Zhenhua Wang, Liyan Zhang, Chunhua Shen |
8651 | Discriminative Appearance Modeling With Multi-Track Pooling for Real-Time Multi-Object Tracking | Chanho Kim, Li Fuxin, Mazen Alotaibi, James M. Rehg |
3648 | Scale-Aware Automatic Augmentation for Object Detection | Yukang Chen, Yanwei Li, Tao Kong, Lu Qi, Ruihang Chu, Lei Li, Jiaya Jia |
2574 | Confluent Vessel Trees With Accurate Bifurcations | Zhongwen Zhang, Dmitrii Marin, Maria Drangova, Yuri Boykov |
3996 | Sequential Graph Convolutional Network for Active Learning | Razvan Caramalau, Binod Bhattarai, Tae-Kyun Kim |
11307 | CausalVAE: Disentangled Representation Learning via Neural Structural Causal Models | Mengyue Yang, Furui Liu, Zhitang Chen, Xinwei Shen, Jianye Hao, Jun Wang |
3181 | Domain-Specific Suppression for Adaptive Object Detection | Yu Wang, Rui Zhang, Shuo Zhang, Miao Li, Yangyang Xia, Xishan Zhang, Shaoli Liu |
6132 | Uncertainty Reduction for Model Adaptation in Semantic Segmentation | Prabhu Teja S, François Fleuret |
5777 | Open Domain Generalization With Domain-Augmented Meta-Learning | Yang Shu, Zhangjie Cao, Chenyu Wang, Jianmin Wang, Mingsheng Long |
6235 | Layerwise Optimization by Gradient Decomposition for Continual Learning | Shixiang Tang, Dapeng Chen, Jinguo Zhu, Shijie Yu, Wanli Ouyang |
3886 | SLADE: A Self-Training Framework for Distance Metric Learning | Jiali Duan, Yen-Liang Lin, Son Tran, Larry S. Davis, C.-C. Jay Kuo |
11511 | DualGraph: A Graph-Based Method for Reasoning About Label Noise | HaiYang Zhang, XiMing Xing, Liang Liu |
8732 | CutPaste: Self-Supervised Learning for Anomaly Detection and Localization | Chun-Liang Li, Kihyuk Sohn, Jinsung Yoon, Tomas Pfister |
916 | Self-Supervised Visibility Learning for Novel View Synthesis | Yujiao Shi, Hongdong Li, Xin Yu |
9996 | Boosting Monocular Depth Estimation Models to High-Resolution via Content-Adaptive Multi-Resolution Merging | S. Mahdi H. Miangoleh, Sebastian Dille, Long Mai, Sylvain Paris, Yağız Aksoy |
3597 | Seesaw Loss for Long-Tailed Instance Segmentation | Jiaqi Wang, Wenwei Zhang, Yuhang Zang, Yuhang Cao, Jiangmiao Pang, Tao Gong, Kai Chen, Ziwei Liu, Chen Change Loy, Dahua Lin |
1368 | Exploiting Edge-Oriented Reasoning for 3D Point-Based Scene Graph Analysis | Chaoyi Zhang, Jianhui Yu, Yang Song, Weidong Cai |
1585 | Rethinking BiSeNet for Real-Time Semantic Segmentation | Mingyuan Fan, Shenqi Lai, Junshi Huang, Xiaoming Wei, Zhenhua Chai, Junfeng Luo, Xiaolin Wei |
4281 | Exploit Visual Dependency Relations for Semantic Segmentation | Mingyuan Liu, Dan Schonfeld, Wei Tang |
4845 | Abstract Spatial-Temporal Reasoning via Probabilistic Abduction and Execution | Chi Zhang, Baoxiong Jia, Song-Chun Zhu, Yixin Zhu |
1386 | Anti-Aliasing Semantic Reconstruction for Few-Shot Semantic Segmentation | Binghao Liu, Yao Ding, Jianbin Jiao, Xiangyang Ji, Qixiang Ye |
315 | Domain Consensus Clustering for Universal Domain Adaptation | Guangrui Li, Guoliang Kang, Yi Zhu, Yunchao Wei, Yi Yang |
4039 | Progressive Stage-Wise Learning for Unsupervised Feature Representation Enhancement | Zefan Li, Chenxi Liu, Alan Yuille, Bingbing Ni, Wenjun Zhang, Wen Gao |
7060 | NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions | Junbin Xiao, Xindi Shang, Angela Yao, Tat-Seng Chua |
862 | Spatio-Temporal Contrastive Domain Adaptation for Action Recognition | Xiaolin Song, Sicheng Zhao, Jingyu Yang, Huanjing Yue, Pengfei Xu, Runbo Hu, Hua Chai |
5125 | Shot Contrastive Self-Supervised Learning for Scene Boundary Detection | Shixing Chen, Xiaohan Nie, David Fan, Dongqing Zhang, Vimal Bhat, Raffay Hamid |
2351 | Anchor-Constrained Viterbi for Set-Supervised Action Segmentation | Jun Li, Sinisa Todorovic |
7419 | SG-Net: Spatial Granularity Network for One-Stage Video Instance Segmentation | Dongfang Liu, Yiming Cui, Wenbo Tan, Yingjie Chen |
393 | Thinking Fast and Slow: Efficient Text-to-Visual Retrieval With Transformers | Antoine Miech, Jean-Baptiste Alayrac, Ivan Laptev, Josef Sivic, Andrew Zisserman |
7043 | Open-Book Video Captioning With Retrieve-Copy-Generate Network | Ziqi Zhang, Zhongang Qi, Chunfeng Yuan, Ying Shan, Bing Li, Ying Deng, Weiming Hu |
6641 | Causal Attention for Vision-Language Tasks | Xu Yang, Hanwang Zhang, Guojun Qi, Jianfei Cai |
3112 | Locate Then Segment: A Strong Pipeline for Referring Image Segmentation | Ya Jing, Tao Kong, Wei Wang, Liang Wang, Lei Li, Tieniu Tan |
3639 | Pushing It Out of the Way: Interactive Visual Navigation | Kuo-Hao Zeng, Luca Weihs, Ali Farhadi, Roozbeh Mottaghi |
2065 | SUTD-TrafficQA: A Question Answering Benchmark and an Efficient Network for Video Reasoning Over Traffic Events | Li Xu, He Huang, Jun Liu |
11433 | User-Guided Line Art Flat Filling With Split Filling Mechanism | Lvmin Zhang, Chengze Li, Edgar Simo-Serra, Yi Ji, Tien-Tsin Wong, Chunping Liu |
4041 | CT-Net: Complementary Transfering Network for Garment Transfer With Arbitrary Geometric Changes | Fan Yang, Guosheng Lin |
4790 | AdvSim: Generating Safety-Critical Scenarios for Self-Driving Vehicles | Jingkang Wang, Ava Pun, James Tu, Sivabalan Manivasagam, Abbas Sadat, Sergio Casas, Mengye Ren, Raquel Urtasun |
8342 | AIFit: Automatic 3D Human-Interpretable Feedback Models for Fitness Training | Mihai Fieraru, Mihai Zanfir, Silviu Cristian Pirlea, Vlad Olaru, Cristian Sminchisescu |
Date: Wednesday, June 23, 2021 22:00 – 24:30
Paper Session Eight:
Paper ID | Paper Title | Authors |
1881 | Informative and Consistent Correspondence Mining for Cross-Domain Weakly Supervised Object Detection | Luwei Hou, Yu Zhang, Kui Fu, Jia Li |
2936 | Cylindrical and Asymmetrical 3D Convolution Networks for LiDAR Segmentation | Xinge Zhu, Hui Zhou, Tai Wang, Fangzhou Hong, Yuexin Ma, Wei Li, Hongsheng Li, Dahua Lin |
8752 | Neural Splines: Fitting 3D Surfaces With Infinitely-Wide Neural Networks | Francis Williams, Matthew Trager, Joan Bruna, Denis Zorin |
5721 | Learning Neural Representation of Camera Pose With Matrix Representation of Pose Shift via View Synthesis | Yaxuan Zhu, Ruiqi Gao, Siyuan Huang, Song-Chun Zhu, Ying Nian Wu |
1797 | Action Unit Memory Network for Weakly Supervised Temporal Action Localization | Wang Luo, Tianzhu Zhang, Wenfei Yang, Jingen Liu, Tao Mei, Feng Wu, Yongdong Zhang |
1668 | Weakly-Supervised Physically Unconstrained Gaze Estimation | Rakshit Kothari, Shalini De Mello, Umar Iqbal, Wonmin Byeon, Seonwook Park, Jan Kautz |
1078 | On Self-Contact and Human Pose | Lea Müller, Ahmed A. A. Osman, Siyu Tang, Chun-Hao P. Huang, Michael J. Black |
6914 | Seeing in Extra Darkness Using a Deep-Red Flash | Jinhui Xiong, Jian Wang, Wolfgang Heidrich, Shree Nayar |
1468 | Light Field Super-Resolution With Zero-Shot Learning | Zhen Cheng, Zhiwei Xiong, Chang Chen, Dong Liu, Zheng-Jun Zha |
10249 | Neural Cellular Automata Manifold | Alejandro Hernandez, Armand Vilalta, Francesc Moreno-Noguer |
9920 | Convolutional Dynamic Alignment Networks for Interpretable Classifications | Moritz Böhle, Mario Fritz, Bernt Schiele |
517 | One-Shot Free-View Neural Talking-Head Synthesis for Video Conferencing | Ting-Chun Wang, Arun Mallya, Ming-Yu Liu |
2069 | Stay Positive: Non-Negative Image Synthesis for Augmented Reality | Katie Luo, Guandao Yang, Wenqi Xian, Harald Haraldsson, Bharath Hariharan, Serge Belongie |
8433 | Playable Video Generation | Willi Menapace, Stéphane Lathuilière, Sergey Tulyakov, Aliaksandr Siarohin, Elisa Ricci |
771 | Uncertainty-Aware Joint Salient Object and Camouflaged Object Detection | Aixuan Li, Jing Zhang, Yunqiu Lv, Bowen Liu, Tong Zhang, Yuchao Dai |
372 | Discovering Hidden Physics Behind Transport Dynamics | Peirong Liu, Lin Tian, Yubo Zhang, Stephen Aylward, Yueh Lee, Marc Niethammer |
1698 | AutoFlow: Learning a Better Training Set for Optical Flow | Deqing Sun, Daniel Vlasic, Charles Herrmann, Varun Jampani, Michael Krainin, Huiwen Chang, Ramin Zabih, William T. Freeman, Ce Liu |
11153 | Unsupervised Multi-Source Domain Adaptation Without Access to Source Data | Sk Miraj Ahmed, Dripta S. Raychaudhuri, Sujoy Paul, Samet Oymak, Amit K. Roy-Chowdhury |
1407 | A Second-Order Approach to Learning With Instance-Dependent Label Noise | Zhaowei Zhu, Tongliang Liu, Yang Liu |
10043 | Post-Hoc Uncertainty Calibration for Domain Drift Scenarios | Christian Tomani, Sebastian Gruber, Muhammed Ebrar Erdem, Daniel Cremers, Florian Buettner |
3174 | Cross-View Regularization for Domain Adaptive Panoptic Segmentation | Jiaxing Huang, Dayan Guan, Aoran Xiao, Shijian Lu |
6508 | DatasetGAN: Efficient Labeled Data Factory With Minimal Human Effort | Yuxuan Zhang, Huan Ling, Jun Gao, Kangxue Yin, Jean-Francois Lafleche, Adela Barriuso, Antonio Torralba, Sanja Fidler |
6466 | Learning To Segment Actions From Visual and Language Instructions via Differentiable Weak Sequence Alignment | Yuhan Shen, Lu Wang, Ehsan Elhamifar |
7518 | SIPSA-Net: Shift-Invariant Pan Sharpening With Moving Object Alignment for Satellite Imagery | Jaehyup Lee, Soomin Seo, Munchurl Kim |
7921 | Joint-DetNAS: Upgrade Your Detector With NAS, Pruning and Dynamic Distillation | Lewei Yao, Renjie Pi, Hang Xu, Wei Zhang, Zhenguo Li, Tong Zhang |
6629 | Dense Relation Distillation With Context-Aware Aggregation for Few-Shot Object Detection | Hanzhe Hu, Shuai Bai, Aoxue Li, Jinshi Cui, Liwei Wang |
3851 | OPANAS: One-Shot Path Aggregation Network Architecture Search for Object Detection | Tingting Liang, Yongtao Wang, Zhi Tang, Guosheng Hu, Haibin Ling |
6986 | Pedestrian and Ego-Vehicle Trajectory Prediction From Monocular Camera | Lukáš Neumann, Andrea Vedaldi |
1312 | DetectoRS: Detecting Objects With Recursive Feature Pyramid and Switchable Atrous Convolution | Siyuan Qiao, Liang-Chieh Chen, Alan Yuille |
4229 | Weakly-Supervised Instance Segmentation via Class-Agnostic Learning With Salient Images | Xinggang Wang, Jiapei Feng, Bin Hu, Qi Ding, Longjin Ran, Xiaoxin Chen, Wenyu Liu |
2456 | 3DCaricShop: A Dataset and a Baseline Method for Single-View 3D Caricature Face Reconstruction | Yuda Qiu, Xiaojie Xu, Lingteng Qiu, Yan Pan, Yushuang Wu, Weikai Chen, Xiaoguang Han |
5667 | D2IM-Net: Learning Detail Disentangled Implicit Fields From Single Images | Manyi Li, Hao Zhang |
6656 | Sign-Agnostic Implicit Learning of Surface Self-Similarities for Shape Modeling and Reconstruction From Raw Point Clouds | Wenbin Zhao, Jiabao Lei, Yuxin Wen, Jianguo Zhang, Kui Jia |
2916 | Learning Progressive Point Embeddings for 3D Point Cloud Generation | Cheng Wen, Baosheng Yu, Dacheng Tao |
5312 | Learning Fine-Grained Segmentation of 3D Shapes Without Part Labels | Xiaogang Wang, Xun Sun, Xinyu Cao, Kai Xu, Bin Zhou |
735 | Deformed Implicit Field: Modeling 3D Shapes With Learned Dense Correspondence | Yu Deng, Jiaolong Yang, Xin Tong |
1128 | Recurrent Multi-View Alignment Network for Unsupervised Surface Registration | Wanquan Feng, Juyong Zhang, Hongrui Cai, Haofei Xu, Junhui Hou, Hujun Bao |
615 | DeepLM: Large-Scale Nonlinear Least Squares on Deep Learning Frameworks Using Stochastic Domain Decomposition | Jingwei Huang, Shan Huang, Mingwei Sun |
4667 | D-NeRF: Neural Radiance Fields for Dynamic Scenes | Albert Pumarola, Enric Corona, Gerard Pons-Moll, Francesc Moreno-Noguer |
4004 | AdaStereo: A Simple and Efficient Approach for Adaptive Stereo Matching | Xiao Song, Guorun Yang, Xinge Zhu, Hui Zhou, Zhe Wang, Jianping Shi |
6212 | Tangent Space Backpropagation for 3D Transformation Groups | Zachary Teed, Jia Deng |
8121 | Unsupervised Learning for Robust Fitting: A Reinforcement Learning Approach | Giang Truong, Huu Le, David Suter, Erchuan Zhang, Syed Zulqarnain Gilani |
4801 | Hybrid Rotation Averaging: A Fast and Robust Rotation Averaging Approach | Yu Chen, Ji Zhao, Laurent Kneip |
2241 | ST3D: Self-Training for Unsupervised Domain Adaptation on 3D Object Detection | Jihan Yang, Shaoshuai Shi, Zhe Wang, Hongsheng Li, Xiaojuan Qi |
7023 | MonoRUn: Monocular 3D Object Detection by Reconstruction and Uncertainty Propagation | Hansheng Chen, Yuyao Huang, Wei Tian, Zhong Gao, Lu Xiong |
2739 | Semi-Supervised Action Recognition With Temporal Contrastive Learning | Ankit Singh, Omprakash Chakraborty, Ashutosh Varshney, Rameswar Panda, Rogerio Feris, Kate Saenko, Abir Das |
4856 | TrafficSim: Learning To Simulate Realistic Multi-Agent Behaviors | Simon Suo, Sebastian Regalado, Sergio Casas, Raquel Urtasun |
1633 | QPIC: Query-Based Pairwise Human-Object Interaction Detection With Image-Wide Contextual Information | Masato Tamura, Hiroki Ohashi, Tomoaki Yoshinaga |
4368 | Class-Aware Robust Adversarial Training for Object Detection | Pin-Chun Chen, Bo-Han Kung, Jun-Cheng Chen |
11707 | SurFree: A Fast Surrogate-Free Black-Box Attack | Thibault Maho, Teddy Furon, Erwan Le Merrer |
2476 | Lite-HRNet: A Lightweight High-Resolution Network | Changqian Yu, Bin Xiao, Changxin Gao, Lu Yuan, Lei Zhang, Nong Sang, Jingdong Wang |
724 | Model-Based 3D Hand Reconstruction via Self-Supervised Learning | Yujin Chen, Zhigang Tu, Di Kang, Linchao Bao, Ying Zhang, Xuefei Zhe, Ruizhi Chen, Junsong Yuan |
4356 | LEAP: Learning Articulated Occupancy of People | Marko Mihajlovic, Yan Zhang, Michael J. Black, Siyu Tang |
4678 | Bilevel Online Adaptation for Out-of-Domain Human Mesh Reconstruction | Shanyan Guan, Jingwei Xu, Yunbo Wang, Bingbing Ni, Xiaokang Yang |
10889 | Exploiting Semantic Embedding and Visual Feature for Facial Action Unit Detection | Huiyuan Yang, Lijun Yin, Yi Zhou, Jiuxiang Gu |
701 | WebFace260M: A Benchmark Unveiling the Power of Million-Scale Deep Face Recognition | Zheng Zhu, Guan Huang, Jiankang Deng, Yun Ye, Junjie Huang, Xinze Chen, Jiagang Zhu, Tian Yang, Jiwen Lu, Dalong Du, Jie Zhou |
3300 | PML: Progressive Margin Loss for Long-Tailed Age Classification | Zongyong Deng, Hao Liu, Yaoxing Wang, Chenyang Wang, Zekuan Yu, Xuehong Sun |
3046 | Fine-Grained Shape-Appearance Mutual Learning for Cloth-Changing Person Re-Identification | Peixian Hong, Tao Wu, Ancong Wu, Xintong Han, Wei-Shi Zheng |
6134 | Physics-Based Iterative Projection Complex Neural Network for Phase Retrieval in Lensless Microscopy Imaging | Feilong Zhang, Xianming Liu, Cheng Guo, Shiyi Lin, Junjun Jiang, Xiangyang Ji |
1492 | Asymmetric Gained Deep Image Compression With Continuous Rate Adaptation | Ze Cui, Jing Wang, Shangyin Gao, Tiansheng Guo, Yihui Feng, Bo Bai |
5141 | Learning a Non-Blind Deblurring Network for Night Blurry Images | Liang Chen, Jiawei Zhang, Jinshan Pan, Songnan Lin, Faming Fang, Jimmy S. Ren |
640 | Contrastive Learning for Compact Single Image Dehazing | Haiyan Wu, Yanyun Qu, Shaohui Lin, Jian Zhou, Ruizhi Qiao, Zhizhong Zhang, Yuan Xie, Lizhuang Ma |
3037 | Retinex-Inspired Unrolling With Cooperative Prior Architecture Search for Low-Light Image Enhancement | Risheng Liu, Long Ma, Jiaao Zhang, Xin Fan, Zhongxuan Luo |
1756 | Auto-Exposure Fusion for Single-Image Shadow Removal | Lan Fu, Changqing Zhou, Qing Guo, Felix Juefei-Xu, Hongkai Yu, Wei Feng, Yang Liu, Song Wang |
5692 | Unsupervised Degradation Representation Learning for Blind Super-Resolution | Longguang Wang, Yingqian Wang, Xiaoyu Dong, Qingyu Xu, Jungang Yang, Wei An, Yulan Guo |
2744 | Lighting, Reflectance and Geometry Estimation From 360° Panoramic Stereo | Junxuan Li, Hongdong Li, Yasuyuki Matsushita |
1253 | Flow-Based Kernel Prior With Application to Blind Super-Resolution | Jingyun Liang, Kai Zhang, Shuhang Gu, Luc Van Gool, Radu Timofte |
7907 | KOALAnet: Blind Super-Resolution Using Kernel-Oriented Adaptive Local Adjustment | Soo Ye Kim, Hyeonjun Sim, Munchurl Kim |
3482 | 3D Video Stabilization With Depth Estimation by CNN-Based Optimization | Yao-Chih Lee, Kuan-Wei Tseng, Yu-Ta Chen, Chien-Cheng Chen, Chu-Song Chen, Yi-Ping Hung |
4730 | iMiGUE: An Identity-Free Video Dataset for Micro-Gesture Understanding and Emotion Analysis | Xin Liu, Henglin Shi, Haoyu Chen, Zitong Yu, Xiaobai Li, Guoying Zhao |
2750 | ACRE: Abstract Causal REasoning Beyond Covariation | Chi Zhang, Baoxiong Jia, Mark Edmonds, Song-Chun Zhu, Yixin Zhu |
7829 | Enhance Curvature Information by Structured Stochastic Quasi-Newton Methods | Minghan Yang, Dong Xu, Hongyu Chen, Zaiwen Wen, Mengyun Chen |
4492 | Refine Myself by Teaching Myself: Feature Refinement via Self-Knowledge Distillation | Mingi Ji, Seungjae Shin, Seunghyun Hwang, Gibeom Park, Il-Chul Moon |
5838 | Towards Efficient Tensor Decomposition-Based DNN Model Compression With Optimization Framework | Miao Yin, Yang Sui, Siyu Liao, Bo Yuan |
8893 | QPP: Real-Time Quantization Parameter Prediction for Deep Neural Networks | Vladimir Kryzhanovskiy, Gleb Balitskiy, Nikolay Kozyrskiy, Aleksandr Zuruev |
4686 | Explaining Classifiers Using Adversarial Perturbations on the Perceptual Ball | Andrew Elliott, Stephen Law, Chris Russell |
7407 | Verifiability and Predictability: Interpreting Utilities of Network Architectures for Point Cloud Processing | Wen Shen, Zhihua Wei, Shikun Huang, Binbin Zhang, Panyue Chen, Ping Zhao, Quanshi Zhang |
8518 | Model-Contrastive Federated Learning | Qinbin Li, Bingsheng He, Dawn Song |
2333 | Compatibility-Aware Heterogeneous Visual Search | Rahul Duggal, Hao Zhou, Shuo Yang, Yuanjun Xiong, Wei Xia, Zhuowen Tu, Stefano Soatto |
5834 | Plan2Scene: Converting Floorplans to 3D Scenes | Madhawa Vidanapathirana, Qirui Wu, Yasutaka Furukawa, Angel X. Chang, Manolis Savva |
1268 | Few-Shot Image Generation via Cross-Domain Correspondence | Utkarsh Ojha, Yijun Li, Jingwan Lu, Alexei A. Efros, Yong Jae Lee, Eli Shechtman, Richard Zhang |
8135 | Adversarial Generation of Continuous Images | Ivan Skorokhodov, Savva Ignatyev, Mohamed Elhoseiny |
3556 | DeFLOCNet: Deep Image Editing via Flexible Low-Level Controls | Hongyu Liu, Ziyu Wan, Wei Huang, Yibing Song, Xintong Han, Jing Liao, Bin Jiang, Wei Liu |
7804 | Generating Diverse Structure for Image Inpainting With Hierarchical VQ-VAE | Jialun Peng, Dong Liu, Songcen Xu, Houqiang Li |
4721 | Smoothing the Disentangled Latent Style Space for Unsupervised Image-to-Image Translation | Yahui Liu, Enver Sangineto, Yajing Chen, Linchao Bao, Haoxian Zhang, Nicu Sebe, Bruno Lepri, Wei Wang, Marco De Nadai |
532 | Flow Guided Transformable Bottleneck Networks for Motion Retargeting | Jian Ren, Menglei Chai, Oliver J. Woodford, Kyle Olszewski, Sergey Tulyakov |
8229 | Learning Semantic Person Image Generation by Region-Adaptive Normalization | Zhengyao Lv, Xiaoming Li, Xin Li, Fu Li, Tianwei Lin, Dongliang He, Wangmeng Zuo |
5754 | Large-Capacity Image Steganography Based on Invertible Neural Networks | Shao-Ping Lu, Rong Wang, Tao Zhong, Paul L. Rosin |
6861 | Self-Supervised Video GANs: Learning for Appearance Consistency and Motion Coherency | Sangeek Hyun, Jihwan Kim, Jae-Pil Heo |
10643 | Exploring Complementary Strengths of Invariant and Equivariant Representations for Few-Shot Learning | Mamshad Nayeem Rizve, Salman Khan, Fahad Shahbaz Khan, Mubarak Shah |
9959 | Clusformer: A Transformer Based Clustering Approach to Unsupervised Large-Scale Face and Visual Landmark Recognition | Xuan-Bac Nguyen, Duc Toan Bui, Chi Nhan Duong, Tien D. Bui, Khoa Luu |
1297 | CReST: A Class-Rebalancing Self-Training Framework for Imbalanced Semi-Supervised Learning | Chen Wei, Kihyuk Sohn, Clayton Mellina, Alan Yuille, Fan Yang |
868 | Neighborhood Contrastive Learning for Novel Class Discovery | Zhun Zhong, Enrico Fini, Subhankar Roy, Zhiming Luo, Elisa Ricci, Nicu Sebe |
443 | Predicting Human Scanpaths in Visual Question Answering | Xianyu Chen, Ming Jiang, Qi Zhao |
7718 | Diverse Branch Block: Building a Convolution as an Inception-Like Unit | Xiaohan Ding, Xiangyu Zhang, Jungong Han, Guiguang Ding |
636 | HourNAS: Extremely Fast Neural Architecture Search Through an Hourglass Lens | Zhaohui Yang, Yunhe Wang, Xinghao Chen, Jianyuan Guo, Wei Zhang, Chao Xu, Chunjing Xu, Dacheng Tao, Chang Xu |
11940 | Neural Architecture Search With Random Labels | Xuanyang Zhang, Pengfei Hou, Xiangyu Zhang, Jian Sun |
3675 | Metadata Normalization | Mandy Lu, Qingyu Zhao, Jiequan Zhang, Kilian M. Pohl, Li Fei-Fei, Juan Carlos Niebles, Ehsan Adeli |
11821 | Learning-Based Image Registration With Meta-Regularization | Ebrahim Al Safadi, Xubo Song |
7029 | Lesion-Aware Transformers for Diabetic Retinopathy Grading | Rui Sun, Yihao Li, Tianzhu Zhang, Zhendong Mao, Feng Wu, Yongdong Zhang |
7038 | CapsuleRRT: Relationships-Aware Regression Tracking via Capsules | Ding Ma, Xiangqian Wu |
8685 | Improving Multiple Pedestrian Tracking by Track Management and Occlusion Handling | Daniel Stadler, Jürgen Beyerer |
10845 | Prioritized Architecture Sampling With Monto-Carlo Tree Search | Xiu Su, Tao Huang, Yanxi Li, Shan You, Fei Wang, Chen Qian, Changshui Zhang, Chang Xu |
7797 | Stochastic Whitening Batch Normalization | Shengdong Zhang, Ehsan Nezhadarya, Homa Fashandi, Jiayi Liu, Darin Graham, Mohak Shah |
8641 | Revisiting Superpixels for Active Learning in Semantic Segmentation With Realistic Annotation Costs | Lile Cai, Xun Xu, Jun Hao Liew, Chuan Sheng Foo |
564 | Dynamic Transfer for Multi-Source Domain Adaptation | Yunsheng Li, Lu Yuan, Yinpeng Chen, Pei Wang, Nuno Vasconcelos |
3292 | Multi-Source Domain Adaptation With Collaborative Learning for Semantic Segmentation | Jianzhong He, Xu Jia, Shuaijun Chen, Jianzhuang Liu |
7481 | Semi-Supervised Domain Adaptation Based on Dual-Level Domain Mixing for Semantic Segmentation | Shuaijun Chen, Xu Jia, Jianzhong He, Yongjie Shi, Jianzhuang Liu |
7452 | What Can Style Transfer and Paintings Do for Model Robustness? | Hubert Lin, Mitchell van Zuijlen, Sylvia C. Pont, Maarten W.A. Wijntjes, Kavita Bala |
6297 | IIRC: Incremental Implicitly-Refined Classification | Mohamed Abdelsalam, Mojtaba Faramarzi, Shagun Sodhani, Sarath Chandar |
5718 | LayoutGMN: Neural Graph Matching for Structural Layout Similarity | Akshay Gadi Patil, Manyi Li, Matthew Fisher, Manolis Savva, Hao Zhang |
399 | CASTing Your Model: Learning To Localize Improves Self-Supervised Representations | Ramprasaath R. Selvaraju, Karan Desai, Justin Johnson, Nikhil Naik |
9917 | Representation Learning via Global Temporal Alignment and Cycle-Consistency | Isma Hadji, Konstantinos G. Derpanis, Allan D. Jepson |
1188 | Sparse Auxiliary Networks for Unified Monocular Depth Prediction and Completion | Vitor Guizilini, Rareș Ambruș, Wolfram Burgard, Adrien Gaidon |
10204 | Single Image Depth Prediction With Wavelet Decomposition | Michaël Ramamonjisoa, Michael Firman, Jamie Watson, Vincent Lepetit, Daniyar Turmukhambetov |
3844 | FAPIS: A Few-Shot Anchor-Free Part-Based Instance Segmenter | Khoi Nguyen, Sinisa Todorovic |
2839 | Bipartite Graph Network With Adaptive Message Passing for Unbiased Scene Graph Generation | Rongjie Li, Songyang Zhang, Bo Wan, Xuming He |
1868 | Semantic Image Matting | Yanan Sun, Chi-Keung Tang, Yu-Wing Tai |
4384 | Three Ways To Improve Semantic Segmentation With Self-Supervised Depth Estimation | Lukas Hoyer, Dengxin Dai, Yuhua Chen, Adrian Köring, Suman Saha, Luc Van Gool |
6055 | Robust Instance Segmentation Through Reasoning About Multi-Object Occlusion | Xiaoding Yuan, Adam Kortylewski, Yihong Sun, Alan Yuille |
5566 | Learning Graphs for Knowledge Transfer With Limited Labels | Pallabi Ghosh, Nirat Saini, Larry S. Davis, Abhinav Shrivastava |
410 | VirTex: Learning Visual Representations From Textual Annotations | Karan Desai, Justin Johnson |
4138 | COMPLETER: Incomplete Multi-View Clustering via Contrastive Prediction | Yijie Lin, Yuanbiao Gou, Zitao Liu, Boyun Li, Jiancheng Lv, Xi Peng |
10838 | Home Action Genome: Cooperative Compositional Action Understanding | Nishant Rai, Haofeng Chen, Jingwei Ji, Rishi Desai, Kazuki Kozuka, Shun Ishizaka, Ehsan Adeli, Juan Carlos Niebles |
2328 | Learning Goals From Failure | Dave Epstein, Carl Vondrick |
5184 | VideoMoCo: Contrastive Video Representation Learning With Temporally Adversarial Examples | Tian Pan, Yibing Song, Tianyu Yang, Wenhao Jiang, Wei Liu |
2487 | Spatial Feature Calibration and Temporal Fusion for Effective One-Stage Video Instance Segmentation | Minghan Li, Shuai Li, Lida Li, Lei Zhang |
8337 | Temporally-Weighted Hierarchical Clustering for Unsupervised Action Segmentation | Saquib Sarfraz, Naila Murray, Vivek Sharma, Ali Diba, Luc Van Gool, Rainer Stiefelhagen |
619 | Context-Aware Biaffine Localizing Network for Temporal Sentence Grounding | Daizong Liu, Xiaoye Qu, Jianfeng Dong, Pan Zhou, Yu Cheng, Wei Wei, Zichuan Xu, Yulai Xie |
7889 | Towards Diverse Paragraph Captioning for Untrimmed Videos | Yuqing Song, Shizhe Chen, Qin Jin |
7231 | Intelligent Carpet: Inferring 3D Human Pose From Tactile Signals | Yiyue Luo, Yunzhu Li, Michael Foshey, Wan Shou, Pratyusha Sharma, Tomás Palacios, Antonio Torralba, Wojciech Matusik |
3484 | Bottom-Up Shift and Reasoning for Referring Image Segmentation | Sibei Yang, Meng Xia, Guanbin Li, Hong-Yu Zhou, Yizhou Yu |
3645 | Topological Planning With Transformers for Vision-and-Language Navigation | Kevin Chen, Junshen K. Chen, Jo Chuang, Marynel Vázquez, Silvio Savarese |
2764 | AGQA: A Benchmark for Compositional Spatio-Temporal Reasoning | Madeleine Grunde-McLaughlin, Ranjay Krishna, Maneesh Agrawala |
4327 | HDR Environment Map Estimation for Real-Time Augmented Reality | Gowri Somanath, Daniel Kurz |
5741 | Fashion IQ: A New Dataset Towards Retrieving Images by Natural Language Feedback | Hui Wu, Yupeng Gao, Xiaoxiao Guo, Ziad Al-Halah, Steven Rennie, Kristen Grauman, Rogerio Feris |
5680 | TPCN: Temporal Point Cloud Networks for Motion Forecasting | Maosheng Ye, Tongyi Cao, Qifeng Chen |
230 | Generalizable Pedestrian Detection: The Elephant in the Room | Irtiza Hasan, Shengcai Liao, Jinpeng Li, Saad Ullah Akram, Ling Shao |
Date: Thursday, June 24, 2021 6:00 – 8:30
Paper Session Nine:
Paper ID | Paper Title | Authors |
4256 | Indoor Panorama Planar 3D Reconstruction via Divide and Conquer | Cheng Sun, Chi-Wei Hsiao, Ning-Hsu Wang, Min Sun, Hwann-Tzong Chen |
3271 | SOE-Net: A Self-Attention and Orientation Encoding Network for Point Cloud Based Place Recognition | Yan Xia, Yusheng Xu, Shuang Li, Rui Wang, Juan Du, Daniel Cremers, Uwe Stilla |
9941 | Neural Geometric Level of Detail: Real-Time Rendering With Implicit 3D Shapes | Towaki Takikawa, Joey Litalien, Kangxue Yin, Karsten Kreis, Charles Loop, Derek Nowrouzezahrai, Alec Jacobson, Morgan McGuire, Sanja Fidler |
4862 | SOLD²: Self-Supervised Occlusion-Aware Line Description and Detection | Rémi Pautrat, Juan-Ting Lin, Viktor Larsson, Martin R. Oswald, Marc Pollefeys |
1000 | PGT: A Progressive Method for Training Models on Long Videos | Bo Pang, Gao Peng, Yizhuo Li, Cewu Lu |
3232 | Dual Attention Guided Gaze Target Detection in the Wild | Yi Fang, Jiapeng Tang, Wang Shen, Wei Shen, Xiao Gu, Li Song, Guangtao Zhai |
2221 | ChallenCap: Monocular 3D Capture of Challenging Human Performances Using Multi-Modal References | Yannan He, Anqi Pang, Xin Chen, Han Liang, Minye Wu, Yuexin Ma, Lan Xu |
7777 | Blocks-World Cameras | Jongho Lee, Mohit Gupta |
1635 | Real-Time Sphere Sweeping Stereo From Multiview Fisheye Images | Andreas Meuleman, Hyeonjoong Jang, Daniel S. Jeon, Min H. Kim |
3454 | Optimal Gradient Checkpoint Search for Arbitrary Computation Graphs | Jianwei Feng, Dong Huang |
10440 | Black-Box Explanation of Object Detectors via Saliency Maps | Vitali Petsiuk, Rajiv Jain, Varun Manjunatha, Vlad I. Morariu, Ashutosh Mehra, Vicente Ordonez, Kate Saenko |
3367 | GIRAFFE: Representing Scenes As Compositional Generative Neural Feature Fields | Michael Niemeyer, Andreas Geiger |
3592 | CoCosNet v2: Full-Resolution Correspondence Learning for Image Translation | Xingran Zhou, Bo Zhang, Ting Zhang, Pan Zhang, Jianmin Bao, Dong Chen, Zhongfei Zhang, Fang Wen |
1802 | Your “Flamingo” is My “Bird”: Fine-Grained, or Not | Dongliang Chang, Kaiyue Pang, Yixiao Zheng, Zhanyu Ma, Yi-Zhe Song, Jun Guo |
4146 | Inception Convolution With Efficient Dilation Search | Jie Liu, Chuming Li, Feng Liang, Chen Lin, Ming Sun, Junjie Yan, Wanli Ouyang, Dong Xu |
10409 | Geo-FARM: Geodesic Factor Regression Model for Misaligned Pre-Shape Responses in Statistical Shape Analysis | Chao Huang, Anuj Srivastava, Rongjie Liu |
2823 | UnrealPerson: An Adaptive Pipeline Towards Costless Person Re-Identification | Tianyu Zhang, Lingxi Xie, Longhui Wei, Zijie Zhuang, Yongfei Zhang, Bo Li, Qi Tian |
11469 | Transferable Semantic Augmentation for Domain Adaptation | Shuang Li, Mixue Xie, Kaixiong Gong, Chi Harold Liu, Yulin Wang, Wei Li |
1391 | Jigsaw Clustering for Unsupervised Visual Representation Learning | Pengguang Chen, Shu Liu, Jiaya Jia |
1165 | SliceNet: Deep Dense Depth Estimation From a Single Indoor Panorama Using a Slice-Based Representation | Giovanni Pintore, Marco Agus, Eva Almansa, Jens Schneider, Enrico Gobbetti |
5623 | Fully Convolutional Scene Graph Generation | Hengyue Liu, Ning Yan, Masood Mortazavi, Bir Bhanu |
2061 | Meta Pseudo Labels | Hieu Pham, Zihang Dai, Qizhe Xie, Quoc V. Le |
3468 | ArtEmis: Affective Language for Visual Art | Panos Achlioptas, Maks Ovsjanikov, Kilichbek Haydarov, Mohamed Elhoseiny, Leonidas J. Guibas |
872 | RobustNet: Improving Domain Generalization in Urban-Scene Segmentation via Instance Selective Whitening | Sungha Choi, Sanghun Jung, Huiwon Yun, Joanne T. Kim, Seungryong Kim, Jaegul Choo |
765 | Simultaneously Localize, Segment and Rank the Camouflaged Objects | Yunqiu Lv, Jing Zhang, Yuchao Dai, Aixuan Li, Bowen Liu, Nick Barnes, Deng-Ping Fan |
7467 | Interpolation-Based Semi-Supervised Learning for Object Detection | Jisoo Jeong, Vikas Verma, Minsung Hyun, Juho Kannala, Nojun Kwak |
5510 | There Is More Than Meets the Eye: Self-Supervised Multi-Object Detection and Tracking With Sound by Distilling Multimodal Knowledge | Francisco Rivera Valverde, Juana Valeria Hurtado, Abhinav Valada |
11502 | Variational Pedestrian Detection | Yuang Zhang, Huanyu He, Jianguo Li, Yuxi Li, John See, Weiyao Lin |
1513 | Generalized Focal Loss V2: Learning Reliable Localization Quality Estimation for Dense Object Detection | Xiang Li, Wenhai Wang, Xiaolin Hu, Jun Li, Jinhui Tang, Jian Yang |
7272 | Unveiling the Potential of Structure Preserving for Weakly Supervised Object Localization | Xingjia Pan, Yingguo Gao, Zhiwen Lin, Fan Tang, Weiming Dong, Haolei Yuan, Feiyue Huang, Changsheng Xu |
2774 | Deep Active Surface Models | Udaranga Wickramasinghe, Pascal Fua, Graham Knott |
6565 | Normalized Avatar Synthesis Using StyleGAN and Perceptual Refinement | Huiwen Luo, Koki Nagano, Han-Wei Kung, Qingguo Xu, Zejian Wang, Lingyu Wei, Liwen Hu, Hao Li |
280 | Omni-Supervised Point Cloud Segmentation via Gradual Receptive Field Component Reasoning | Jingyu Gong, Jiachen Xu, Xin Tan, Haichuan Song, Yanyun Qu, Yuan Xie, Lizhuang Ma |
2972 | PU-GCN: Point Cloud Upsampling Using Graph Convolutional Networks | Guocheng Qian, Abdulellah Abualshour, Guohao Li, Ali Thabet, Bernard Ghanem |
5820 | CGA-Net: Category Guided Aggregation for Point Cloud Semantic Segmentation | Tao Lu, Limin Wang, Gangshan Wu |
738 | UV-Net: Learning From Boundary Representations | Pradeep Kumar Jayaraman, Aditya Sanghi, Joseph G. Lambourne, Karl D.D. Willis, Thomas Davies, Hooman Shayani, Nigel Morris |
1247 | Joint Learning of 3D Shape Retrieval and Deformation | Mikaela Angelina Uy, Vladimir G. Kim, Minhyuk Sung, Noam Aigerman, Siddhartha Chaudhuri, Leonidas J. Guibas |
3360 | Square Root Bundle Adjustment for Large-Scale Reconstruction | Nikolaus Demmel, Christiane Sommer, Daniel Cremers, Vladyslav Usenko |
5522 | Pixel-Aligned Volumetric Avatars | Amit Raj, Michael Zollhöfer, Tomas Simon, Jason Saragih, Shunsuke Saito, James Hays, Stephen Lombardi |
4105 | Learning To Identify Correct 2D-2D Line Correspondences on Sphere | Haoang Li, Kai Chen, Ji Zhao, Jiangliu Wang, Pyojin Kim, Zhe Liu, Yun-Hui Liu |
2104 | SpinNet: Learning a General Surface Descriptor for 3D Point Cloud Registration | Sheng Ao, Qingyong Hu, Bo Yang, Andrew Markham, Yulan Guo |
5511 | Self-Supervised Collision Handling via Generative 3D Garment Models for Virtual Try-On | Igor Santesteban, Nils Thuerey, Miguel A. Otaduy, Dan Casas |
7345 | End-to-End Rotation Averaging With Multi-Source Propagation | Luwei Yang, Heng Li, Jamal Ahmed Rahim, Zhaopeng Cui, Ping Tan |
3493 | Center-Based 3D Object Detection and Tracking | Tianwei Yin, Xingyi Zhou, Philipp Krähenbühl |
6010 | PointAugmenting: Cross-Modal Augmentation for 3D Object Detection | Chunwei Wang, Chao Ma, Ming Zhu, Xiaokang Yang |
4008 | Removing the Background by Adding the Background: Towards Background Robust Self-Supervised Video Representation Learning | Jinpeng Wang, Yuting Gao, Ke Li, Yiqi Lin, Andy J. Ma, Hao Cheng, Pai Peng, Feiyue Huang, Rongrong Ji, Xing Sun |
5981 | Trajectory Prediction With Latent Belief Energy-Based Model | Bo Pang, Tianyang Zhao, Xu Xie, Ying Nian Wu |
2505 | End-to-End Human Object Interaction Detection With HOI Transformer | Cheng Zou, Bohan Wang, Yue Hu, Junqi Liu, Qian Wu, Yu Zhao, Boxun Li, Chenguang Zhang, Chi Zhang, Yichen Wei, Jian Sun |
4438 | Simulating Unknown Target Models for Query-Efficient Black-Box Attacks | Chen Ma, Li Chen, Jun-Hai Yong |
3644 | Improving Transferability of Adversarial Patches on Face Recognition With Generative Models | Zihao Xiao, Xianfeng Gao, Chilin Fu, Yinpeng Dong, Wei Gao, Xiaolu Zhang, Jun Zhou, Jun Zhu |
3353 | When Human Pose Estimation Meets Robustness: Adversarial Algorithms and Benchmarks | Jiahang Wang, Sheng Jin, Wentao Liu, Weizhong Liu, Chen Qian, Ping Luo |
1281 | Body2Hands: Learning To Infer 3D Hands From Conversational Gesture Body Dynamics | Evonne Ng, Shiry Ginosar, Trevor Darrell, Hanbyul Joo |
4685 | SMPLicit: Topology-Aware Generative Model for Clothed People | Enric Corona, Albert Pumarola, Guillem Alenyà, Gerard Pons-Moll, Francesc Moreno-Noguer |
5051 | Multi-View Multi-Person 3D Pose Estimation With Plane Sweep Stereo | Jiahao Lin, Gim Hee Lee |
2234 | Progressive Semantic-Aware Style Transformation for Blind Face Restoration | Chaofeng Chen, Xiaoming Li, Lingbo Yang, Xianhui Lin, Lei Zhang, Kwan-Yee K. Wong |
1651 | Variational Prototype Learning for Deep Face Recognition | Jiankang Deng, Jia Guo, Jing Yang, Alexandros Lattas, Stefanos Zafeiriou |
3649 | Learning Spatial-Semantic Relationship for Facial Attribute Recognition With Limited Labeled Data | Ying Shu, Yan Yan, Si Chen, Jing-Hao Xue, Chunhua Shen, Hanzi Wang |
3279 | Intra-Inter Camera Similarity for Unsupervised Person Re-Identification | Shiyu Xuan, Shiliang Zhang |
8428 | Digital Gimbal: End-to-End Deep Image Stabilization With Learnable Exposure Times | Omer Dahary, Matan Jacoby, Alex M. Bronstein |
2803 | Learning Scalable ℓ∞-Constrained Near-Lossless Image Compression via Joint Lossy Image and Residual Compression | Yuanchao Bai, Xianming Liu, Wangmeng Zuo, Yaowei Wang, Xiangyang Ji |
5198 | Explore Image Deblurring via Encoded Blur Kernel Space | Phong Tran, Anh Tuan Tran, Quynh Phung, Minh Hoai |
788 | Self-Aligned Video Deraining With Transmission-Depth Consistency | Wending Yan, Robby T. Tan, Wenhan Yang, Dengxin Dai |
3474 | Nighttime Visibility Enhancement by Increasing the Dynamic Range and Suppression of Light Effects | Aashish Sharma, Robby T. Tan |
2151 | High-Quality Stereo Image Restoration From Double Refraction | Hakyeong Kim, Andreas Meuleman, Daniel S. Jeon, Min H. Kim |
5857 | Spk2ImgNet: Learning To Reconstruct Dynamic Scene From Continuous Spike Stream | Jing Zhao, Ruiqin Xiong, Hangfan Liu, Jian Zhang, Tiejun Huang |
5882 | Learning Tensor Low-Rank Prior for Hyperspectral Image Reconstruction | Shipeng Zhang, Lizhi Wang, Lei Zhang, Hua Huang |
1760 | ClassSR: A General Framework to Accelerate Super-Resolution Networks by Data Characteristic | Xiangtao Kong, Hengyuan Zhao, Yu Qiao, Chao Dong |
8483 | Scene Text Telescope: Text-Focused Scene Image Super-Resolution | Jingye Chen, Bin Li, Xiangyang Xue |
5698 | Real-Time Selfie Video Stabilization | Jiyang Yu, Ravi Ramamoorthi, Keli Cheng, Michel Sarkis, Ning Bi |
5430 | Rethinking Text Segmentation: A Novel Dataset and a Text-Specific Refinement Approach | Xingqian Xu, Zhifei Zhang, Zhaowen Wang, Brian Price, Zhonghao Wang, Humphrey Shi |
3951 | PQA: Perceptual Question Answering | Yonggang Qi, Kai Zhang, Aneeshan Sain, Yi-Zhe Song |
8341 | Communication Efficient SGD via Gradient Sampling With Bayes Prior | Liuyihan Song, Kang Zhao, Pan Pan, Yu Liu, Yingya Zhang, Yinghui Xu, Rong Jin |
4920 | Student-Teacher Learning From Clean Inputs to Noisy Inputs | Guanzhe Hong, Zhiyuan Mao, Xiaojun Lin, Stanley H. Chan |
6370 | Towards Extremely Compact RNNs for Video Recognition With Fully Decomposed Hierarchical Tucker Structure | Miao Yin, Siyu Liao, Xiao-Yang Liu, Xiaodong Wang, Bo Yuan |
11155 | Optimal Quantization Using Scaled Codebook | Yerlan Idelbayev, Pavlo Molchanov, Maying Shen, Hongxu Yin, Miguel Á. Carreira-Perpiñán, Jose M. Alvarez |
5163 | Causal Hidden Markov Model for Time Series Disease Forecasting | Jing Li, Botong Wu, Xinwei Sun, Yizhou Wang |
8035 | Fair Feature Distillation for Visual Recognition | Sangwon Jung, Donggyu Lee, Taeeon Park, Taesup Moon |
10280 | DISCO: Dynamic and Invariant Sensitive Channel Obfuscation for Deep Neural Networks | Abhishek Singh, Ayush Chopra, Ethan Garza, Emily Zhang, Praneeth Vepakomma, Vivek Sharma, Ramesh Raskar |
2849 | Person Re-Identification Using Heterogeneous Local Graph Attention Networks | Zhong Zhang, Haijia Zhang, Shuang Liu |
7258 | Hierarchical Video Prediction Using Relational Layouts for Human-Object Interactions | Navaneeth Bodla, Gaurav Shrivastava, Rama Chellappa, Abhinav Shrivastava |
1346 | Content-Aware GAN Compression | Yuchen Liu, Zhixin Shu, Yijun Li, Zhe Lin, Federico Perazzi, Sun-Yuan Kung |
8339 | Efficient Conditional GAN Transfer With Knowledge Propagation Across Classes | Mohamad Shahbazi, Zhiwu Huang, Danda Pani Paudel, Ajad Chhatkuli, Luc Van Gool |
4467 | Discovering Interpretable Latent Space Directions of GANs Beyond Binary Attributes | Huiting Yang, Liangyu Chai, Qiang Wen, Shuang Zhao, Zixun Sun, Shengfeng He |
3750 | Leveraging Line-Point Consistence To Preserve Structures for Wide Parallax Image Stitching | Qi Jia, ZhengJun Li, Xin Fan, Haotian Zhao, Shiyu Teng, Xinchen Ye, Longin Jan Latecki |
4836 | Rethinking Style Transfer: From Pixels to Parameterized Brushstrokes | Dmytro Kotovenko, Matthias Wright, Arthur Heimbrecht, Björn Ommer |
1070 | Scene-Aware Generative Network for Human Motion Synthesis | Jingbo Wang, Sijie Yan, Bo Dai, Dahua Lin |
875 | Stable View Synthesis | Gernot Riegler, Vladlen Koltun |
10334 | Understanding and Simplifying Perceptual Distances | Dan Amir, Yair Weiss |
7155 | Behavior-Driven Synthesis of Human Dynamics | Andreas Blattmann, Timo Milbich, Michael Dorkenwald, Björn Ommer |
10843 | Adaptive Image Transformer for One-Shot Object Detection | Ding-Jie Chen, He-Yen Hsieh, Tyng-Luh Liu |
10455 | Quality-Agnostic Image Recognition via Invertible Decoder | Insoo Kim, Seungju Han, Ji-won Baek, Seong-Jin Park, Jae-Joon Han, Jinwoo Shin |
6614 | Self-Supervised Wasserstein Pseudo-Labeling for Semi-Supervised Image Classification | Fariborz Taherkhani, Ali Dabouei, Sobhan Soleymani, Jeremy Dawson, Nasser M. Nasrabadi |
3724 | Improving Unsupervised Image Clustering With Robust Learning | Sungwon Park, Sungwon Han, Sundong Kim, Danu Kim, Sungkyu Park, Seunghoon Hong, Meeyoung Cha |
1094 | Group Collaborative Learning for Co-Salient Object Detection | Qi Fan, Deng-Ping Fan, Huazhu Fu, Chi-Keung Tang, Ling Shao, Yu-Wing Tai |
2435 | Pre-Trained Image Processing Transformer | Hanting Chen, Yunhe Wang, Tianyu Guo, Chang Xu, Yiping Deng, Zhenhua Liu, Siwei Ma, Chunjing Xu, Chao Xu, Wen Gao |
805 | DOTS: Decoupling Operation and Topology in Differentiable Architecture Search | Yu-Chao Gu, Li-Juan Wang, Yun Liu, Yi Yang, Yu-Huan Wu, Shao-Ping Lu, Ming-Ming Cheng |
2909 | Involution: Inverting the Inherence of Convolution for Visual Recognition | Duo Li, Jie Hu, Changhu Wang, Xiangtai Li, Qi She, Lei Zhu, Tong Zhang, Qifeng Chen |
5829 | Cross-Iteration Batch Normalization | Zhuliang Yao, Yue Cao, Shuxin Zheng, Gao Huang, Stephen Lin |
415 | Learning Calibrated Medical Image Segmentation via Multi-Rater Agreement Modeling | Wei Ji, Shuang Yu, Junde Wu, Kai Ma, Cheng Bian, Qi Bi, Jingjing Li, Hanruo Liu, Li Cheng, Yefeng Zheng |
511 | Track To Detect and Segment: An Online Multi-Object Tracker | Jialian Wu, Jiale Cao, Liangchen Song, Yu Wang, Ming Yang, Junsong Yuan |
7330 | Rotation Equivariant Siamese Networks for Tracking | Deepak K. Gupta, Devanshu Arya, Efstratios Gavves |
9904 | SiamMOT: Siamese Multi-Object Tracking | Bing Shuai, Andrew Berneshawi, Xinyu Li, Davide Modolo, Joseph Tighe |
3384 | On Feature Normalization and Data Augmentation | Boyi Li, Felix Wu, Ser-Nam Lim, Serge Belongie, Kilian Q. Weinberger |
2278 | Learning a Self-Expressive Network for Subspace Clustering | Shangzhi Zhang, Chong You, René Vidal, Chun-Guang Li |
3960 | Dual-GAN: Joint BVP and Noise Modeling for Remote Physiological Measurement | Hao Lu, Hu Han, S. Kevin Zhou |
717 | Prototypical Pseudo Label Denoising and Target Structure Learning for Domain Adaptive Semantic Segmentation | Pan Zhang, Bo Zhang, Ting Zhang, Dong Chen, Yong Wang, Fang Wen |
3337 | RPN Prototype Alignment for Domain Adaptive Object Detector | Yixin Zhang, Zilei Wang, Yushi Mao |
10370 | PixMatch: Unsupervised Domain Adaptation via Pixelwise Consistency Training | Luke Melas-Kyriazi, Arjun K. Manrai |
10692 | Adversarial Invariant Learning | Nanyang Ye, Jingxuan Tang, Huayu Deng, Xiao-Yun Zhou, Qianxiao Li, Zhenguo Li, Guang-Zhong Yang, Zhanxing Zhu |
6619 | Few-Shot Incremental Learning With Continually Evolved Classifiers | Chi Zhang, Nan Song, Guosheng Lin, Yun Zheng, Pan Pan, Yinghui Xu |
7960 | Unsupervised Hyperbolic Metric Learning | Jiexi Yan, Lei Luo, Cheng Deng, Heng Huang |
1704 | Audio-Visual Instance Discrimination With Cross-Modal Agreement | Pedro Morgado, Nuno Vasconcelos, Ishan Misra |
10375 | CoCoNets: Continuous Contrastive 3D Scene Representations | Shamit Lal, Mihir Prabhudesai, Ishita Mediratta, Adam W. Harley, Katerina Fragkiadaki |
1461 | Bilateral Grid Learning for Stereo Matching Networks | Bin Xu, Yuhua Xu, Xiaoli Yang, Wei Jia, Yulan Guo |
10612 | Radar-Camera Pixel Depth Association for Depth Completion | Yunfei Long, Daniel Morris, Xiaoming Liu, Marcos Castro, Punarjay Chakravarty, Praveen Narayanan |
4372 | Panoptic Segmentation Forecasting | Colin Graber, Grace Tsai, Michael Firman, Gabriel Brostow, Alexander G. Schwing |
3630 | Probabilistic Modeling of Semantic Ambiguity for Scene Graph Generation | Gengcong Yang, Jingyi Zhang, Yong Zhang, Baoyuan Wu, Yujiu Yang |
1976 | Learning Statistical Texture for Semantic Segmentation | Lanyun Zhu, Deyi Ji, Shiping Zhu, Weihao Gan, Wei Wu, Junjie Yan |
4498 | (AF)²-S3Net: Attentive Feature Fusion With Adaptive Feature Selection for Sparse Semantic Segmentation Network | Ran Cheng, Ryan Razani, Ehsan Taghavi, Enxu Li, Bingbing Liu |
6104 | Scale-Localized Abstract Reasoning | Yaniv Benny, Niv Pekar, Lior Wolf |
5901 | Few-Shot Open-Set Recognition by Transformation Consistency | Minki Jeong, Seokeon Choi, Changick Kim |
2548 | I³Net: Implicit Instance-Invariant Network for Adapting One-Stage Object Detectors | Chaoqi Chen, Zebiao Zheng, Yue Huang, Xinghao Ding, Yizhou Yu |
4591 | Unsupervised Feature Learning by Cross-Level Instance-Group Discrimination | Xudong Wang, Ziwei Liu, Stella X. Yu |
207 | Multi-Shot Temporal Event Localization: A Benchmark | Xiaolong Liu, Yao Hu, Song Bai, Fei Ding, Xiang Bai, Philip H. S. Torr |
3921 | Learning the Predictability of the Future | Dídac Surís, Ruoshi Liu, Carl Vondrick |
8466 | SSAN: Separable Self-Attention Network for Video Representation Learning | Xudong Guo, Xun Guo, Yan Lu |
2811 | Action Shuffle Alternating Learning for Unsupervised Action Segmentation | Jun Li, Sinisa Todorovic |
1215 | Towards Accurate Text-Based Image Captioning With Content Diversity Exploration | Guanghui Xu, Shuaicheng Niu, Mingkui Tan, Yucheng Luo, Qing Du, Qi Wu |
1104 | Kaleido-BERT: Vision-Language Pre-Training on Fashion Domain | Mingchen Zhuge, Dehong Gao, Deng-Ping Fan, Linbo Jin, Ben Chen, Haoming Zhou, Minghui Qiu, Ling Shao |
11014 | Transitional Adaptation of Pretrained Models for Visual Storytelling | Youngjae Yu, Jiwan Chung, Heeseung Yun, Jongseok Kim, Gunhee Kim |
8091 | Multi-Stage Aggregated Transformer Network for Temporal Language Localization in Videos | Mingxing Zhang, Yang Yang, Xinghan Chen, Yanli Ji, Xing Xu, Jingjing Li, Heng Tao Shen |
3613 | Connecting What To Say With Where To Look by Modeling Human Attention Traces | Zihang Meng, Licheng Yu, Ning Zhang, Tamara L. Berg, Babak Damavandi, Vikas Singh, Amy Bearman |
6452 | SOON: Scenario Oriented Object Navigation With Graph-Based Exploration | Fengda Zhu, Xiwen Liang, Yi Zhu, Qizhi Yu, Xiaojun Chang, Xiaodan Liang |
3123 | Counterfactual VQA: A Cause-Effect Look at Language Bias | Yulei Niu, Kaihua Tang, Hanwang Zhang, Zhiwu Lu, Xian-Sheng Hua, Ji-Rong Wen |
778 | Learning by Watching | Jimuyang Zhang, Eshed Ohn-Bar |
5920 | Personalized Outfit Recommendation With Learnable Anchors | Zhi Lu, Yang Hu, Yan Chen, Bing Zeng |
7318 | Safe Local Motion Planning With Self-Supervised Freespace Forecasting | Peiyun Hu, Aaron Huang, John Dolan, David Held, Deva Ramanan |
6805 | Anomaly Detection in Video via Self-Supervised and Multi-Task Learning | Mariana-Iuliana Georgescu, Antonio Bărbălău, Radu Tudor Ionescu, Fahad Shahbaz Khan, Marius Popescu, Mubarak Shah |
Date: Thursday, June 24, 2021 11:00 – 13:30
Paper Session Ten:
Paper ID | Paper Title | Authors |
6333 | Learning High Fidelity Depths of Dressed Humans by Watching Social Media Dance Videos | Yasamin Jafarian, Hyun Soo Park |
4375 | PointNetLK Revisited | Xueqian Li, Jhony Kaesemodel Pontes, Simon Lucey |
4318 | BRepNet: A Topological Message Passing System for Solid Models | Joseph G. Lambourne, Karl D.D. Willis, Pradeep Kumar Jayaraman, Aditya Sanghi, Peter Meltzer, Hooman Shayani |
7395 | KeypointDeformer: Unsupervised 3D Keypoint Discovery for Shape Control | Tomas Jakab, Richard Tucker, Ameesh Makadia, Jiajun Wu, Noah Snavely, Angjoo Kanazawa |
2408 | Learning View-Disentangled Human Pose Representation by Contrastive Cross-View Mutual Information Maximization | Long Zhao, Yuxiao Wang, Jiaping Zhao, Liangzhe Yuan, Jennifer J. Sun, Florian Schroff, Hartwig Adam, Xi Peng, Dimitris Metaxas, Ting Liu |
1927 | i3DMM: Deep Implicit 3D Morphable Model of Human Heads | Tarun Yenamandra, Ayush Tewari, Florian Bernard, Hans-Peter Seidel, Mohamed Elgharib, Daniel Cremers, Christian Theobalt |
3521 | Reconstructing 3D Human Pose by Watching Humans in the Mirror | Qi Fang, Qing Shuai, Junting Dong, Hujun Bao, Xiaowei Zhou |
1539 | EventZoom: Learning To Denoise and Super Resolve Neuromorphic Events | Peiqi Duan, Zihao W. Wang, Xinyu Zhou, Yi Ma, Boxin Shi |
3102 | Spatially-Varying Outdoor Lighting Estimation From Intrinsics | Yongjie Zhu, Yinda Zhang, Si Li, Boxin Shi |
5385 | Knowledge Evolution in Neural Networks | Ahmed Taha, Abhinav Shrivastava, Larry S. Davis |
10484 | Understanding Failures of Deep Networks via Robust Feature Extraction | Sahil Singla, Besmira Nushi, Shital Shah, Ece Kamar, Eric Horvitz |
5532 | StyleSpace Analysis: Disentangled Controls for StyleGAN Image Generation | Zongze Wu, Dani Lischinski, Eli Shechtman |
4270 | Taming Transformers for High-Resolution Image Synthesis | Patrick Esser, Robin Rombach, Björn Ommer |
7246 | Benchmarking Representation Learning for Natural World Image Collections | Grant Van Horn, Elijah Cole, Sara Beery, Kimberly Wilber, Serge Belongie, Oisin Mac Aodha |
10649 | Scaling Local Self-Attention for Parameter Efficient Visual Backbones | Ashish Vaswani, Prajit Ramachandran, Aravind Srinivas, Niki Parmar, Blake Hechtman, Jonathon Shlens |
10078 | IMODAL: Creating Learnable User-Defined Deformation Models | Leander Lacroix, Benjamin Charlier, Alain Trouvé, Barbara Gris |
3008 | Unsupervised Multi-Source Domain Adaptation for Person Re-Identification | Zechen Bai, Zhigang Wang, Jian Wang, Di Hu, Errui Ding |
5556 | Generalization on Unseen Domains via Inference-Time Label-Preserving Target Projections | Prashant Pandey, Mrigank Raman, Sumanth Varambally, Prathosh AP |
2616 | Robust Audio-Visual Instance Discrimination | Pedro Morgado, Ishan Misra, Nuno Vasconcelos |
1269 | Binary TTC: A Temporal Geofence for Autonomous Navigation | Abhishek Badki, Orazio Gallo, Jan Kautz, Pradeep Sen |
800 | LED²-Net: Monocular 360° Layout Estimation via Differentiable Depth Rendering | Fu-En Wang, Yu-Hsuan Yeh, Min Sun, Wei-Chen Chiu, Yi-Hsuan Tsai |
2062 | A Realistic Evaluation of Semi-Supervised Learning for Fine-Grained Classification | Jong-Chyi Su, Zezhou Cheng, Subhransu Maji |
2425 | Seeing Out of the Box: End-to-End Pre-Training for Vision-Language Representation Learning | Zhicheng Huang, Zhaoyang Zeng, Yupan Huang, Bei Liu, Dongmei Fu, Jianlong Fu |
2757 | Intentonomy: A Dataset and Study Towards Human Intent Understanding | Menglin Jia, Zuxuan Wu, Austin Reiter, Claire Cardie, Serge Belongie, Ser-Nam Lim |
1178 | Mutual Graph Learning for Camouflaged Object Detection | Qiang Zhai, Xin Li, Fan Yang, Chenglizhao Chen, Hong Cheng, Deng-Ping Fan |
10700 | Hallucination Improves Few-Shot Object Detection | Weilin Zhang, Yu-Xiong Wang |
7685 | Learning To Predict Visual Attributes in the Wild | Khoi Pham, Kushal Kafle, Zhe Lin, Zhihong Ding, Scott Cohen, Quan Tran, Abhinav Shrivastava |
745 | Scaled-YOLOv4: Scaling Cross Stage Partial Network | Chien-Yao Wang, Alexey Bochkovskiy, Hong-Yuan Mark Liao |
2620 | You Only Look One-Level Feature | Qiang Chen, Yingming Wang, Tong Yang, Xiangyu Zhang, Jian Cheng, Jian Sun |
3884 | Neighborhood Normalization for Robust Geometric Feature Learning | Xingtong Liu, Benjamin D. Killeen, Ayushi Sinha, Masaru Ishii, Gregory D. Hager, Russell H. Taylor, Mathias Unberath |
2987 | High-Fidelity Face Tracking for AR/VR via Deep Lighting Adaptation | Lele Chen, Chen Cao, Fernando De la Torre, Jason Saragih, Chenliang Xu, Yaser Sheikh |
6689 | Cuboids Revisited: Learning Robust 3D Shape Fitting to Single RGB Images | Florian Kluger, Hanno Ackermann, Eric Brachmann, Michael Ying Yang, Bodo Rosenhahn |
587 | Cycle4Completion: Unpaired Point Cloud Completion Using Cycle Transformation With Missing Region Coding | Xin Wen, Zhizhong Han, Yan-Pei Cao, Pengfei Wan, Wen Zheng, Yu-Shen Liu |
3093 | LiDAR-Based Panoptic Segmentation via Dynamic Shifting Network | Fangzhou Hong, Hui Zhou, Xinge Zhu, Hongsheng Li, Ziwei Liu |
6286 | RPSRNet: End-to-End Trainable Rigid Point Set Registration Network Using Barnes-Hut 2^D-Tree Representation | Sk Aziz Ali, Kerem Kahraman, Gerd Reis, Didier Stricker |
2397 | Online Learning of a Probabilistic and Adaptive Scene Representation | Zike Yan, Xin Wang, Hongbin Zha |
1679 | Quantum Permutation Synchronization | Tolga Birdal, Vladislav Golyanik, Christian Theobalt, Leonidas J. Guibas |
375 | Wide-Baseline Multi-Camera Calibration Using Person Re-Identification | Yan Xu, Yu-Jhe Li, Xinshuo Weng, Kris Kitani |
10046 | STaR: Self-Supervised Tracking and Reconstruction of Rigid Objects in Motion With Neural Rendering | Wentao Yuan, Zhaoyang Lv, Tanner Schmidt, Steven Lovegrove |
10321 | PatchMatch-Based Neighborhood Consensus for Semantic Correspondence | Jae Yong Lee, Joseph DeGol, Victor Fragoso, Sudipta N. Sinha |
1038 | Learning Feature Aggregation for Deep 3D Morphable Models | Zhixiang Chen, Tae-Kyun Kim |
7342 | A Functional Approach to Rotation Equivariant Non-Linearities for Tensor Field Networks | Adrien Poulenard, Leonidas J. Guibas |
8801 | Generalizing to the Open World: Deep Visual Odometry With Online Adaptation | Shunkai Li, Xin Wu, Yingdian Cao, Hongbin Zha |
6665 | Panoptic-PolarNet: Proposal-Free LiDAR Point Cloud Panoptic Segmentation | Zixiang Zhou, Yang Zhang, Hassan Foroosh |
1136 | 3D Spatial Recognition Without Spatially Labeled 3D | Zhongzheng Ren, Ishan Misra, Alexander G. Schwing, Rohit Girdhar |
4240 | ACTION-Net: Multipath Excitation for Action Recognition | Zhengwei Wang, Qi She, Aljosa Smolic |
7689 | Anticipating Human Actions by Correlating Past With the Future With Jaccard Similarity Measures | Basura Fernando, Samitha Herath |
5941 | Glance and Gaze: Inferring Action-Aware Points for One-Stage Human-Object Interaction Detection | Xubin Zhong, Xian Qu, Changxing Ding, Dacheng Tao |
5952 | How Robust Are Randomized Smoothing Based Defenses to Data Poisoning? | Akshay Mehra, Bhavya Kailkhura, Pin-Yu Chen, Jihun Hamm |
5477 | FaceSec: A Fine-Grained Robustness Evaluation Framework for Face Recognition Systems | Liang Tong, Zhengzhang Chen, Jingchao Ni, Wei Cheng, Dongjin Song, Haifeng Chen, Yevgeniy Vorobeychik |
3534 | Rethinking the Heatmap Regression for Bottom-Up Human Pose Estimation | Zhengxiong Luo, Zhicheng Wang, Yan Huang, Liang Wang, Tieniu Tan, Erjin Zhou |
5102 | Camera-Space Hand Mesh Recovery via Semantic Aggregation and Adaptive 2D-1D Registration | Xingyu Chen, Yufeng Liu, Chongyang Ma, Jianlong Chang, Huayan Wang, Tian Chen, Xiaoyan Guo, Pengfei Wan, Wen Zheng |
5498 | S³: Neural Shape, Skeleton, and Skinning Fields for 3D Human Modeling | Ze Yang, Shenlong Wang, Sivabalan Manivasagam, Zeng Huang, Wei-Chiu Ma, Xinchen Yan, Ersin Yumer, Raquel Urtasun |
5404 | CanonPose: Self-Supervised Monocular 3D Human Pose Estimation in the Wild | Bastian Wandt, Marco Rudolph, Petrissa Zell, Helge Rhodin, Bodo Rosenhahn |
5284 | Lipstick Ain’t Enough: Beyond Color Matching for In-the-Wild Makeup Transfer | Thao Nguyen, Anh Tuan Tran, Minh Hoai |
1804 | Virtual Fully-Connected Layer: Training a Large-Scale Face Recognition Dataset With Limited Computational Resources | Pengyu Li, Biao Wang, Lei Zhang |
3673 | Learning From the Master: Distilling Cross-Modal Advanced Knowledge for Lip Reading | Sucheng Ren, Yong Du, Jianming Lv, Guoqiang Han, Shengfeng He |
4504 | Watching You: Global-Guided Reciprocal Learning for Video-Based Person Re-Identification | Xuehu Liu, Pingping Zhang, Chenyang Yu, Huchuan Lu, Xiaoyun Yang |
11579 | Sparse Multi-Path Corrections in Fringe Projection Profilometry | Yu Zhang, Daniel Lau, David Wipf |
3126 | Attention-Guided Image Compression by Deep Reconstruction of Compressive Sensed Saliency Skeleton | Xi Zhang, Xiaolin Wu |
603 | Invertible Denoising Network: A Light Solution for Real Noise Removal | Yang Liu, Zhenyue Qin, Saeed Anwar, Pan Ji, Dongwoo Kim, Sabrina Caldwell, Tom Gedeon |
1473 | Multi-Decoding Deraining Network and Quasi-Sparsity Based Training | Yinglong Wang, Chao Ma, Bing Zeng |
3690 | Unsupervised Real-World Image Super Resolution via Domain-Distance Aware Training | Yunxuan Wei, Shuhang Gu, Yawei Li, Radu Timofte, Longcun Jin, Hengjie Song |
2424 | Single Image Reflection Removal With Absorption Effect | Qian Zheng, Boxin Shi, Jinnan Chen, Xudong Jiang, Ling-Yu Duan, Alex C. Kot |
5944 | Exploiting Aliasing for Manga Restoration | Minshan Xie, Menghan Xia, Tien-Tsin Wong |
5908 | Rich Context Aggregation With Reflection Prior for Glass Surface Detection | Jiaying Lin, Zebang He, Rynson W.H. Lau |
2140 | MR Image Super-Resolution With Squeeze and Excitation Reasoning Attention Network | Yulun Zhang, Kai Li, Kunpeng Li, Yun Fu |
5684 | Rich Features for Perceptual Quality Assessment of UGC Videos | Yilin Wang, Junjie Ke, Hossein Talebi, Joong Gon Yim, Neil Birkbeck, Balu Adsumilli, Peyman Milanfar, Feng Yang |
5317 | A 3D GAN for Improved Large-Pose Facial Recognition | Richard T. Marriott, Sami Romdhani, Liming Chen |
5433 | Sewer-ML: A Multi-Label Sewer Defect Classification Dataset and Benchmark | Joakim Bruslund Haurum, Thomas B. Moeslund |
6109 | AGORA: Avatars in Geography Optimized for Regression Analysis | Priyanka Patel, Chun-Hao P. Huang, Joachim Tesch, David T. Hoffmann, Shashank Tripathi, Michael J. Black |
8582 | SKFAC: Training Neural Networks With Faster Kronecker-Factored Approximate Curvature | Zedong Tang, Fenlong Jiang, Maoguo Gong, Hao Li, Yue Wu, Fan Yu, Zidong Wang, Min Wang |
6628 | Tree-Like Decision Distillation | Jie Song, Haofei Zhang, Xinchao Wang, Mengqi Xue, Ying Chen, Li Sun, Dacheng Tao, Mingli Song |
7774 | How Does Topology Influence Gradient Propagation and Model Performance of Deep Networks With DenseNet-Type Skip Connections? | Kartikeya Bhardwaj, Guihong Li, Radu Marculescu |
3422 | EnD: Entangling and Disentangling Deep Representations for Bias Correction | Enzo Tartaglione, Carlo Alberto Barbano, Marco Grangetto |
5223 | Learning Decision Trees Recurrently Through Communication | Stephan Alaniz, Diego Marcos, Bernt Schiele, Zeynep Akata |
4928 | Neural Response Interpretation Through the Lens of Critical Pathways | Ashkan Khakzar, Soroosh Baselizadeh, Saurabh Khanduja, Christian Rupprecht, Seong Tae Kim, Nassir Navab |
7469 | Masksembles for Uncertainty Estimation | Nikita Durasov, Timur Bagautdinov, Pierre Baque, Pascal Fua |
3170 | Self-Supervised Video Hashing via Bidirectional Transformers | Shuyan Li, Xiu Li, Jiwen Lu, Jie Zhou |
8530 | 3D Shape Generation With Grid-Based Implicit Functions | Moritz Ibing, Isaak Lim, Leif Kobbelt |
1512 | Positional Encoding As Spatial Inductive Bias in GANs | Rui Xu, Xintao Wang, Kai Chen, Bolei Zhou, Chen Change Loy |
8688 | Blur, Noise, and Compression Robust Generative Adversarial Networks | Takuhiro Kaneko, Tatsuya Harada |
4909 | Learning by Planning: Language-Guided Global Image Editing | Jing Shi, Ning Xu, Yihang Xu, Trung Bui, Franck Dernoncourt, Chenliang Xu |
531 | Teachers Do More Than Teach: Compressing Image-to-Image Models | Qing Jin, Jian Ren, Oliver J. Woodford, Jiazhuo Wang, Geng Yuan, Yanzhi Wang, Sergey Tulyakov |
5205 | Autoregressive Stylized Motion Synthesis With Generative Flow | Yu-Hui Wen, Zhipeng Yang, Hongbo Fu, Lin Gao, Yanan Sun, Yong-Jin Liu |
1325 | MUST-GAN: Multi-Level Statistics Transfer for Self-Driven Person Image Generation | Tianxiang Ma, Bo Peng, Wei Wang, Jing Dong |
1834 | House-GAN++: Generative Adversarial Layout Refinement Network Towards Intelligent Computational Agent for Professional Architects | Nelson Nauata, Sepidehsadat Hosseini, Kai-Hung Chang, Hang Chu, Chin-Yi Cheng, Yasutaka Furukawa |
1234 | Variational Transformer Networks for Layout Generation | Diego Martín Arroyo, Janis Postels, Federico Tombari |
11392 | Motion Representations for Articulated Animation | Aliaksandr Siarohin, Oliver J. Woodford, Jian Ren, Menglei Chai, Sergey Tulyakov |
10917 | Pareto Self-Supervised Training for Few-Shot Learning | Zhengyu Chen, Jixie Ge, Heshen Zhan, Siteng Huang, Donglin Wang |
3045 | RaScaNet: Learning Tiny Models by Raster-Scanning Images | Jaehyoung Yoo, Dongwook Lee, Changyong Son, Sangil Jung, ByungIn Yoo, Changkyu Choi, Jae-Joon Han, Bohyung Han |
7379 | AlphaMatch: Improving Consistency for Semi-Supervised Learning With Alpha-Divergence | Chengyue Gong, Dilin Wang, Qiang Liu |
4001 | Nearest Neighbor Matching for Deep Clustering | Zhiyuan Dang, Cheng Deng, Xu Yang, Kun Wei, Heng Huang |
1985 | DeepACG: Co-Saliency Detection via Semantic-Aware Contrast Gromov-Wasserstein Distance | Kaihua Zhang, Mingliang Dong, Bo Liu, Xiao-Tong Yuan, Qingshan Liu |
2807 | Coordinate Attention for Efficient Mobile Network Design | Qibin Hou, Daquan Zhou, Jiashi Feng |
1157 | Landmark Regularization: Ranking Guided Super-Net Training in Neural Architecture Search | Kaicheng Yu, René Ranftl, Mathieu Salzmann |
7140 | RepVGG: Making VGG-Style ConvNets Great Again | Xiaohan Ding, Xiangyu Zhang, Ningning Ma, Jungong Han, Guiguang Ding, Jian Sun |
3841 | 3D Graph Anatomy Geometry-Integrated Network for Pancreatic Mass Segmentation, Diagnosis, and Quantitative Patient Management | Tianyi Zhao, Kai Cao, Jiawen Yao, Isabella Nogues, Le Lu, Lingyun Huang, Jing Xiao, Zhaozheng Yin, Ling Zhang |
789 | Exploring and Distilling Posterior and Prior Knowledge for Radiology Report Generation | Fenglin Liu, Xian Wu, Shen Ge, Wei Fan, Yuexian Zou |
1435 | Towards More Flexible and Accurate Object Tracking With Natural Language: Algorithms and Benchmark | Xiao Wang, Xiujun Shu, Zhipeng Zhang, Bo Jiang, Yaowei Wang, Yonghong Tian, Feng Wu |
7848 | STMTrack: Template-Free Visual Tracking With Space-Time Memory Networks | Zhihong Fu, Qingjie Liu, Zehua Fu, Yunhong Wang |
6241 | DyGLIP: A Dynamic Graph Model With Link Prediction for Accurate Multi-Camera Multiple Object Tracking | Kha Gia Quach, Pha Nguyen, Huu Le, Thanh-Dat Truong, Chi Nhan Duong, Minh-Triet Tran, Khoa Luu |
3387 | SuperMix: Supervising the Mixing Data Augmentation | Ali Dabouei, Sobhan Soleymani, Fariborz Taherkhani, Nasser M. Nasrabadi |
1924 | Monte Carlo Scene Search for 3D Scene Understanding | Shreyas Hampali, Sinisa Stekovic, Sayan Deb Sarkar, Chetan S. Kumar, Friedrich Fraundorfer, Vincent Lepetit |
6437 | MAZE: Data-Free Model Stealing Attack Using Zeroth-Order Gradient Estimation | Sanjay Kariyappa, Atul Prakash, Moinuddin K Qureshi |
955 | Visualizing Adapted Knowledge in Domain Transfer | Yunzhong Hou, Liang Zheng |
3431 | Prototypical Cross-Domain Self-Supervised Learning for Few-Shot Unsupervised Domain Adaptation | Xiangyu Yue, Zangwei Zheng, Shanghang Zhang, Yang Gao, Trevor Darrell, Kurt Keutzer, Alberto Sangiovanni Vincentelli |
10462 | KSM: Fast Multiple Task Adaption via Kernel-Wise Soft Mask Learning | Li Yang, Zhezhi He, Junshan Zhang, Deliang Fan |
1510 | Picasso: A CUDA-Based Library for Deep Learning Over 3D Meshes | Huan Lei, Naveed Akhtar, Ajmal Mian |
6646 | Efficient Feature Transformations for Discriminative and Generative Continual Learning | Vinay Kumar Verma, Kevin J Liang, Nikhil Mehta, Piyush Rai, Lawrence Carin |
8858 | Spatial Assembly Networks for Image Representation Learning | Yang Li, Shichao Kan, Jianhe Yuan, Wenming Cao, Zhihai He |
1846 | Self-Supervised Video Representation Learning by Context and Motion Decoupling | Lianghua Huang, Yu Liu, Bin Wang, Pan Pan, Yinghui Xu, Rong Jin |
2036 | Learning Probabilistic Ordinal Embeddings for Uncertainty-Aware Regression | Wanhua Li, Xiaoke Huang, Jiwen Lu, Jianjiang Feng, Jie Zhou |
1728 | CFNet: Cascade and Fused Cost Volume for Robust Stereo Matching | Zhelun Shen, Yuchao Dai, Zhibo Rao |
10892 | Depth Completion Using Plane-Residual Representation | Byeong-Uk Lee, Kyunghyun Lee, In So Kweon |
4697 | Look Closer To Segment Better: Boundary Patch Refinement for Instance Segmentation | Chufeng Tang, Hang Chen, Xiao Li, Jianmin Li, Zhaoxiang Zhang, Xiaolin Hu |
5562 | Energy-Based Learning for Scene Graph Generation | Mohammed Suhail, Abhay Mittal, Behjat Siddiquie, Chris Broaddus, Jayan Eledath, Gerard Medioni, Leonid Sigal |
2178 | Heterogeneous Grid Convolution for Adaptive, Efficient, and Controllable Computation | Ryuhei Hamaguchi, Yasutaka Furukawa, Masaki Onishi, Ken Sakurada |
4537 | DCNAS: Densely Connected Neural Architecture Search for Semantic Image Segmentation | Xiong Zhang, Hongmin Xu, Hong Mo, Jianchao Tan, Cheng Yang, Lei Wang, Wenqi Ren |
2546 | Weakly Supervised Instance Segmentation for Videos With Temporal Mask Consistency | Qing Liu, Vignesh Ramanathan, Dhruv Mahajan, Alan Yuille, Zhenheng Yang |
10766 | Few-Shot Segmentation Without Meta-Learning: A Good Transductive Inference Is All You Need? | Malik Boudiaf, Hoel Kervadec, Ziko Imtiaz Masud, Pablo Piantanida, Ismail Ben Ayed, Jose Dolz |
3520 | Conditional Bures Metric for Domain Adaptation | You-Wei Luo, Chuan-Xian Ren |
5175 | Relative Order Analysis and Optimization for Unsupervised Deep Metric Learning | Shichao Kan, Yigang Cen, Yang Li, Vladimir Mladenovic, Zhihai He |
1937 | MIST: Multiple Instance Self-Training Framework for Video Anomaly Detection | Jia-Chang Feng, Fa-Ting Hong, Wei-Shi Zheng |
4873 | Patch-VQ: ‘Patching Up’ the Video Quality Problem | Zhenqiang Ying, Maniratnam Mandal, Deepti Ghadiyaram, Alan Bovik |
10743 | Boosting Video Representation Learning With Multi-Faceted Integration | Zhaofan Qiu, Ting Yao, Chong-Wah Ngo, Xiao-Ping Zhang, Dong Wu, Tao Mei |
3294 | Delving Deep Into Many-to-Many Attention for Few-Shot Video Object Segmentation | Haoxin Chen, Hanjie Wu, Nanxuan Zhao, Sucheng Ren, Shengfeng He |
2690 | FAIEr: Fidelity and Adequacy Ensured Image Caption Evaluation | Sijin Wang, Ziwei Yao, Ruiping Wang, Zhongqin Wu, Xilin Chen |
2869 | Iterative Shrinking for Referring Expression Grounding Using Deep Reinforcement Learning | Mingjie Sun, Jimin Xiao, Eng Gee Lim |
265 | Repetitive Activity Counting by Sight and Sound | Yunhua Zhang, Ling Shao, Cees G. M. Snoek |
822 | Audio-Driven Emotional Video Portraits | Xinya Ji, Hang Zhou, Kaisiyuan Wang, Wayne Wu, Chen Change Loy, Xun Cao, Feng Xu |
4853 | Improving Weakly Supervised Visual Grounding by Contrastive Knowledge Distillation | Liwei Wang, Jing Huang, Yin Li, Kun Xu, Zhengyuan Yang, Dong Yu |
6533 | Hierarchical and Partially Observable Goal-Driven Policy Learning With Goals Relational Graph | Xin Ye, Yezhou Yang |
7420 | KRISP: Integrating Implicit and Symbolic Knowledge for Open-Domain Knowledge-Based VQA | Kenneth Marino, Xinlei Chen, Devi Parikh, Abhinav Gupta, Marcus Rohrbach |
2468 | Focus on Local: Detecting Lane Marker From Bottom Up via Key Point | Zhan Qu, Huan Jin, Yang Zhou, Zhen Yang, Wei Zhang |
6162 | VITON-HD: High-Resolution Virtual Try-On via Misalignment-Aware Normalization | Seunghwan Choi, Sunghyun Park, Minsoo Lee, Jaegul Choo |
4969 | Patch-NetVLAD: Multi-Scale Fusion of Locally-Global Descriptors for Place Recognition | Stephen Hausler, Sourav Garg, Ming Xu, Michael Milford, Tobias Fischer |
911 | DeRF: Decomposed Radiance Fields | Daniel Rebain, Wei Jiang, Soroosh Yazdani, Ke Li, Kwang Moo Yi, Andrea Tagliasacchi |
Date: Thursday, June 24, 2021 22:00 – 24:30
Paper Session Eleven:
Paper ID | Paper Title | Authors |
3090 | POSEFusion: Pose-Guided Selective Fusion for Single-View Human Volumetric Capture | Zhe Li, Tao Yu, Zerong Zheng, Kaiwen Guo, Yebin Liu |
4439 | FESTA: Flow Estimation via Spatial-Temporal Attention for Scene Point Clouds | Haiyan Wang, Jiahao Pang, Muhammad A. Lodhi, Yingli Tian, Dong Tian |
4757 | Isometric Multi-Shape Matching | Maolin Gao, Zorah Lähner, Johan Thunberg, Daniel Cremers, Florian Bernard |
2475 | PatchmatchNet: Learned Multi-View Patchmatch Stereo | Fangjinhua Wang, Silvano Galliani, Christoph Vogel, Pablo Speciale, Marc Pollefeys |
840 | Point 4D Transformer Networks for Spatio-Temporal Modeling in Point Cloud Videos | Hehe Fan, Yi Yang, Mohan Kankanhalli |
4104 | Learning To Aggregate and Personalize 3D Face From In-the-Wild Photo Collection | Zhenyu Zhang, Yanhao Ge, Renwang Chen, Ying Tai, Yan Yan, Jian Yang, Chengjie Wang, Jilin Li, Feiyue Huang |
250 | MagFace: A Universal Representation for Face Recognition and Quality Assessment | Qiang Meng, Shichao Zhao, Zhida Huang, Feng Zhou |
5291 | Event-Based Synthetic Aperture Imaging With a Hybrid Network | Xiang Zhang, Wei Liao, Lei Yu, Wen Yang, Gui-Song Xia |
300 | GLEAN: Generative Latent Bank for Large-Factor Image Super-Resolution | Kelvin C.K. Chan, Xintao Wang, Xiangyu Xu, Jinwei Gu, Chen Change Loy |
10597 | NPAS: A Compiler-Aware Framework of Unified Network Pruning and Architecture Search for Beyond Real-Time Mobile Acceleration | Zhengang Li, Geng Yuan, Wei Niu, Pu Zhao, Yanyu Li, Yuxuan Cai, Xuan Shen, Zheng Zhan, Zhenglun Kong, Qing Jin, Zhiyu Chen, Sijia Liu, Kaiyuan Yang, Bin Ren, Yanzhi Wang, Xue Lin |
283 | Privacy-Preserving Image Features via Adversarial Affine Subspace Embeddings | Mihai Dusmanu, Johannes L. Schönberger, Sudipta N. Sinha, Marc Pollefeys |
8523 | Image Generators With Conditionally-Independent Pixel Synthesis | Ivan Anokhin, Kirill Demochkin, Taras Khakhulin, Gleb Sterkin, Victor Lempitsky, Denis Korzhenkov |
7086 | CoMoGAN: Continuous Model-Guided Image-to-Image Translation | Fabio Pizzati, Pietro Cerri, Raoul de Charette |
3638 | Positive-Congruent Training: Towards Regression-Free Model Updates | Sijie Yan, Yuanjun Xiong, Kaustav Kundu, Shuo Yang, Siqi Deng, Meng Wang, Wei Xia, Stefano Soatto |
5052 | Capsule Network Is Not More Robust Than Convolutional Network | Jindong Gu, Volker Tresp, Han Hu |
1228 | Dual-Stream Multiple Instance Learning Network for Whole Slide Image Classification With Self-Supervised Contrastive Learning | Bin Li, Yin Li, Kevin W. Eliceiri |
919 | Probabilistic Tracklet Scoring and Inpainting for Multiple Object Tracking | Fatemeh Saleh, Sadegh Aliakbarian, Hamid Rezatofighi, Mathieu Salzmann, Stephen Gould |
6367 | Adaptive Methods for Real-World Domain Generalization | Abhimanyu Dubey, Vignesh Ramanathan, Alex Pentland, Dhruv Mahajan |
4382 | Self-Supervised Geometric Perception | Heng Yang, Wei Dong, Luca Carlone, Vladlen Koltun |
1702 | HITNet: Hierarchical Iterative Tile Refinement Network for Real-Time Stereo Matching | Vladimir Tankovich, Christian Häne, Yinda Zhang, Adarsh Kowdle, Sean Fanello, Sofien Bouaziz |
1348 | Bidirectional Projection Network for Cross Dimension Scene Understanding | Wenbo Hu, Hengshuang Zhao, Li Jiang, Jiaya Jia, Tien-Tsin Wong |
3364 | A Fourier-Based Framework for Domain Generalization | Qinwei Xu, Ruipeng Zhang, Ya Zhang, Yanfeng Wang, Qi Tian |
5421 | Open-Vocabulary Object Detection Using Captions | Alireza Zareian, Kevin Dela Rosa, Derek Hao Hu, Shih-Fu Chang |
7124 | MP3: A Unified Model To Map, Perceive, Predict and Plan | Sergio Casas, Abbas Sadat, Raquel Urtasun |
6158 | Hierarchical Lovász Embeddings for Proposal-Free Panoptic Segmentation | Tommi Kerola, Jie Li, Atsushi Kanehira, Yasunori Kudo, Alexis Vallet, Adrien Gaidon |
10741 | Accurate Few-Shot Object Detection With Support-Query Mutual Guidance and Hybrid Loss | Lu Zhang, Shuigeng Zhou, Jihong Guan, Ji Zhang |
7790 | GLAVNet: Global-Local Audio-Visual Cues for Fine-Grained Material Recognition | Fengmin Shi, Jie Guo, Haonan Zhang, Shan Yang, Xiying Wang, Yanwen Guo |
4506 | Multi-Scale Aligned Distillation for Low-Resolution Detection | Lu Qi, Jason Kuen, Jiuxiang Gu, Zhe Lin, Yi Wang, Yukang Chen, Yanwei Li, Jiaya Jia |
2829 | Sparse R-CNN: End-to-End Object Detection With Learnable Proposals | Peize Sun, Rufeng Zhang, Yi Jiang, Tao Kong, Chenfeng Xu, Wei Zhan, Masayoshi Tomizuka, Lei Li, Zehuan Yuan, Changhu Wang, Ping Luo |
5547 | Learning View Selection for 3D Scenes | Yifan Sun, Qixing Huang, Dun-Yu Hsiao, Li Guan, Gang Hua |
3305 | Multi-Person Implicit Reconstruction From a Single Image | Armin Mustafa, Akin Caliskan, Lourdes Agapito, Adrian Hilton |
8601 | Neural Descent for Visual 3D Human Pose and Shape | Andrei Zanfir, Eduard Gabriel Bazavan, Mihai Zanfir, William T. Freeman, Rahul Sukthankar, Cristian Sminchisescu |
801 | SE-SSD: Self-Ensembling Single-Stage Object Detector From Point Cloud | Wu Zheng, Weiliang Tang, Li Jiang, Chi-Wing Fu |
3198 | SCF-Net: Learning Spatial Contextual Features for Large-Scale Point Cloud Segmentation | Siqi Fan, Qiulei Dong, Fenghua Zhu, Yisheng Lv, Peijun Ye, Fei-Yue Wang |
6371 | Equivariant Point Network for 3D Point Cloud Analysis | Haiwei Chen, Shichen Liu, Weikai Chen, Hao Li, Randall Hill |
4353 | DeepSurfels: Learning Online Appearance Fusion | Marko Mihajlovic, Silvan Weder, Marc Pollefeys, Martin R. Oswald |
4550 | Efficient Deformable Shape Correspondence via Multiscale Spectral Manifold Wavelets Preservation | Ling Hu, Qinsong Li, Shengjun Liu, Xinru Liu |
2531 | Efficient Initial Pose-Graph Generation for Global SfM | Daniel Barath, Dmytro Mishkin, Iván Eichhardt, Ilia Shipachev, Jiří Matas |
10099 | AutoInt: Automatic Integration for Fast Neural Volume Rendering | David B. Lindell, Julien N. P. Martel, Gordon Wetzstein |
1287 | Extreme Rotation Estimation Using Dense Correlation Volumes | Ruojin Cai, Bharath Hariharan, Noah Snavely, Hadar Averbuch-Elor |
468 | A Quasiconvex Formulation for Radial Cameras | Carl Olsson, Viktor Larsson, Fredrik Kahl |
10278 | ReAgent: Point Cloud Registration Using Imitation and Reinforcement Learning | Dominik Bauer, Timothy Patten, Markus Vincze |
10091 | Monocular Depth Estimation via Listwise Ranking Using the Plackett-Luce Model | Julian Lienen, Eyke Hüllermeier, Ralph Ewerth, Nils Nommensen |
7531 | HVPR: Hybrid Voxel-Point Representation for Single-Stage 3D Object Detection | Jongyoun Noh, Sanghoon Lee, Bumsub Ham |
2503 | 3DIoUMatch: Leveraging IoU Prediction for Semi-Supervised 3D Object Detection | He Wang, Yezhen Cong, Or Litany, Yue Gao, Leonidas J. Guibas |
5573 | Multi-Label Activity Recognition Using Activity-Specific Features and Activity Correlations | Yanyi Zhang, Xinyu Li, Ivan Marsic |
8473 | LaPred: Lane-Aware Prediction of Multi-Modal Future Trajectories of Dynamic Agents | ByeoungDo Kim, Seong Hyeon Park, Seokhwan Lee, Elbek Khoshimjonov, Dongsuk Kum, Junsoo Kim, Jeong Soo Kim, Jun Won Choi |
6457 | Detecting Human-Object Interaction via Fabricated Compositional Learning | Zhi Hou, Baosheng Yu, Yu Qiao, Xiaojiang Peng, Dacheng Tao |
6161 | Understanding the Robustness of Skeleton-Based Action Recognition Under Adversarial Attack | He Wang, Feixiang He, Zhexi Peng, Tianjia Shao, Yong-Liang Yang, Kun Zhou, David Hogg |
6375 | Invisible Perturbations: Physical Adversarial Examples Exploiting the Rolling Shutter Effect | Athena Sayles, Ashish Hooda, Mohit Gupta, Rahul Chatterjee, Earlence Fernandes |
5160 | Bottom-Up Human Pose Estimation via Disentangled Keypoint Regression | Zigang Geng, Ke Sun, Bin Xiao, Zhaoxiang Zhang, Jingdong Wang |
6336 | Semi-Supervised 3D Hand-Object Poses Estimation With Interactions in Time | Shaowei Liu, Hanwen Jiang, Jiarui Xu, Sifei Liu, Xiaolong Wang |
5809 | Inverse Simulation: Reconstructing Dynamic Geometry of Clothed Humans via Optimal Control | Jingfan Guo, Jie Li, Rahul Narain, Hyun Soo Park |
5950 | Populating 3D Scenes by Learning Human-Scene Interaction | Mohamed Hassan, Partha Ghosh, Joachim Tesch, Dimitrios Tzionas, Michael J. Black |
6522 | Towards High Fidelity Face Relighting With Realistic Shadows | Andrew Hou, Ze Zhang, Michel Sarkis, Ning Bi, Yiying Tong, Xiaoming Liu |
2484 | VirFace: Enhancing Face Recognition via Unlabeled Shallow Data | Wenyu Li, Tianchu Guo, Pengyu Li, Binghui Chen, Biao Wang, Wangmeng Zuo, Lei Zhang |
4317 | Birds of a Feather: Capturing Avian Shape Models From Images | Yufu Wang, Nikos Kolotouros, Kostas Daniilidis, Marc Badger |
5010 | Unsupervised Pre-Training for Person Re-Identification | Dengpan Fu, Dongdong Chen, Jianmin Bao, Hao Yang, Lu Yuan, Lei Zhang, Houqiang Li, Dong Chen |
2450 | Indoor Lighting Estimation Using an Event Camera | Zehao Chen, Qian Zheng, Peisong Niu, Huajin Tang, Gang Pan |
6538 | Checkerboard Context Model for Efficient Learned Image Compression | Dailan He, Yaoyan Zheng, Baocheng Sun, Yan Wang, Hongwei Qin |
2144 | Neighbor2Neighbor: Self-Supervised Denoising From Single Noisy Images | Tao Huang, Songjiang Li, Xu Jia, Huchuan Lu, Jianzhuang Liu |
1798 | From Rain Generation to Rain Removal | Hong Wang, Zongsheng Yue, Qi Xie, Qian Zhao, Yefeng Zheng, Deyu Meng |
4459 | Rank-One Prior: Toward Real-Time Scene Recovery | Jun Liu, Wen Liu, Jianing Sun, Tieyong Zeng |
2469 | Robust Reflection Removal With Reflection-Free Flash-Only Cues | Chenyang Lei, Qifeng Chen |
6551 | Multi-Stage Progressive Image Restoration | Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang, Ling Shao |
6186 | Shape From Sky: Polarimetric Normal Recovery Under the Sky | Tomoki Ichikawa, Matthew Purri, Ryo Kawahara, Shohei Nobuhara, Kristin Dana, Ko Nishino |
2168 | Cross-MPI: Cross-Scale Stereo for Image Super-Resolution Using Multiplane Images | Yuemei Zhou, Gaochang Wu, Ying Fu, Kun Li, Yebin Liu |
8247 | Deep Perceptual Preprocessing for Video Coding | Aaron Chadha, Yiannis Andreopoulos |
10973 | StyleMix: Separating Content and Style for Enhanced Data Augmentation | Minui Hong, Jinwoo Choi, Gunhee Kim |
5521 | Spoken Moments: Learning Joint Audio-Visual Representations From Video Descriptions | Mathew Monfort, SouYoung Jin, Alexander Liu, David Harwath, Rogerio Feris, James Glass, Aude Oliva |
1509 | Spatially-Adaptive Pixelwise Networks for Fast Image Translation | Tamar Rott Shaham, Michaël Gharbi, Richard Zhang, Eli Shechtman, Tomer Michaeli |
10136 | No Frame Left Behind: Full Video Action Recognition | Xin Liu, Silvia L. Pintea, Fatemeh Karimi Nejadasl, Olaf Booij, Jan C. van Gemert |
10109 | Multiresolution Knowledge Distillation for Anomaly Detection | Mohammadreza Salehi, Niousha Sadjadi, Soroosh Baselizadeh, Mohammad H. Rohban, Hamid R. Rabiee |
4970 | Convolutional Neural Network Pruning With Structural Redundancy Reduction | Zi Wang, Chengcheng Li, Xiangyang Wang |
409 | Representative Forgery Mining for Fake Face Detection | Chengrui Wang, Weihong Deng |
5419 | Neural Prototype Trees for Interpretable Fine-Grained Image Recognition | Meike Nauta, Ron van Bree, Christin Seifert |
7931 | Relevance-CAM: Your Model Already Knows Where To Look | Jeong Ryong Lee, Sewon Kim, Inyong Park, Taejoon Eo, Dosik Hwang |
329 | Adaptive Cross-Modal Prototypes for Cross-Domain Visual-Language Retrieval | Yang Liu, Qingchao Chen, Samuel Albanie |
4936 | Efficient Object Embedding for Spliced Image Retrieval | Bor-Chun Chen, Zuxuan Wu, Larry S. Davis, Ser-Nam Lim |
10581 | Generative PointNet: Deep Energy-Based Learning on Unordered Point Sets for 3D Generation, Reconstruction and Classification | Jianwen Xie, Yifei Xu, Zilong Zheng, Song-Chun Zhu, Ying Nian Wu |
2535 | Anycost GANs for Interactive Image Synthesis and Editing | Ji Lin, Richard Zhang, Frieder Ganz, Song Han, Jun-Yan Zhu |
10382 | Ensembling With Deep Generative Views | Lucy Chai, Jun-Yan Zhu, Eli Shechtman, Phillip Isola, Richard Zhang |
5457 | Continuous Face Aging via Self-Estimated Residual Age Embedding | Zeqi Li, Ruowei Jiang, Parham Aarabi |
633 | ReMix: Towards Image-to-Image Translation With Limited Data | Jie Cao, Luanxuan Hou, Ming-Hsuan Yang, Ran He, Zhenan Sun |
5888 | Unbalanced Feature Transport for Exemplar-Based Image Translation | Fangneng Zhan, Yingchen Yu, Kaiwen Cui, Gongjie Zhang, Shijian Lu, Jianxiong Pan, Changgong Zhang, Feiying Ma, Xuansong Xie, Chunyan Miao |
2759 | Pose-Guided Human Animation From a Single Image in the Wild | Jae Shin Yoon, Lingjie Liu, Vladislav Golyanik, Kripasindhu Sarkar, Hyun Soo Park, Christian Theobalt |
2063 | Context-Aware Layout to Image Generation With Enhanced Object Appearance | Sen He, Wentong Liao, Michael Ying Yang, Yongxin Yang, Yi-Zhe Song, Bodo Rosenhahn, Tao Xiang |
3670 | SetVAE: Learning Hierarchical Composition for Generative Modeling of Set-Structured Data | Jinwoo Kim, Jaehoon Yoo, Juho Lee, Seunghoon Hong |
5460 | Are Labels Always Necessary for Classifier Accuracy Evaluation? | Weijian Deng, Liang Zheng |
2103 | Graph-Based High-Order Relation Discovery for Fine-Grained Recognition | Yifan Zhao, Ke Yan, Feiyue Huang, Jia Li |
384 | Long-Tailed Multi-Label Visual Recognition by Collaborative Training on Uniform and Re-Balanced Samplings | Hao Guo, Song Wang |
10610 | SimPLE: Similar Pseudo Label Exploitation for Semi-Supervised Classification | Zijian Hu, Zhengyu Yang, Xuefeng Hu, Ram Nevatia |
6149 | Cluster-Wise Hierarchical Generative Model for Deep Amortized Clustering | Huafeng Liu, Jiaqi Wang, Liping Jing |
2852 | From Semantic Categories to Fixations: A Novel Weakly-Supervised Visual-Auditory Saliency Detection Approach | Guotao Wang, Chenglizhao Chen, Deng-Ping Fan, Aimin Hao, Hong Qin |
7968 | Gaussian Context Transformer | Dongsheng Ruan, Daiyin Wang, Yuan Zheng, Nenggan Zheng, Min Zheng |
2777 | FP-NAS: Fast Probabilistic Neural Architecture Search | Zhicheng Yan, Xiaoliang Dai, Peizhao Zhang, Yuandong Tian, Bichen Wu, Matt Feiszli |
7620 | Time Adaptive Recurrent Neural Network | Anil Kag, Venkatesh Saligrama |
3925 | Deep Lesion Tracker: Monitoring Lesions in 4D Longitudinal Imaging Studies | Jinzheng Cai, Youbao Tang, Ke Yan, Adam P. Harrison, Jing Xiao, Gigin Lin, Le Lu |
1251 | Reciprocal Landmark Detection and Tracking With Extremely Few Annotations | Jianzhe Lin, Ghazal Sahebzamani, Christina Luong, Fatemeh Taheri Dezaki, Mohammad Jafari, Purang Abolmaesumi, Teresa Tsang |
1629 | LightTrack: Finding Lightweight Neural Networks for Object Tracking via One-Shot Architecture Search | Bin Yan, Houwen Peng, Kan Wu, Dong Wang, Jianlong Fu, Huchuan Lu |
922 | TesseTrack: End-to-End Learnable Multi-Person Articulated 3D Pose Tracking | N Dinesh Reddy, Laurent Guigues, Leonid Pishchulin, Jayan Eledath, Srinivasa G. Narasimhan |
980 | Learning Optical Flow From Still Images | Filippo Aleotti, Matteo Poggi, Stefano Mattoccia |
3576 | Towards Robust Classification Model by Counterfactual and Invariant Data Generation | Chun-Hao Chang, George Alexandru Adam, Anna Goldenberg |
1205 | StablePose: Learning 6D Object Poses From Geometrically Stable Patches | Yifei Shi, Junwen Huang, Xin Xu, Yifan Zhang, Kai Xu |
6959 | The Translucent Patch: A Physical and Universal Attack on Object Detectors | Alon Zolfi, Moshe Kravchik, Yuval Elovici, Asaf Shabtai |
1908 | Dynamic Weighted Learning for Unsupervised Domain Adaptation | Ni Xiao, Lei Zhang |
3461 | DRANet: Disentangling Representation and Adaptation Networks for Unsupervised Cross-Domain Adaptation | Seunghun Lee, Sunghyun Cho, Sunghoon Im |
10659 | Natural Adversarial Examples | Dan Hendrycks, Kevin Zhao, Steven Basart, Jacob Steinhardt, Dawn Song |
10237 | Fast End-to-End Learning on Protein Surfaces | Freyr Sverrisson, Jean Feydy, Bruno E. Correia, Michael M. Bronstein |
7358 | Rectification-Based Knowledge Retention for Continual Learning | Pravendra Singh, Pratik Mazumder, Piyush Rai, Vinay P. Namboodiri |
10227 | Cross-Domain Similarity Learning for Face Recognition in Unseen Domains | Masoud Faraki, Xiang Yu, Yi-Hsuan Tsai, Yumin Suh, Manmohan Chandraker |
2866 | Sequence-to-Sequence Contrastive Learning for Text Recognition | Aviad Aberdam, Ron Litman, Shahar Tsiper, Oron Anschel, Ron Slossberg, Shai Mazor, R. Manmatha, Pietro Perona |
6254 | MOOD: Multi-Level Out-of-Distribution Detection | Ziqian Lin, Sreya Dutta Roy, Yixuan Li |
4245 | DeepVideoMVS: Multi-View Stereo on Video With Recurrent Spatio-Temporal Fusion | Arda Düzçeker, Silvano Galliani, Christoph Vogel, Pablo Speciale, Mihai Dusmanu, Marc Pollefeys |
607 | Boundary IoU: Improving Object-Centric Image Segmentation Evaluation | Bowen Cheng, Ross Girshick, Piotr Dollár, Alexander C. Berg, Alexander Kirillov |
6876 | A2-FPN: Attention Aggregation Based Feature Pyramid Network for Instance Segmentation | Miao Hu, Yali Li, Lu Fang, Shengjin Wang |
769 | SSLayout360: Semi-Supervised Indoor Layout Estimation From 360° Panorama | Phi Vu Tran |
2326 | Complete & Label: A Domain Adaptation Approach to Semantic Segmentation of LiDAR Point Clouds | Li Yi, Boqing Gong, Thomas Funkhouser |
4564 | Improved Image Matting via Real-Time User Clicks and Uncertainty Estimation | Tianyi Wei, Dongdong Chen, Wenbo Zhou, Jing Liao, Hanqing Zhao, Weiming Zhang, Nenghai Yu |
2894 | Self-Supervised Augmentation Consistency for Adapting Semantic Segmentation | Nikita Araslanov, Stefan Roth |
11486 | Few-Shot Object Detection via Classification Refinement and Distractor Retreatment | Yiting Li, Haiyue Zhu, Yu Cheng, Wenxin Wang, Chek Sing Teo, Cheng Xiang, Prahlad Vadakkepat, Tong Heng Lee |
4951 | Counterfactual Zero-Shot and Open-Set Visual Recognition | Zhongqi Yue, Tan Wang, Qianru Sun, Xian-Sheng Hua, Hanwang Zhang |
7328 | Learning Deep Latent Variable Models by Short-Run MCMC Inference With Optimal Transport Correction | Dongsheng An, Jianwen Xie, Ping Li |
5886 | Learning Normal Dynamics in Videos With Meta Prototype Network | Hui Lv, Chen Chen, Zhen Cui, Chunyan Xu, Yong Li, Jian Yang |
5092 | MotionRNN: A Flexible Model for Video Prediction With Spacetime-Varying Motions | Haixu Wu, Zhiyu Yao, Jianmin Wang, Mingsheng Long |
364 | Learning To Recommend Frame for Interactive Video Object Segmentation in the Wild | Zhaoyuan Yin, Jia Zheng, Weixin Luo, Shenhan Qian, Hanling Zhang, Shenghua Gao |
3672 | Reciprocal Transformations for Unsupervised Video Object Segmentation | Sucheng Ren, Wenxi Liu, Yongtuo Liu, Haoxin Chen, Guoqiang Han, Shengfeng He |
4241 | RSTNet: Captioning With Adaptive Attention on Visual and Non-Visual Words | Xuying Zhang, Xiaoshuai Sun, Yunpeng Luo, Jiayi Ji, Yiyi Zhou, Yongjian Wu, Feiyue Huang, Rongrong Ji |
9903 | Revamping Cross-Modal Recipe Retrieval With Hierarchical Transformers and Self-Supervised Learning | Amaia Salvador, Erhan Gundogdu, Loris Bazzani, Michael Donoser |
845 | Visually Informed Binaural Audio Generation Without Binaural Audios | Xudong Xu, Hang Zhou, Ziwei Liu, Bo Dai, Xiaogang Wang, Dahua Lin |
1298 | VisualVoice: Audio-Visual Speech Separation With Cross-Modal Consistency | Ruohan Gao, Kristen Grauman |
5128 | Encoder Fusion Network With Co-Attention Embedding for Referring Image Segmentation | Guang Feng, Zhiwei Hu, Lihe Zhang, Huchuan Lu |
9940 | Semantic Audio-Visual Navigation | Changan Chen, Ziad Al-Halah, Kristen Grauman |
8058 | Bridge To Answer: Structure-Aware Graph Interaction Network for Video Question Answering | Jungin Park, Jiyoung Lee, Kwanghoon Sohn |
4588 | Projecting Your View Attentively: Monocular Road Scene Layout Estimation via Cross-View Transformation | Weixiang Yang, Qi Li, Wenxi Liu, Yuanlong Yu, Yuexin Ma, Shengfeng He, Jia Pan |
6278 | Toward Accurate and Realistic Outfits Visualization With Attention to Details | Kedan Li, Min Jin Chong, Jeffrey Zhang, Jingen Liu |
7055 | Interpretable Social Anchors for Human Trajectory Forecasting in Crowds | Parth Kothari, Brian Sifringer, Alexandre Alahi |
Date: Friday, June 25, 2021 6:00 – 8:30
Paper Session Twelve:
Paper ID | Paper Title | Authors |
4741 | Deep Polarization Imaging for 3D Shape and SVBRDF Acquisition | Valentin Deschaintre, Yiming Lin, Abhijeet Ghosh |
7791 | Self-Point-Flow: Self-Supervised Scene Flow Estimation From Point Clouds With Optimal Transport and Random Walk | Ruibo Li, Guosheng Lin, Lihua Xie |
1195 | Exploring Data-Efficient 3D Scene Understanding With Contrastive Scene Contexts | Ji Hou, Benjamin Graham, Matthias Nießner, Saining Xie |
3433 | NeuralRecon: Real-Time Coherent 3D Reconstruction From Monocular Video | Jiaming Sun, Yiming Xie, Linghao Chen, Xiaowei Zhou, Hujun Bao |
9966 | FrameExit: Conditional Early Exiting for Efficient Video Recognition | Amir Ghodrati, Babak Ehteshami Bejnordi, Amirhossein Habibian |
5180 | Inverting Generative Adversarial Renderer for Face Reconstruction | Jingtan Piao, Keqiang Sun, Quan Wang, Kwan-Yee Lin, Hongsheng Li |
4453 | Spherical Confidence Learning for Face Recognition | Shen Li, Jianqing Xu, Xiaqing Xu, Pengcheng Shen, Shaoxin Li, Bryan Hooi |
8049 | Event-Based Bispectral Photometry Using Temporally Modulated Illumination | Tsuyoshi Takatani, Yuzuha Ito, Ayaka Ebisu, Yinqiang Zheng, Takahito Aoto |
1565 | AdderSR: Towards Energy Efficient Image Super-Resolution | Dehua Song, Yunhe Wang, Hanting Chen, Chang Xu, Chunjing Xu, Dacheng Tao |
1442 | Diversifying Sample Generation for Accurate Data-Free Quantization | Xiangguo Zhang, Haotong Qin, Yifu Ding, Ruihao Gong, Qinghua Yan, Renshuai Tao, Yuhang Li, Fengwei Yu, Xianglong Liu |
2057 | How Privacy-Preserving Are Line Clouds? Recovering Scene Details From 3D Lines | Kunal Chelani, Fredrik Kahl, Torsten Sattler |
10978 | Unsupervised Learning of Depth and Depth-of-Field Effect From Natural Images With Aperture Rendering Generative Adversarial Networks | Takuhiro Kaneko |
7633 | Stylized Neural Painting | Zhengxia Zou, Tianyang Shi, Shuang Qiu, Yi Yuan, Zhenwei Shi |
4812 | Permute, Quantize, and Fine-Tune: Efficient Compression of Neural Networks | Julieta Martinez, Jashan Shewakramani, Ting Wei Liu, Ioan Andrei Bârsan, Wenyuan Zeng, Raquel Urtasun |
3053 | Amalgamating Knowledge From Heterogeneous Graph Neural Networks | Yongcheng Jing, Yiding Yang, Xinchao Wang, Mingli Song, Dacheng Tao |
4015 | XProtoNet: Diagnosis in Chest Radiography With Global and Local Explanations | Eunji Kim, Siwon Kim, Minji Seo, Sungroh Yoon |
2782 | LQF: Linear Quadratic Fine-Tuning | Alessandro Achille, Aditya Golatkar, Avinash Ravichandran, Marzia Polito, Stefano Soatto |
1934 | DECOR-GAN: 3D Shape Detailization by Conditional Refinement | Zhiqin Chen, Vladimir G. Kim, Matthew Fisher, Noam Aigerman, Hao Zhang, Siddhartha Chaudhuri |
4418 | Exploring Simple Siamese Representation Learning | Xinlei Chen, Kaiming He |
2219 | Camera Pose Matters: Improving Depth Prediction by Mitigating Pose Distribution Bias | Yunhan Zhao, Shu Kong, Charless Fowlkes |
2080 | DANNet: A One-Stage Domain Adaptation Network for Unsupervised Nighttime Semantic Segmentation | Xinyi Wu, Zhenyao Wu, Hao Guo, Lili Ju, Song Wang |
4113 | OTCE: A Transferability Metric for Cross-Domain Cross-Task Representations | Yang Tan, Yang Li, Shao-Lun Huang |
524 | Learning the Best Pooling Strategy for Visual Semantic Embedding | Jiacheng Chen, Hexiang Hu, Hao Wu, Yuning Jiang, Changhu Wang |
10362 | Divide-and-Conquer for Lane-Aware Diverse Trajectory Prediction | Sriram Narayanan, Ramin Moslemi, Francesco Pittaluga, Buyu Liu, Manmohan Chandraker |
4722 | Magic Layouts: Structural Prior for Component Detection in User Interface Designs | Dipu Manandhar, Hailin Jin, John Collomosse |
368 | Dense Label Encoding for Boundary Discontinuity Free Rotation Detection | Xue Yang, Liping Hou, Yue Zhou, Wentao Wang, Junchi Yan |
1273 | MetaHTR: Towards Writer-Adaptive Handwritten Text Recognition | Ayan Kumar Bhunia, Shuvozit Ghose, Amandeep Kumar, Pinaki Nath Chowdhury, Aneeshan Sain, Yi-Zhe Song |
6128 | PSRR-MaxpoolNMS: Pyramid Shifted MaxpoolNMS With Relationship Recovery | Tianyi Zhang, Jie Lin, Peng Hu, Bin Zhao, Mohamed M. Sabry Aly |
5013 | End-to-End Object Detection With Fully Convolutional Network | Jianfeng Wang, Lin Song, Zeming Li, Hongbin Sun, Jian Sun, Nanning Zheng |
5688 | PointDSC: Robust Point Cloud Registration Using Deep Spatial Consistency | Xuyang Bai, Zixin Luo, Lei Zhou, Hongkai Chen, Lei Li, Zeyu Hu, Hongbo Fu, Chiew-Lan Tai |
3328 | Wide-Depth-Range 6D Object Pose Estimation in Space | Yinlin Hu, Sébastien Speierer, Wenzel Jakob, Pascal Fua, Mathieu Salzmann |
8649 | Fostering Generalization in Single-View 3D Reconstruction by Learning a Hierarchy of Local and Global Shape Priors | Jan Bechtold, Maxim Tatarchenko, Volker Fischer, Thomas Brox |
1024 | View-Guided Point Cloud Completion | Xuancheng Zhang, Yutong Feng, Siqi Li, Changqing Zou, Hai Wan, Xibin Zhao, Yandong Guo, Yue Gao |
3200 | Regularization Strategy for Point Cloud via Rigidly Mixed Sample | Dogyoon Lee, Jaeha Lee, Junhyeop Lee, Hyeongmin Lee, Minhyeok Lee, Sungmin Woo, Sangyoun Lee |
6624 | PWCLO-Net: Deep LiDAR Odometry in 3D Point Clouds Using Hierarchical Embedding Mask Optimization | Guangming Wang, Xinrui Wu, Zhe Liu, Hesheng Wang |
5368 | Co-Attention for Conditioned Image Matching | Olivia Wiles, Sébastien Ehrhardt, Andrew Zisserman |
6480 | A Dual Iterative Refinement Method for Non-Rigid Shape Matching | Rui Xiang, Rongjie Lai, Hongkai Zhao |
3192 | NeRD: Neural 3D Reflection Symmetry Detector | Yichao Zhou, Shichen Liu, Yi Ma |
2612 | Deep Lucas-Kanade Homography for Multimodal Image Alignment | Yiming Zhao, Xinming Huang, Ziming Zhang |
1822 | DeepI2P: Image-to-Point Cloud Registration via Deep Classification | Jiaxin Li, Gim Hee Lee |
1997 | NormalFusion: Real-Time Acquisition of Surface Normals for High-Resolution RGB-D Scanning | Hyunho Ha, Joo Ho Lee, Andreas Meuleman, Min H. Kim |
5496 | LASR: Learning Articulated Shape Reconstruction From a Monocular Video | Gengshan Yang, Deqing Sun, Varun Jampani, Daniel Vlasic, Forrester Cole, Huiwen Chang, Deva Ramanan, William T. Freeman, Ce Liu |
10153 | Mirror3D: Depth Refinement for Mirror Surfaces | Jiaqi Tan, Weijie Lin, Angel X. Chang, Manolis Savva |
10259 | To the Point: Efficient 3D Object Detection in the Range Image With Graph Convolution Kernels | Yuning Chai, Pei Sun, Jiquan Ngiam, Weiyue Wang, Benjamin Caine, Vijay Vasudevan, Xiao Zhang, Dragomir Anguelov |
686 | CoLA: Weakly-Supervised Temporal Action Localization With Snippet Contrastive Learning | Can Zhang, Meng Cao, Dongming Yang, Jie Chen, Yuexian Zou |
5629 | MoViNets: Mobile Video Networks for Efficient Video Recognition | Dan Kondratyuk, Liangzhe Yuan, Yandong Li, Li Zhang, Mingxing Tan, Matthew Brown, Boqing Gong |
534 | Few-Shot Transformation of Common Actions Into Time and Space | Pengwan Yang, Pascal Mettes, Cees G. M. Snoek |
7883 | Learning Asynchronous and Sparse Human-Object Interaction in Videos | Romero Morais, Vuong Le, Svetha Venkatesh, Truyen Tran |
6251 | Adversarial Imaging Pipelines | Buu Phan, Fahim Mannan, Felix Heide |
6511 | Adversarial Laser Beam: Effective Physical-World Attack to DNNs in a Blink | Ranjie Duan, Xiaofeng Mao, A. K. Qin, Yuefeng Chen, Shaokai Ye, Yuan He, Yun Yang |
5887 | ViPNAS: Efficient Video Pose Estimation via Neural Architecture Search | Lumin Xu, Yingda Guan, Sheng Jin, Wentao Liu, Chen Qian, Ping Luo, Wanli Ouyang, Xiaogang Wang |
428 | SCALE: Modeling Clothed Humans with a Surface Codec of Articulated Local Elements | Qianli Ma, Shunsuke Saito, Jinlong Yang, Siyu Tang, Michael J. Black |
6228 | Probabilistic 3D Human Shape and Pose Estimation From Multiple Unconstrained Images in the Wild | Akash Sengupta, Ignas Budvytis, Roberto Cipolla |
8832 | Graph Stacked Hourglass Networks for 3D Human Pose Estimation | Tianhan Xu, Wataru Takano |
6885 | High-Fidelity and Arbitrary Face Editing | Yue Gao, Fangyun Wei, Jianmin Bao, Shuyang Gu, Dong Chen, Fang Wen, Zhouhui Lian |
4816 | IronMask: Modular Architecture for Protecting Deep Face Template | Sunpill Kim, Yunseong Jeong, Jinsu Kim, Jungkon Kim, Hyung Tae Lee, Jae Hong Seo |
1198 | Meta-Mining Discriminative Samples for Kinship Verification | Wanhua Li, Shiwei Wang, Jiwen Lu, Jianjiang Feng, Jie Zhou |
5182 | Generalizable Person Re-Identification With Relevance-Aware Mixture of Experts | Yongxing Dai, Xiaotong Li, Jun Liu, Zekun Tong, Ling-Yu Duan |
3301 | Time Lens: Event-Based Video Frame Interpolation | Stepan Tulyakov, Daniel Gehrig, Stamatios Georgoulis, Julius Erbach, Mathias Gehrig, Yuanyou Li, Davide Scaramuzza |
6839 | How To Exploit the Transferability of Learned Image Compression to Conventional Codecs | Jan P. Klopp, Keng-Chi Liu, Liang-Gee Chen, Shao-Yi Chien |
3377 | Pseudo 3D Auto-Correlation Network for Real Image Denoising | Xiaowan Hu, Ruijun Ma, Zhihong Liu, Yuanhao Cai, Xiaole Zhao, Yulun Zhang, Haoqian Wang |
2207 | Ultra-High-Definition Image Dehazing via Multi-Guided Bilateral Learning | Zhuoran Zheng, Wenqi Ren, Xiaochun Cao, Xiaobin Hu, Tao Wang, Fenglong Song, Xiuyi Jia |
4502 | HLA-Face: Joint High-Low Adaptation for Low Light Face Detection | Wenjing Wang, Wenhan Yang, Jiaying Liu |
3345 | Zero-Shot Single Image Restoration Through Controlled Perturbation of Koschmieder’s Model | Aupendu Kar, Sobhan Kanti Dhara, Debashis Sen, Prabir Kumar Biswas |
6576 | Deep Gaussian Scale Mixture Prior for Spectral Compressive Imaging | Tao Huang, Weisheng Dong, Xin Yuan, Jinjian Wu, Guangming Shi |
6893 | Multi-View 3D Reconstruction of a Texture-Less Smooth Surface of Unknown Generic Reflectance | Ziang Cheng, Hongdong Li, Yuta Asano, Yinqiang Zheng, Imari Sato |
2382 | Tackling the Ill-Posedness of Super-Resolution Through Adaptive Target Generation | Younghyun Jo, Seoung Wug Oh, Peter Vajda, Seon Joo Kim |
8381 | Memory-Efficient Network for Large-Scale Video Compressive Sensing | Ziheng Cheng, Bo Chen, Guanliang Liu, Hao Zhang, Ruiying Lu, Zhengjue Wang, Xin Yuan |
6387 | Troubleshooting Blind Image Quality Models in the Wild | Zhihua Wang, Haotao Wang, Tianlong Chen, Zhangyang Wang, Kede Ma |
6737 | UAV-Human: A Large Benchmark for Human Behavior Understanding With Unmanned Aerial Vehicles | Tianjiao Li, Jun Liu, Wei Zhang, Yun Ni, Wenqian Wang, Zhiheng Li |
4956 | FBNetV3: Joint Architecture-Recipe Search Using Predictor Pretraining | Xiaoliang Dai, Alvin Wan, Peizhao Zhang, Bichen Wu, Zijian He, Zhen Wei, Kan Chen, Yuandong Tian, Matthew Yu, Peter Vajda, Joseph E. Gonzalez |
10777 | Domain-Independent Dominance of Adaptive Methods | Pedro Savarese, David McAllester, Sudarshan Babu, Michael Maire |
10552 | Wasserstein Contrastive Representation Distillation | Liqun Chen, Dong Wang, Zhe Gan, Jingjing Liu, Ricardo Henao, Lawrence Carin |
6316 | The Lottery Tickets Hypothesis for Supervised and Self-Supervised Pre-Training in Computer Vision Models | Tianlong Chen, Jonathan Frankle, Shiyu Chang, Sijia Liu, Yang Zhang, Michael Carbin, Zhangyang Wang |
3034 | Generalizing Face Forgery Detection With High-Frequency Features | Yuchen Luo, Yong Zhang, Junchi Yan, Wei Liu |
5703 | CAMERAS: Enhanced Resolution and Sanity Preserving Class Activation Mapping for Image Saliency | Mohammad A. A. K. Jalwana, Naveed Akhtar, Mohammed Bennamoun, Ajmal Mian |
1591 | See Through Gradients: Image Batch Recovery via GradInversion | Hongxu Yin, Arun Mallya, Arash Vahdat, Jose M. Alvarez, Jan Kautz, Pavlo Molchanov |
2586 | Seeking the Shape of Sound: An Adaptive Framework for Learning Voice-Face Association | Peisong Wen, Qianqian Xu, Yangbangyan Jiang, Zhiyong Yang, Yuan He, Qingming Huang |
5248 | Prototype-Supervised Adversarial Network for Targeted Attack of Deep Hashing | Xunguang Wang, Zheng Zhang, Baoyuan Wu, Fumin Shen, Guangming Lu |
6256 | Intrinsic Image Harmonization | Zonghui Guo, Haiyong Zheng, Yufeng Jiang, Zhaorui Gu, Bing Zheng |
4968 | DivCo: Diverse Conditional Image Synthesis via Contrastive Generative Adversarial Network | Rui Liu, Yixiao Ge, Ching Lam Choi, Xiaogang Wang, Hongsheng Li |
2074 | Combining Semantic Guidance and Deep Reinforcement Learning for Generating Human Level Paintings | Jaskirat Singh, Liang Zheng |
6376 | No Shadow Left Behind: Removing Objects and Their Shadows Using Approximate Lighting and Geometry | Edward Zhang, Ricardo Martin-Brualla, Janne Kontkanen, Brian L. Curless |
1129 | The Spatially-Correlative Loss for Various Image Translation Tasks | Chuanxia Zheng, Tat-Jen Cham, Jianfei Cai |
6351 | Unpaired Image-to-Image Translation via Latent Energy Transport | Yang Zhao, Changyou Chen |
2859 | Model-Aware Gesture-to-Gesture Translation | Hezhen Hu, Weilun Wang, Wengang Zhou, Weichao Zhao, Houqiang Li |
3833 | Layout-Guided Novel View Synthesis From a Single Indoor Panorama | Jiale Xu, Jia Zheng, Yanyu Xu, Rui Tang, Shenghua Gao |
2122 | Progressive Temporal Feature Alignment Network for Video Inpainting | Xueyan Zou, Linjie Yang, Ding Liu, Yong Jae Lee |
8234 | On Robustness and Transferability of Convolutional Neural Networks | Josip Djolonga, Jessica Yung, Michael Tschannen, Rob Romijnders, Lucas Beyer, Alexander Kolesnikov, Joan Puigcerver, Matthias Minderer, Alexander D'Amour, Dan Moldovan, Sylvain Gelly, Neil Houlsby, Xiaohua Zhai, Mario Lucic |
6331 | Boosting Ensemble Accuracy by Revisiting Ensemble Diversity Metrics | Yanzhao Wu, Ling Liu, Zhongwei Xie, Ka-Ho Chow, Wenqi Wei |
4378 | General Multi-Label Image Classification With Transformers | Jack Lanchantin, Tianlu Wang, Vicente Ordonez, Yanjun Qi |
741 | Improving Calibration for Long-Tailed Recognition | Zhisheng Zhong, Jiequan Cui, Shu Liu, Jiaya Jia |
937 | Hardness Sampling for Self-Training Based Transductive Zero-Shot Learning | Liu Bo, Qiulei Dong, Zhanyi Hu |
4152 | Saliency-Guided Image Translation | Lai Jiang, Mai Xu, Xiaofei Wang, Leonid Sigal |
10487 | Bottleneck Transformers for Visual Recognition | Aravind Srinivas, Tsung-Yi Lin, Niki Parmar, Jonathon Shlens, Pieter Abbeel, Ashish Vaswani |
3193 | One-Shot Neural Ensemble Architecture Search by Diversity-Guided Search Space Shrinking | Minghao Chen, Jianlong Fu, Haibin Ling |
7842 | Multi-Perspective LSTM for Joint Visual Representation Learning | Alireza Sepas-Moghaddam, Fernando Pereira, Paulo Lobato Correia, Ali Etemad |
5337 | Multiple Instance Captioning: Learning Representations From Histopathology Textbooks and Articles | Jevgenij Gamper, Nasir Rajpoot |
2257 | clDice – A Novel Topology-Preserving Loss Function for Tubular Structure Segmentation | Suprosanna Shit, Johannes C. Paetzold, Anjany Sekuboyina, Ivan Ezhov, Alexander Unger, Andrey Zhylka, Josien P. W. Pluim, Ulrich Bauer, Bjoern H. Menze |
2146 | Learning To Fuse Asymmetric Feature Maps in Siamese Trackers | Wencheng Han, Xingping Dong, Fahad Shahbaz Khan, Ling Shao, Jianbing Shen |
7264 | Track, Check, Repeat: An EM Approach to Unsupervised Tracking | Adam W. Harley, Yiming Zuo, Jing Wen, Ayush Mangal, Shubhankar Potdar, Ritwick Chaudhry, Katerina Fragkiadaki |
3221 | Learning Optical Flow From a Few Matches | Shihao Jiang, Yao Lu, Hongdong Li, Richard Hartley |
5036 | AutoDO: Robust AutoAugment for Biased Data With Label Noise via Scalable Probabilistic Implicit Differentiation | Denis Gudovskiy, Luca Rigazio, Shun Ishizaka, Kazuki Kozuka, Sotaro Tsukizawa |
2415 | GDR-Net: Geometry-Guided Direct Regression Network for Monocular 6D Object Pose Estimation | Gu Wang, Fabian Manhardt, Federico Tombari, Xiangyang Ji |
10200 | Robust and Accurate Object Detection via Adversarial Learning | Xiangning Chen, Cihang Xie, Mingxing Tan, Li Zhang, Cho-Jui Hsieh, Boqing Gong |
2228 | Domain Adaptation With Auxiliary Target Domain-Oriented Classifier | Jian Liang, Dapeng Hu, Jiashi Feng |
3555 | MetaAlign: Coordinating Domain Alignment and Classification for Unsupervised Domain Adaptation | Guoqiang Wei, Cuiling Lan, Wenjun Zeng, Zhibo Chen |
11107 | Partial Feature Selection and Alignment for Multi-Source Domain Adaptation | Yangye Fu, Ming Zhang, Xing Xu, Zuo Cao, Chao Ma, Yanli Ji, Kai Zuo, Huimin Lu |
10645 | MongeNet: Efficient Sampler for Geometric Deep Learning | Léo Lebrat, Rodrigo Santa Cruz, Clinton Fookes, Olivier Salvado |
7483 | Continual Learning via Bit-Level Information Preserving | Yujun Shi, Li Yuan, Yunpeng Chen, Jiashi Feng |
1480 | Believe It or Not: Counterfactual Bias Manipulation in Visual Question Answering | Muli Yang, Cheng Deng, Junchi Yan |
4765 | Propagate Yourself: Exploring Pixel-Level Consistency for Unsupervised Visual Representation Learning | Zhenda Xie, Yutong Lin, Zheng Zhang, Yue Cao, Stephen Lin, Han Hu |
289 | Toward Joint Thing-and-Stuff Mining for Weakly Supervised Panoptic Segmentation | Yunhang Shen, Liujuan Cao, Zhiwei Chen, Feihong Lian, Baochang Zhang, Chi Su, Yongjian Wu, Feiyue Huang, Rongrong Ji |
4720 | S3: Learnable Sparse Signal Superdensity for Guided Depth Estimation | Yu-Kai Huang, Yueh-Cheng Liu, Tsung-Han Wu, Hung-Ting Su, Yu-Cheng Chang, Tsung-Lin Tsou, Yu-An Wang, Winston H. Hsu |
1394 | Residential Floor Plan Recognition and Reconstruction | Xiaolei Lv, Shengchu Zhao, Xinyang Yu, Binqiang Zhao |
8148 | ColorRL: Reinforced Coloring for End-to-End Instance Segmentation | Tran Anh Tuan, Nguyen Tuan Khoa, Tran Minh Quan, Won-Ki Jeong |
4585 | Harmonious Semantic Line Detection via Maximal Weight Clique Selection | Dongkwon Jin, Wonhui Park, Seong-Gyun Jeong, Chang-Su Kim |
2403 | LPSNet: A Lightweight Solution for Fast Panoptic Segmentation | Weixiang Hong, Qingpei Guo, Wei Zhang, Jingdong Chen, Wei Chu |
5077 | Progressive Semantic Segmentation | Chuong Huynh, Anh Tuan Tran, Khoa Luu, Minh Hoai |
3554 | Embedded Discriminative Attention Mechanism for Weakly Supervised Semantic Segmentation | Tong Wu, Junshi Huang, Guangyu Gao, Xiaoming Wei, Xiaolin Wei, Xuan Luo, Chi Harold Liu |
1504 | SelfSAGCN: Self-Supervised Semantic Alignment for Graph Convolution Network | Xu Yang, Cheng Deng, Zhiyuan Dang, Kun Wei, Junchi Yan |
7255 | Wasserstein Barycenter for Multi-Source Domain Adaptation | Eduardo Fernandes Montesuma, Fred Maurice Ngolè Mboula |
7574 | PiCIE: Unsupervised Semantic Segmentation Using Invariance and Equivariance in Clustering | Jang Hyun Cho, Utkarsh Mall, Kavita Bala, Bharath Hariharan |
1360 | Global2Local: Efficient Structure Search for Video Action Segmentation | Shang-Hua Gao, Qi Han, Zhong-Yu Li, Pai Peng, Liang Wang, Ming-Ming Cheng |
6468 | Introvert: Human Trajectory Prediction via Conditional 3D Attention | Nasim Shafiee, Taskin Padir, Ehsan Elhamifar |
766 | Weakly Supervised Video Salient Object Detection | Wangbo Zhao, Jing Zhang, Long Li, Nick Barnes, Nian Liu, Junwei Han |
3801 | Video Object Segmentation Using Global and Instance Embedding Learning | Wenbin Ge, Xiankai Lu, Jianbing Shen |
6940 | Human-Like Controllable Image Captioning With Verb-Specific Semantic Roles | Long Chen, Zhihong Jiang, Jun Xiao, Wei Liu |
1204 | Read and Attend: Temporal Localisation in Sign Language Videos | Gül Varol, Liliane Momeni, Samuel Albanie, Triantafyllos Afouras, Andrew Zisserman |
1283 | Localizing Visual Sounds the Hard Way | Honglie Chen, Weidi Xie, Triantafyllos Afouras, Arsha Nagrani, Andrea Vedaldi, Andrew Zisserman |
4377 | Look Before You Speak: Visually Contextualized Utterances | Paul Hongsuck Seo, Arsha Nagrani, Cordelia Schmid |
5906 | Look Before You Leap: Learning Landmark Features for One-Stage Visual Grounding | Binbin Huang, Dongze Lian, Weixin Luo, Shenghua Gao |
11061 | Visual Navigation With Spatial Attention | Bar Mayo, Tamir Hazan, Ayellet Tal |
10634 | Perception Matters: Detecting Perception Failures of VQA Models Using Metamorphic Testing | Yuanyuan Yuan, Shuai Wang, Mingyue Jiang, Tsong Yueh Chen |
5224 | Pixel-Wise Anomaly Detection in Complex Driving Scenes | Giancarlo Di Biase, Hermann Blum, Roland Siegwart, Cesar Cadena |
6703 | Disentangled Cycle Consistency for Highly-Realistic Virtual Try-On | Chongjian Ge, Yibing Song, Yuying Ge, Han Yang, Wei Liu, Ping Luo |
8750 | Hyperdimensional Computing as a Framework for Systematic Aggregation of Image Descriptors | Peer Neubert, Stefan Schubert |