MAIN CONFERENCE
All papers will be presented in the same manner. Each paper will have a five minute pre-recorded video and a PDF of the poster. An asynchronous text chat will be available for each paper. Attendees can view the papers and videos on demand at any time. Authors will also have individual Q&A sessions at the posted times below.
All posted times are EDT but the chart linked below has all time zones’ conversions. When the virtual site is up, you will be able to select which sessions you are interested in and it will populate your own schedule.
Presentation Schedule
-
All times are Eastern Daylight Time
Date: Monday, June 21, 2021 11:00 – 13:30
Paper Session One:
Paper ID | Paper Title | Authors |
2364 | Single-Stage Instance Shadow Detection With Bidirectional Relation Learning | Tianyu Wang, Xiaowei Hu, Chi-Wing Fu, Pheng-Ann Heng |
5745 | DeepMetaHandles: Learning Deformation Meta-Handles of 3D Meshes With Biharmonic Coordinates | Minghua Liu, Minhyuk Sung, Radomir Mech, Hao Su |
9986 | Learning Delaunay Surface Elements for Mesh Reconstruction | Marie-Julie Rakotosaona, Paul Guerrero, Noam Aigerman, Niloy J. Mitra, Maks Ovsjanikov |
1678 | Fusing the Old with the New: Learning Relative Camera Pose with Geometry-Guided Uncertainty | Bingbing Zhuang, Manmohan Chandraker |
3609 | Skeleton Merger: An Unsupervised Aligned Keypoint Detector | Ruoxi Shi, Zhengrong Xue, Yang You, Cewu Lu |
711 | Uncertainty Guided Collaborative Training for Weakly Supervised Temporal Action Detection | Wenfei Yang, Tianzhu Zhang, Xiaoyuan Yu, Tian Qi, Yongdong Zhang, Feng Wu |
7701 | Pixel Codec Avatars | Shugao Ma, Tomas Simon, Jason Saragih, Dawei Wang, Yuecheng Li, Fernando De la Torre, Yaser Sheikh |
8522 | HOTR: End-to-End Human-Object Interaction Detection With Transformers | Bumsoo Kim, Junhyun Lee, Jaewoo Kang, Eun-Sol Kim, Hyunwoo J. Kim |
8686 | Tuning IR-Cut Filter for Illumination-Aware Spectral Reconstruction From RGB | Bo Sun, Junchi Yan, Xiao Zhou, Yinqiang Zheng |
7447 | DeFlow: Learning Complex Image Degradations From Unpaired Data With Conditional Flows | Valentin Wolf, Andreas Lugmayr, Martin Danelljan, Luc Van Gool, Radu Timofte |
2372 | AQD: Towards Accurate Quantized Object Detection | Peng Chen, Jing Liu, Bohan Zhuang, Mingkui Tan, Chunhua Shen |
3665 | Privacy-Preserving Collaborative Learning With Automatic Transformation Search | Wei Gao, Shangwei Guo, Tianwei Zhang, Han Qiu, Yonggang Wen, Yang Liu |
1300 | Rethinking and Improving the Robustness of Image Style Transfer | Pei Wang, Yijun Li, Nuno Vasconcelos |
11563 | Style-Aware Normalized Loss for Improving Arbitrary Style Transfer | Jiaxin Cheng, Ayush Jaiswal, Yue Wu, Pradeep Natarajan, Prem Natarajan |
2453 | Faster Meta Update Strategy for Noise-Robust Deep Learning | Youjiang Xu, Linchao Zhu, Lu Jiang, Yi Yang |
3302 | A Hyperbolic-to-Hyperbolic Graph Convolutional Network | Jindou Dai, Yuwei Wu, Zhi Gao, Yunde Jia |
1542 | Quasi-Dense Similarity Learning for Multiple Object Tracking | Jiangmiao Pang, Linlu Qiu, Xia Li, Haofeng Chen, Qi Li, Trevor Darrell, Fisher Yu |
7579 | MetricOpt: Learning To Optimize Black-Box Evaluation Metrics | Chen Huang, Shuangfei Zhai, Pengsheng Guo, Josh Susskind |
656 | Training Networks in Null Space of Feature Covariance for Continual Learning | Shipeng Wang, Xiaorong Li, Jian Sun, Zongben Xu |
5080 | Exponential Moving Average Normalization for Self-Supervised and Semi-Supervised Learning | Zhaowei Cai, Avinash Ravichandran, Subhransu Maji, Charless Fowlkes, Zhuowen Tu, Stefano Soatto |
3235 | Learning To Recover 3D Scene Shape From a Single Image | Wei Yin, Jianming Zhang, Oliver Wang, Simon Niklaus, Long Mai, Simon Chen, Chunhua Shen |
3790 | Fully Convolutional Networks for Panoptic Segmentation | Yanwei Li, Hengshuang Zhao, Xiaojuan Qi, Liwei Wang, Zeming Li, Jian Sun, Jiaya Jia |
8040 | Progressive Domain Expansion Network for Single Domain Generalization | Lei Li, Ke Gao, Juan Cao, Ziyao Huang, Yepeng Weng, Xiaoyue Mi, Zhengze Yu, Xiaoya Li, Boyang Xia |
3550 | Sketch, Ground, and Refine: Top-Down Dense Video Captioning | Chaorui Deng, Shizhe Chen, Da Chen, Yuan He, Qi Wu |
4325 | Shared Cross-Modal Trajectory Prediction for Autonomous Driving | Chiho Choi, Joon Hee Choi, Jiachen Li, Srikanth Malla |
2180 | Glancing at the Patch: Anomaly Localization With Global and Local Feature Comparison | Shenzhi Wang, Liwei Wu, Lei Cui, Yujun Shen |
6945 | RankDetNet: Delving Into Ranking Constraints for Object Detection | Ji Liu, Dong Li, Rongzhang Zheng, Lu Tian, Yi Shan |
2208 | GAIA: A Transfer Learning System of Object Detection That Fits Your Needs | Xingyuan Bu, Junran Peng, Junjie Yan, Tieniu Tan, Zhaoxiang Zhang |
1958 | Primitive Representation Learning for Scene Text Recognition | Ruijie Yan, Liangrui Peng, Shanyu Xiao, Gang Yao |
7161 | Keep Your Eyes on the Lane: Real-Time Attention-Guided Lane Detection | Lucas Tabelini, Rodrigo Berriel, Thiago M. Paixão, Claudine Badue, Alberto F. De Souza, Thiago Oliveira-Santos |
5021 | OTA: Optimal Transport Assignment for Object Detection | Zheng Ge, Songtao Liu, Zeming Li, Osamu Yoshie, Jian Sun |
5926 | StickyPillars: Robust and Efficient Feature Matching on Point Clouds Using Graph Neural Networks | Kai Fischer, Martin Simon, Florian Ölsner, Stefan Milz, Horst-Michael Groß, Patrick Mäder |
4293 | Semantic Scene Completion via Integrating Instances and Scene In-the-Loop | Yingjie Cai, Xuesong Chen, Chao Zhang, Kwan-Yee Lin, Xiaogang Wang, Hongsheng Li |
10588 | Holistic 3D Human and Scene Mesh Estimation From Single View Images | Zhenzhen Weng, Serena Yeung |
1120 | Point Cloud Upsampling via Disentangled Refinement | Ruihui Li, Xianzhi Li, Pheng-Ann Heng, Chi-Wing Fu |
3453 | DyCo3D: Robust Instance Segmentation of 3D Point Clouds Through Dynamic Convolution | Tong He, Chunhua Shen, Anton van den Hengel |
6746 | HCRF-Flow: Scene Flow From Point Clouds With Continuous High-Order CRFs and Position-Aware Flow Embedding | Ruibo Li, Guosheng Lin, Tong He, Fayao Liu, Chunhua Shen |
5444 | Iso-Points: Optimizing Neural Implicit Surfaces With Hybrid Representations | Wang Yifan, Shihao Wu, Cengiz Öztireli, Olga Sorkine-Hornung |
10374 | Fast Sinkhorn Filters: Using Matrix Scaling for Non-Rigid Shape Correspondence With Functional Maps | Gautam Pai, Jing Ren, Simone Melzi, Peter Wonka, Maks Ovsjanikov |
5280 | Globally Optimal Relative Pose Estimation With Gravity Prior | Yaqing Ding, Daniel Barath, Jian Yang, Hui Kong, Zuzana Kukelova |
555 | Discovering Relationships Between Object Categories via Universal Canonical Maps | Natalia Neverova, Artsiom Sanakoyeu, Patrick Labatut, David Novotny, Andrea Vedaldi |
2202 | Neural Reprojection Error: Merging Feature Learning and Camera Pose Estimation | Hugo Germain, Vincent Lepetit, Guillaume Bourmaud |
2029 | Rotation-Only Bundle Adjustment | Seong Hun Lee, Javier Civera |
10524 | PAUL: Procrustean Autoencoder for Unsupervised Lifting | Chaoyang Wang, Simon Lucey |
2179 | Robust Multimodal Vehicle Detection in Foggy Weather Using Complementary Lidar and Radar Signals | Kun Qian, Shilin Zhu, Xinyu Zhang, Li Erran Li |
943 | Depth-Conditioned Dynamic Message Propagation for Monocular 3D Object Detection | Li Wang, Liang Du, Xiaoqing Ye, Yanwei Fu, Guodong Guo, Xiangyang Xue, Jianfeng Feng, Li Zhang |
1583 | Actor-Context-Actor Relation Network for Spatio-Temporal Action Localization | Junting Pan, Siyu Chen, Mike Zheng Shou, Yu Liu, Jing Shao, Hongsheng Li |
6171 | Temporal-Relational CrossTransformers for Few-Shot Action Recognition | Toby Perrett, Alessandro Masullo, Tilo Burghardt, Majid Mirmehdi, Dima Damen |
708 | Temporal Context Aggregation Network for Temporal Action Proposal Refinement | Zhiwu Qing, Haisheng Su, Weihao Gan, Dongliang Wang, Wei Wu, Xiang Wang, Yu Qiao, Junjie Yan, Changxin Gao, Nong Sang |
8560 | Affordance Transfer Learning for Human-Object Interaction Detection | Zhi Hou, Baosheng Yu, Yu Qiao, Xiaojiang Peng, Dacheng Tao |
6792 | Achieving Robustness in Classification Using Optimal Transport With Hinge Regularization | Mathieu Serrurier, Franck Mamalet, Alberto González-Sanz, Thibaut Boissin, Jean-Michel Loubes, Eustasio del Barrio |
7308 | Over-the-Air Adversarial Flickering Attacks Against Video Recognition Networks | Roi Pony, Itay Naeh, Shie Mannor |
6101 | Deep Dual Consecutive Network for Human Pose Estimation | Zhenguang Liu, Haoming Chen, Runyang Feng, Shuang Wu, Shouling Ji, Bailin Yang, Xun Wang |
707 | StereoPIFu: Depth Aware Clothed Human Digitization via Stereo Vision | Yang Hong, Juyong Zhang, Boyi Jiang, Yudong Guo, Ligang Liu, Hujun Bao |
901 | Body Meshes as Points | Jianfeng Zhang, Dongdong Yu, Jun Hao Liew, Xuecheng Nie, Jiashi Feng |
350 | Cross-View Cross-Scene Multi-View Crowd Counting | Qi Zhang, Wei Lin, Antoni B. Chan |
7372 | GANmut: Learning Interpretable Conditional Space for Gamut of Emotions | Stefano d'Apolito, Danda Pani Paudel, Zhiwu Huang, Andrés Romero, Luc Van Gool |
5287 | Consistent Instance False Positive Improves Fairness in Face Recognition | Xingkun Xu, Yuge Huang, Pengcheng Shen, Shaoxin Li, Jilin Li, Feiyue Huang, Yong Li, Zhen Cui |
233 | Neural Feature Search for RGB-Infrared Person Re-Identification | Yehansen Chen, Lin Wan, Zhihang Li, Qianyan Jing, Zongyuan Sun |
5953 | Coarse-To-Fine Person Re-Identification With Auxiliary-Domain Classification and Second-Order Information Bottleneck | Anguo Zhang, Yueming Gao, Yuzhen Niu, Wenxi Liu, Yongcheng Zhou |
3757 | EvDistill: Asynchronous Events To End-Task Learning via Bidirectional Reconstruction-Guided Cross-Modal Knowledge Distillation | Lin Wang, Yujeong Chae, Sung-Hoon Yoon, Tae-Kyun Kim, Kuk-Jin Yoon |
7525 | iVPF: Numerical Invertible Volume Preserving Flow for Efficient Lossless Compression | Shifeng Zhang, Chen Zhang, Ning Kang, Zhenguo Li |
3703 | Deep Convolutional Dictionary Learning for Image Denoising | Hongyi Zheng, Hongwei Yong, Lei Zhang |
2429 | Semi-Supervised Video Deraining With Dynamical Rain Generator | Zongsheng Yue, Jianwen Xie, Qian Zhao, Deyu Meng |
5340 | PPR10K: A Large-Scale Portrait Photo Retouching Dataset With Human-Region Mask and Group-Level Consistency | Jie Liang, Hui Zeng, Miaomiao Cui, Xuansong Xie, Lei Zhang |
3719 | Removing Diffraction Image Artifacts in Under-Display Camera via Dynamic Skip Connection Network | Ruicheng Feng, Chongyi Li, Huaijin Chen, Shuai Li, Chen Change Loy, Jinwei Gu |
6983 | GAN Prior Embedded Network for Blind Face Restoration in the Wild | Tao Yang, Peiran Ren, Xuansong Xie, Lei Zhang |
7591 | Polarimetric Normal Stereo | Yoshiki Fukao, Ryo Kawahara, Shohei Nobuhara, Ko Nishino |
2384 | Practical Single-Image Super-Resolution Using Look-Up Table | Younghyun Jo, Seon Joo Kim |
10569 | Deep Learning in Latent Space for Video Prediction and Compression | Bowen Liu, Yu Chen, Shiyu Liu, Hun-Seok Kim |
11775 | Debiased Subjective Assessment of Real-World Image Enhancement | Peibei Cao, Zhangyang Wang, Kede Ma |
6843 | BABEL: Bodies, Action and Behavior With English Labels | Abhinanda R. Punnakkal, Arjun Chandrasekaran, Nikos Athanasiou, Alejandra Quirós-Ramírez, Michael J. Black |
5043 | Rethinking Channel Dimensions for Efficient Model Design | Dongyoon Han, Sangdoo Yun, Byeongho Heo, YoungJoon Yoo |
11193 | Automated Log-Scale Quantization for Low-Cost Deep Neural Networks | Sangyun Oh, Hyeonuk Sim, Sugil Lee, Jongeun Lee |
11437 | ProSelfLC: Progressive Self Label Correction for Training Robust Deep Neural Networks | Xinshao Wang, Yang Hua, Elyor Kodirov, David A. Clifton, Neil M. Robertson |
10479 | The Lottery Ticket Hypothesis for Object Recognition | Sharath Girish, Shishira R Maiya, Kamal Gupta, Hao Chen, Larry S. Davis, Abhinav Shrivastava |
3707 | Spatial-Phase Shallow Learning: Rethinking Face Forgery Detection in Frequency Domain | Honggu Liu, Xiaodan Li, Wenbo Zhou, Yuefeng Chen, Yuan He, Hui Xue, Weiming Zhang, Nenghai Yu |
6064 | Transformer Interpretability Beyond Attention Visualization | Hila Chefer, Shir Gur, Lior Wolf |
2988 | Mixed-Privacy Forgetting in Deep Networks | Aditya Golatkar, Alessandro Achille, Avinash Ravichandran, Marzia Polito, Stefano Soatto |
2635 | CoSMo: Content-Style Modulation for Image Retrieval With Text Feedback | Seungmin Lee, Dongwan Kim, Bohyung Han |
7836 | Discrimination-Aware Mechanism for Fine-Grained Representation Learning | Furong Xu, Meng Wang, Wei Zhang, Yuan Cheng, Wei Chu |
6303 | Dual Contradistinctive Generative Autoencoder | Gaurav Parmar, Dacheng Li, Kwonjoon Lee, Zhuowen Tu |
4981 | Cross-Modal Contrastive Learning for Text-to-Image Generation | Han Zhang, Jing Yu Koh, Jason Baldridge, Honglak Lee, Yinfei Yang |
644 | Bridging the Visual Gap: Wide-Range Image Blending | Chia-Ni Lu, Ya-Chu Chang, Wei-Chen Chiu |
11462 | Exploiting Spatial Dimensions of Latent in GAN for Real-Time Image Editing | Hyunsu Kim, Yunjey Choi, Junho Kim, Sungjoo Yoo, Youngjung Uh |
1398 | ArtFlow: Unbiased Image Style Transfer via Reversible Neural Flows | Jie An, Siyu Huang, Yibing Song, Dejing Dou, Wei Liu, Jiebo Luo |
6872 | DualAST: Dual Style-Learning Networks for Artistic Style Transfer | Haibo Chen, Lei Zhao, Zhizhong Wang, Huiming Zhang, Zhiwen Zuo, Ailin Li, Wei Xing, Dongming Lu |
2885 | Single-Shot Freestyle Dance Reenactment | Oran Gafni, Oron Ashual, Lior Wolf |
5794 | SceneGen: Learning To Generate Realistic Traffic Scenes | Shuhan Tan, Kelvin Wong, Shenlong Wang, Sivabalan Manivasagam, Mengye Ren, Raquel Urtasun |
2642 | Learning Semantic-Aware Dynamics for Video Prediction | Xinzhu Bei, Yanchao Yang, Stefano Soatto |
2367 | Reinforced Attention for Few-Shot Learning and Beyond | Jie Hong, Pengfei Fang, Weihao Li, Tong Zhang, Christian Simon, Mehrtash Harandi, Lars Petersson |
5620 | Fast and Accurate Model Scaling | Piotr Dollár, Mannat Singh, Ross Girshick |
10021 | Multi-Label Learning From Single Positive Labels | Elijah Cole, Oisin Mac Aodha, Titouan Lorieul, Pietro Perona, Dan Morris, Nebojsa Jojic |
1950 | Contrastive Learning Based Hybrid Networks for Long-Tailed Image Classification | Peng Wang, Kai Han, Xiu-Shen Wei, Lei Zhang, Lei Wang |
1098 | Learning Graph Embeddings for Compositional Zero-Shot Learning | Muhammad Ferjad Naeem, Yongqin Xian, Federico Tombari, Zeynep Akata |
4080 | Multispectral Photometric Stereo for Spatially-Varying Spectral Reflectances: A Well Posed Problem? | Heng Guo, Fumio Okura, Boxin Shi, Takuya Funatomi, Yasuhiro Mukaigawa, Yasuyuki Matsushita |
6708 | LiBRe: A Practical Bayesian Approach to Adversarial Detection | Zhijie Deng, Xiao Yang, Shizhen Xu, Hang Su, Jun Zhu |
3487 | Searching by Generating: Flexible and Efficient One-Shot NAS With Architecture Generator | Sian-Yao Huang, Wei-Ta Chu |
11676 | Densely Connected Multi-Dilated Convolutional Networks for Dense Prediction Tasks | Naoya Takahashi, Yuki Mitsufuji |
5808 | DARCNN: Domain Adaptive Region-Based Convolutional Neural Network for Unsupervised Instance Segmentation in Biomedical Images | Joy Hsu, Wah Chiu, Serena Yeung |
2534 | FedDG: Federated Domain Generalization on Medical Image Segmentation via Episodic Learning in Continuous Frequency Space | Quande Liu, Cheng Chen, Jing Qin, Qi Dou, Pheng-Ann Heng |
2349 | Distractor-Aware Fast Tracking via Dynamic Convolutions and MOT Philosophy | Zikai Zhang, Bineng Zhong, Shengping Zhang, Zhenjun Tang, Xin Liu, Zhaoxiang Zhang |
5053 | Mining Better Samples for Contrastive Learning of Temporal Correspondence | Sangryul Jeon, Dongbo Min, Seungryong Kim, Kwanghoon Sohn |
4405 | UPFlow: Upsampling Pyramid for Unsupervised Optical Flow Learning | Kunming Luo, Chuan Wang, Shuaicheng Liu, Haoqiang Fan, Jue Wang, Jian Sun |
7376 | KeepAugment: A Simple Information-Preserving Data Augmentation Approach | Chengyue Gong, Dilin Wang, Meng Li, Vikas Chandra, Qiang Liu |
6966 | Keypoint-Graph-Driven Learning Framework for Object Pose Estimation | Shaobo Zhang, Wanqing Zhao, Ziyu Guan, Xianlin Peng, Jinye Peng |
2536 | AdCo: Adversarial Contrast for Efficient Learning of Unsupervised Representations From Self-Trained Negative Adversaries | Qianjiang Hu, Xiao Wang, Wei Hu, Guo-Jun Qi |
2474 | Generalized Domain Adaptation | Yu Mitsuzumi, Go Irie, Daiki Ikami, Takashi Shibata |
4071 | FixBi: Bridging Domain Spaces for Unsupervised Domain Adaptation | Jaemin Na, Heechul Jung, Hyung Jin Chang, Wonjun Hwang |
11397 | Learning Invariant Representations and Risks for Semi-Supervised Domain Adaptation | Bo Li, Yezhen Wang, Shanghang Zhang, Dongsheng Li, Kurt Keutzer, Trevor Darrell, Han Zhao |
1906 | Continual Semantic Segmentation via Repulsion-Attraction of Sparse and Disentangled Latent Representations | Umberto Michieli, Pietro Zanuttigh |
10436 | Incremental Learning via Rate Reduction | Ziyang Wu, Christina Baek, Chong You, Yi Ma |
4186 | Partially View-Aligned Representation Learning With Noise-Robust Contrastive Loss | Mouxing Yang, Yunfan Li, Zhenyu Huang, Zitao Liu, Peng Hu, Xi Peng |
5671 | Spatially Consistent Representation Learning | Byungseok Roh, Wuhyun Shin, Ildoo Kim, Sungwoong Kim |
592 | Mask Guided Matting via Progressive Refinement Network | Qihang Yu, Jianming Zhang, He Zhang, Yilin Wang, Zhe Lin, Ning Xu, Yutong Bai, Alan Yuille |
4851 | The Temporal Opportunist: Self-Supervised Multi-Frame Monocular Depth | Jamie Watson, Oisin Mac Aodha, Victor Prisacariu, Gabriel Brostow, Michael Firman |
1361 | Exemplar-Based Open-Set Panoptic Segmentation Network | Jaedong Hwang, Seoung Wug Oh, Joon-Young Lee, Bohyung Han |
10320 | Incremental Few-Shot Instance Segmentation | Dan Andrei Ganea, Bas Boom, Ronald Poppe |
398 | DoDNet: Learning To Segment Multi-Organ and Tumors From Multiple Partially Labeled Datasets | Jianpeng Zhang, Yutong Xie, Yong Xia, Chunhua Shen |
2529 | Semi-Supervised Semantic Segmentation With Directional Context-Aware Consistency | Xin Lai, Zhuotao Tian, Li Jiang, Shu Liu, Hengshuang Zhao, Liwei Wang, Jiaya Jia |
5229 | Source-Free Domain Adaptation for Semantic Segmentation | Yuang Liu, Wei Zhang, Jun Wang |
4002 | Learning the Superpixel in a Non-Iterative and Lifelong Manner | Lei Zhu, Qi She, Bin Zhang, Yanye Lu, Zhilin Lu, Duo Li, Jie Hu |
2335 | ATSO: Asynchronous Teacher-Student Optimization for Semi-Supervised Image Segmentation | Xinyue Huo, Lingxi Xie, Jianzhong He, Zijie Yang, Wengang Zhou, Houqiang Li, Qi Tian |
7507 | Can We Characterize Tasks Without Labels or Features? | Bram Wallace, Ziyang Wu, Bharath Hariharan |
8301 | Reconsidering Representation Alignment for Multi-View Clustering | Daniel J. Trosten, Sigurd Løkse, Robert Jenssen, Michael Kampffmeyer |
5491 | Learning To Segment Rigid Motions From Two Frames | Gengshan Yang, Deva Ramanan |
692 | Self-Supervised Motion Learning From Static Images | Ziyuan Huang, Shiwei Zhang, Jianwen Jiang, Mingqian Tang, Rong Jin, Marcelo H. Ang |
1558 | Efficient Regional Memory Network for Video Object Segmentation | Haozhe Xie, Hongxun Yao, Shangchen Zhou, Shengping Zhang, Wenxiu Sun |
3953 | SwiftNet: Real-Time Video Object Segmentation | Haochen Wang, Xiaolong Jiang, Haibing Ren, Yao Hu, Song Bai |
10268 | Improving OCR-Based Image Captioning by Incorporating Geometrical Relationship | Jing Wang, Jinhui Tang, Mingkun Yang, Xiang Bai, Jiebo Luo |
1525 | Improving Sign Language Translation With Monolingual Data by Sign Back-Translation | Hao Zhou, Wengang Zhou, Weizhen Qi, Junfu Pu, Houqiang Li |
1603 | Exploring Heterogeneous Clues for Weakly-Supervised Audio-Visual Video Parsing | Yu Wu, Yi Yang |
8136 | Looking Into Your Speech: Learning Cross-Modal Affinity for Audio-Visual Speech Separation | Jiyoung Lee, Soo-Whan Chung, Sunok Kim, Hong-Goo Kang, Kwanghoon Sohn |
6406 | Co-Grounding Networks With Semantic Attention for Referring Expression Comprehension in Videos | Sijie Song, Xudong Lin, Jiaying Liu, Zongming Guo, Shih-Fu Chang |
683 | Explicit Knowledge Incorporation for Visual Reasoning | Yifeng Zhang, Ming Jiang, Qi Zhao |
978 | Deep Gradient Projection Networks for Pan-sharpening | Shuang Xu, Jiangshe Zhang, Zixiang Zhao, Kai Sun, Junmin Liu, Chunxia Zhang |
5425 | Capturing Omni-Range Context for Omnidirectional Segmentation | Kailun Yang, Jiaming Zhang, Simon Reiß, Xinxin Hu, Rainer Stiefelhagen |
1302 | Gradient-Based Algorithms for Machine Teaching | Pei Wang, Kabir Nagrecha, Nuno Vasconcelos |
1159 | Depth From Camera Motion and Object Detection | Brent A. Griffin, Jason J. Corso |