MAIN CONFERENCE
All papers will be presented in the same manner. Each paper will have a five-minute pre-recorded video and a PDF of the poster. An asynchronous text chat will be available for each paper. Attendees can view the papers and videos on demand at any time. Authors will also hold individual Q&A sessions at the times posted below.
All posted times are in EDT; the chart linked below gives conversions for all time zones. Once the virtual site is up, you will be able to select the sessions you are interested in, and the site will populate your personal schedule.
Presentation Schedule
All times are Eastern Daylight Time
Date: Wednesday, June 23, 2021 22:00 – 24:30
Paper Session Eight:
Paper ID | Paper Title | Authors |
1881 | Informative and Consistent Correspondence Mining for Cross-Domain Weakly Supervised Object Detection | Luwei Hou, Yu Zhang, Kui Fu, Jia Li |
2936 | Cylindrical and Asymmetrical 3D Convolution Networks for LiDAR Segmentation | Xinge Zhu, Hui Zhou, Tai Wang, Fangzhou Hong, Yuexin Ma, Wei Li, Hongsheng Li, Dahua Lin |
8752 | Neural Splines: Fitting 3D Surfaces With Infinitely-Wide Neural Networks | Francis Williams, Matthew Trager, Joan Bruna, Denis Zorin |
5721 | Learning Neural Representation of Camera Pose with Matrix Representation of Pose Shift via View Synthesis | Yaxuan Zhu, Ruiqi Gao, Siyuan Huang, Song-Chun Zhu, Ying Nian Wu |
1797 | Action Unit Memory Network for Weakly Supervised Temporal Action Localization | Wang Luo, Tianzhu Zhang, Wenfei Yang, Jingen Liu, Tao Mei, Feng Wu, Yongdong Zhang |
1668 | Weakly-Supervised Physically Unconstrained Gaze Estimation | Rakshit Kothari, Shalini De Mello, Umar Iqbal, Wonmin Byeon, Seonwook Park, Jan Kautz |
1078 | On Self-Contact and Human Pose | Lea Müller, Ahmed A. A. Osman, Siyu Tang, Chun-Hao P. Huang, Michael J. Black |
6914 | Seeing in Extra Darkness Using a Deep-Red Flash | Jinhui Xiong, Jian Wang, Wolfgang Heidrich, Shree Nayar |
1468 | Light Field Super-Resolution With Zero-Shot Learning | Zhen Cheng, Zhiwei Xiong, Chang Chen, Dong Liu, Zheng-Jun Zha |
10249 | Neural Cellular Automata Manifold | Alejandro Hernandez, Armand Vilalta, Francesc Moreno-Noguer |
9920 | Convolutional Dynamic Alignment Networks for Interpretable Classifications | Moritz Böhle, Mario Fritz, Bernt Schiele |
517 | One-Shot Free-View Neural Talking-Head Synthesis for Video Conferencing | Ting-Chun Wang, Arun Mallya, Ming-Yu Liu |
2069 | Stay Positive: Non-Negative Image Synthesis for Augmented Reality | Katie Luo, Guandao Yang, Wenqi Xian, Harald Haraldsson, Bharath Hariharan, Serge Belongie |
8433 | Playable Video Generation | Willi Menapace, Stéphane Lathuilière, Sergey Tulyakov, Aliaksandr Siarohin, Elisa Ricci |
771 | Uncertainty-Aware Joint Salient Object and Camouflaged Object Detection | Aixuan Li, Jing Zhang, Yunqiu Lv, Bowen Liu, Tong Zhang, Yuchao Dai |
372 | Discovering Hidden Physics Behind Transport Dynamics | Peirong Liu, Lin Tian, Yubo Zhang, Stephen Aylward, Yueh Lee, Marc Niethammer |
1698 | AutoFlow: Learning a Better Training Set for Optical Flow | Deqing Sun, Daniel Vlasic, Charles Herrmann, Varun Jampani, Michael Krainin, Huiwen Chang, Ramin Zabih, William T. Freeman, Ce Liu |
11153 | Unsupervised Multi-Source Domain Adaptation Without Access to Source Data | Sk Miraj Ahmed, Dripta S. Raychaudhuri, Sujoy Paul, Samet Oymak, Amit K. Roy-Chowdhury |
1407 | A Second-Order Approach to Learning With Instance-Dependent Label Noise | Zhaowei Zhu, Tongliang Liu, Yang Liu |
10043 | Post-Hoc Uncertainty Calibration for Domain Drift Scenarios | Christian Tomani, Sebastian Gruber, Muhammed Ebrar Erdem, Daniel Cremers, Florian Buettner |
3174 | Cross-View Regularization for Domain Adaptive Panoptic Segmentation | Jiaxing Huang, Dayan Guan, Aoran Xiao, Shijian Lu |
6508 | DatasetGAN: Efficient Labeled Data Factory With Minimal Human Effort | Yuxuan Zhang, Huan Ling, Jun Gao, Kangxue Yin, Jean-Francois Lafleche, Adela Barriuso, Antonio Torralba, Sanja Fidler |
6466 | Learning To Segment Actions From Visual and Language Instructions via Differentiable Weak Sequence Alignment | Yuhan Shen, Lu Wang, Ehsan Elhamifar |
7518 | SIPSA-Net: Shift-Invariant Pan Sharpening With Moving Object Alignment for Satellite Imagery | Jaehyup Lee, Soomin Seo, Munchurl Kim |
7921 | Joint-DetNAS: Upgrade Your Detector With NAS, Pruning and Dynamic Distillation | Lewei Yao, Renjie Pi, Hang Xu, Wei Zhang, Zhenguo Li, Tong Zhang |
6629 | Dense Relation Distillation With Context-Aware Aggregation for Few-Shot Object Detection | Hanzhe Hu, Shuai Bai, Aoxue Li, Jinshi Cui, Liwei Wang |
3851 | OPANAS: One-Shot Path Aggregation Network Architecture Search for Object Detection | Tingting Liang, Yongtao Wang, Zhi Tang, Guosheng Hu, Haibin Ling |
6986 | Pedestrian and Ego-Vehicle Trajectory Prediction From Monocular Camera | Lukáš Neumann, Andrea Vedaldi |
1312 | DetectoRS: Detecting Objects With Recursive Feature Pyramid and Switchable Atrous Convolution | Siyuan Qiao, Liang-Chieh Chen, Alan Yuille |
4229 | Weakly-Supervised Instance Segmentation via Class-Agnostic Learning With Salient Images | Xinggang Wang, Jiapei Feng, Bin Hu, Qi Ding, Longjin Ran, Xiaoxin Chen, Wenyu Liu |
2456 | 3DCaricShop: A Dataset and a Baseline Method for Single-View 3D Caricature Face Reconstruction | Yuda Qiu, Xiaojie Xu, Lingteng Qiu, Yan Pan, Yushuang Wu, Weikai Chen, Xiaoguang Han |
5667 | D2IM-Net: Learning Detail Disentangled Implicit Fields From Single Images | Manyi Li, Hao Zhang |
6656 | Sign-Agnostic Implicit Learning of Surface Self-Similarities for Shape Modeling and Reconstruction From Raw Point Clouds | Wenbin Zhao, Jiabao Lei, Yuxin Wen, Jianguo Zhang, Kui Jia |
2916 | Learning Progressive Point Embeddings for 3D Point Cloud Generation | Cheng Wen, Baosheng Yu, Dacheng Tao |
5312 | Learning Fine-Grained Segmentation of 3D Shapes Without Part Labels | Xiaogang Wang, Xun Sun, Xinyu Cao, Kai Xu, Bin Zhou |
735 | Deformed Implicit Field: Modeling 3D Shapes With Learned Dense Correspondence | Yu Deng, Jiaolong Yang, Xin Tong |
1128 | Recurrent Multi-View Alignment Network for Unsupervised Surface Registration | Wanquan Feng, Juyong Zhang, Hongrui Cai, Haofei Xu, Junhui Hou, Hujun Bao |
615 | DeepLM: Large-Scale Nonlinear Least Squares on Deep Learning Frameworks Using Stochastic Domain Decomposition | Jingwei Huang, Shan Huang, Mingwei Sun |
4667 | D-NeRF: Neural Radiance Fields for Dynamic Scenes | Albert Pumarola, Enric Corona, Gerard Pons-Moll, Francesc Moreno-Noguer |
4004 | AdaStereo: A Simple and Efficient Approach for Adaptive Stereo Matching | Xiao Song, Guorun Yang, Xinge Zhu, Hui Zhou, Zhe Wang, Jianping Shi |
6212 | Tangent Space Backpropagation for 3D Transformation Groups | Zachary Teed, Jia Deng |
8121 | Unsupervised Learning for Robust Fitting: A Reinforcement Learning Approach | Giang Truong, Huu Le, David Suter, Erchuan Zhang, Syed Zulqarnain Gilani |
4801 | Hybrid Rotation Averaging: A Fast and Robust Rotation Averaging Approach | Yu Chen, Ji Zhao, Laurent Kneip |
2241 | ST3D: Self-Training for Unsupervised Domain Adaptation on 3D Object Detection | Jihan Yang, Shaoshuai Shi, Zhe Wang, Hongsheng Li, Xiaojuan Qi |
7023 | MonoRUn: Monocular 3D Object Detection by Reconstruction and Uncertainty Propagation | Hansheng Chen, Yuyao Huang, Wei Tian, Zhong Gao, Lu Xiong |
2739 | Semi-Supervised Action Recognition With Temporal Contrastive Learning | Ankit Singh, Omprakash Chakraborty, Ashutosh Varshney, Rameswar Panda, Rogerio Feris, Kate Saenko, Abir Das |
4856 | TrafficSim: Learning To Simulate Realistic Multi-Agent Behaviors | Simon Suo, Sebastian Regalado, Sergio Casas, Raquel Urtasun |
1633 | QPIC: Query-Based Pairwise Human-Object Interaction Detection With Image-Wide Contextual Information | Masato Tamura, Hiroki Ohashi, Tomoaki Yoshinaga |
4368 | Class-Aware Robust Adversarial Training for Object Detection | Pin-Chun Chen, Bo-Han Kung, Jun-Cheng Chen |
11707 | SurFree: A Fast Surrogate-Free Black-Box Attack | Thibault Maho, Teddy Furon, Erwan Le Merrer |
2476 | Lite-HRNet: A Lightweight High-Resolution Network | Changqian Yu, Bin Xiao, Changxin Gao, Lu Yuan, Lei Zhang, Nong Sang, Jingdong Wang |
724 | Model-Based 3D Hand Reconstruction via Self-Supervised Learning | Yujin Chen, Zhigang Tu, Di Kang, Linchao Bao, Ying Zhang, Xuefei Zhe, Ruizhi Chen, Junsong Yuan |
4356 | LEAP: Learning Articulated Occupancy of People | Marko Mihajlovic, Yan Zhang, Michael J. Black, Siyu Tang |
4678 | Bilevel Online Adaptation for Out-of-Domain Human Mesh Reconstruction | Shanyan Guan, Jingwei Xu, Yunbo Wang, Bingbing Ni, Xiaokang Yang |
10889 | Exploiting Semantic Embedding and Visual Feature for Facial Action Unit Detection | Huiyuan Yang, Lijun Yin, Yi Zhou, Jiuxiang Gu |
701 | WebFace260M: A Benchmark Unveiling the Power of Million-Scale Deep Face Recognition | Zheng Zhu, Guan Huang, Jiankang Deng, Yun Ye, Junjie Huang, Xinze Chen, Jiagang Zhu, Tian Yang, Jiwen Lu, Dalong Du, Jie Zhou |
3300 | PML: Progressive Margin Loss for Long-Tailed Age Classification | Zongyong Deng, Hao Liu, Yaoxing Wang, Chenyang Wang, Zekuan Yu, Xuehong Sun |
3046 | Fine-Grained Shape-Appearance Mutual Learning for Cloth-Changing Person Re-Identification | Peixian Hong, Tao Wu, Ancong Wu, Xintong Han, Wei-Shi Zheng |
6134 | Physics-Based Iterative Projection Complex Neural Network for Phase Retrieval in Lensless Microscopy Imaging | Feilong Zhang, Xianming Liu, Cheng Guo, Shiyi Lin, Junjun Jiang, Xiangyang Ji |
1492 | Asymmetric Gained Deep Image Compression With Continuous Rate Adaptation | Ze Cui, Jing Wang, Shangyin Gao, Tiansheng Guo, Yihui Feng, Bo Bai |
5141 | Learning a Non-Blind Deblurring Network for Night Blurry Images | Liang Chen, Jiawei Zhang, Jinshan Pan, Songnan Lin, Faming Fang, Jimmy S. Ren |
640 | Contrastive Learning for Compact Single Image Dehazing | Haiyan Wu, Yanyun Qu, Shaohui Lin, Jian Zhou, Ruizhi Qiao, Zhizhong Zhang, Yuan Xie, Lizhuang Ma |
3037 | Retinex-Inspired Unrolling With Cooperative Prior Architecture Search for Low-Light Image Enhancement | Risheng Liu, Long Ma, Jiaao Zhang, Xin Fan, Zhongxuan Luo |
1756 | Auto-Exposure Fusion for Single-Image Shadow Removal | Lan Fu, Changqing Zhou, Qing Guo, Felix Juefei-Xu, Hongkai Yu, Wei Feng, Yang Liu, Song Wang |
5692 | Unsupervised Degradation Representation Learning for Blind Super-Resolution | Longguang Wang, Yingqian Wang, Xiaoyu Dong, Qingyu Xu, Jungang Yang, Wei An, Yulan Guo |
2744 | Lighting, Reflectance and Geometry Estimation From 360° Panoramic Stereo | Junxuan Li, Hongdong Li, Yasuyuki Matsushita |
1253 | Flow-Based Kernel Prior With Application to Blind Super-Resolution | Jingyun Liang, Kai Zhang, Shuhang Gu, Luc Van Gool, Radu Timofte |
7907 | KOALAnet: Blind Super-Resolution Using Kernel-Oriented Adaptive Local Adjustment | Soo Ye Kim, Hyeonjun Sim, Munchurl Kim |
3482 | 3D Video Stabilization With Depth Estimation by CNN-Based Optimization | Yao-Chih Lee, Kuan-Wei Tseng, Yu-Ta Chen, Chien-Cheng Chen, Chu-Song Chen, Yi-Ping Hung |
4730 | iMiGUE: An Identity-Free Video Dataset for Micro-Gesture Understanding and Emotion Analysis | Xin Liu, Henglin Shi, Haoyu Chen, Zitong Yu, Xiaobai Li, Guoying Zhao |
2750 | ACRE: Abstract Causal REasoning Beyond Covariation | Chi Zhang, Baoxiong Jia, Mark Edmonds, Song-Chun Zhu, Yixin Zhu |
7829 | Enhance Curvature Information by Structured Stochastic Quasi-Newton Methods | Minghan Yang, Dong Xu, Hongyu Chen, Zaiwen Wen, Mengyun Chen |
4492 | Refine Myself by Teaching Myself: Feature Refinement via Self-Knowledge Distillation | Mingi Ji, Seungjae Shin, Seunghyun Hwang, Gibeom Park, Il-Chul Moon |
5838 | Towards Efficient Tensor Decomposition-Based DNN Model Compression With Optimization Framework | Miao Yin, Yang Sui, Siyu Liao, Bo Yuan |
8893 | QPP: Real-Time Quantization Parameter Prediction for Deep Neural Networks | Vladimir Kryzhanovskiy, Gleb Balitskiy, Nikolay Kozyrskiy, Aleksandr Zuruev |
4686 | Explaining Classifiers Using Adversarial Perturbations on the Perceptual Ball | Andrew Elliott, Stephen Law, Chris Russell |
7407 | Verifiability and Predictability: Interpreting Utilities of Network Architectures for Point Cloud Processing | Wen Shen, Zhihua Wei, Shikun Huang, Binbin Zhang, Panyue Chen, Ping Zhao, Quanshi Zhang |
8518 | Model-Contrastive Federated Learning | Qinbin Li, Bingsheng He, Dawn Song |
2333 | Compatibility-Aware Heterogeneous Visual Search | Rahul Duggal, Hao Zhou, Shuo Yang, Yuanjun Xiong, Wei Xia, Zhuowen Tu, Stefano Soatto |
5834 | Plan2Scene: Converting Floorplans to 3D Scenes | Madhawa Vidanapathirana, Qirui Wu, Yasutaka Furukawa, Angel X. Chang, Manolis Savva |
1268 | Few-Shot Image Generation via Cross-Domain Correspondence | Utkarsh Ojha, Yijun Li, Jingwan Lu, Alexei A. Efros, Yong Jae Lee, Eli Shechtman, Richard Zhang |
8135 | Adversarial Generation of Continuous Images | Ivan Skorokhodov, Savva Ignatyev, Mohamed Elhoseiny |
3556 | DeFLOCNet: Deep Image Editing via Flexible Low-Level Controls | Hongyu Liu, Ziyu Wan, Wei Huang, Yibing Song, Xintong Han, Jing Liao, Bin Jiang, Wei Liu |
7804 | Generating Diverse Structure for Image Inpainting With Hierarchical VQ-VAE | Jialun Peng, Dong Liu, Songcen Xu, Houqiang Li |
4721 | Smoothing the Disentangled Latent Style Space for Unsupervised Image-to-Image Translation | Yahui Liu, Enver Sangineto, Yajing Chen, Linchao Bao, Haoxian Zhang, Nicu Sebe, Bruno Lepri, Wei Wang, Marco De Nadai |
532 | Flow Guided Transformable Bottleneck Networks for Motion Retargeting | Jian Ren, Menglei Chai, Oliver J. Woodford, Kyle Olszewski, Sergey Tulyakov |
8229 | Learning Semantic Person Image Generation by Region-Adaptive Normalization | Zhengyao Lv, Xiaoming Li, Xin Li, Fu Li, Tianwei Lin, Dongliang He, Wangmeng Zuo |
5754 | Large-Capacity Image Steganography Based on Invertible Neural Networks | Shao-Ping Lu, Rong Wang, Tao Zhong, Paul L. Rosin |
6861 | Self-Supervised Video GANs: Learning for Appearance Consistency and Motion Coherency | Sangeek Hyun, Jihwan Kim, Jae-Pil Heo |
10643 | Exploring Complementary Strengths of Invariant and Equivariant Representations for Few-Shot Learning | Mamshad Nayeem Rizve, Salman Khan, Fahad Shahbaz Khan, Mubarak Shah |
9959 | Clusformer: A Transformer Based Clustering Approach to Unsupervised Large-Scale Face and Visual Landmark Recognition | Xuan-Bac Nguyen, Duc Toan Bui, Chi Nhan Duong, Tien D. Bui, Khoa Luu |
1297 | CReST: A Class-Rebalancing Self-Training Framework for Imbalanced Semi-Supervised Learning | Chen Wei, Kihyuk Sohn, Clayton Mellina, Alan Yuille, Fan Yang |
868 | Neighborhood Contrastive Learning for Novel Class Discovery | Zhun Zhong, Enrico Fini, Subhankar Roy, Zhiming Luo, Elisa Ricci, Nicu Sebe |
443 | Predicting Human Scanpaths in Visual Question Answering | Xianyu Chen, Ming Jiang, Qi Zhao |
7718 | Diverse Branch Block: Building a Convolution as an Inception-Like Unit | Xiaohan Ding, Xiangyu Zhang, Jungong Han, Guiguang Ding |
636 | HourNAS: Extremely Fast Neural Architecture Search Through an Hourglass Lens | Zhaohui Yang, Yunhe Wang, Xinghao Chen, Jianyuan Guo, Wei Zhang, Chao Xu, Chunjing Xu, Dacheng Tao, Chang Xu |
11940 | Neural Architecture Search With Random Labels | Xuanyang Zhang, Pengfei Hou, Xiangyu Zhang, Jian Sun |
3675 | Metadata Normalization | Mandy Lu, Qingyu Zhao, Jiequan Zhang, Kilian M. Pohl, Li Fei-Fei, Juan Carlos Niebles, Ehsan Adeli |
11821 | Learning-Based Image Registration With Meta-Regularization | Ebrahim Al Safadi, Xubo Song |
7029 | Lesion-Aware Transformers for Diabetic Retinopathy Grading | Rui Sun, Yihao Li, Tianzhu Zhang, Zhendong Mao, Feng Wu, Yongdong Zhang |
7038 | CapsuleRRT: Relationships-Aware Regression Tracking via Capsules | Ding Ma, Xiangqian Wu |
8685 | Improving Multiple Pedestrian Tracking by Track Management and Occlusion Handling | Daniel Stadler, Jürgen Beyerer |
10845 | Prioritized Architecture Sampling With Monto-Carlo Tree Search | Xiu Su, Tao Huang, Yanxi Li, Shan You, Fei Wang, Chen Qian, Changshui Zhang, Chang Xu |
7797 | Stochastic Whitening Batch Normalization | Shengdong Zhang, Ehsan Nezhadarya, Homa Fashandi, Jiayi Liu, Darin Graham, Mohak Shah |
8641 | Revisiting Superpixels for Active Learning in Semantic Segmentation With Realistic Annotation Costs | Lile Cai, Xun Xu, Jun Hao Liew, Chuan Sheng Foo |
564 | Dynamic Transfer for Multi-Source Domain Adaptation | Yunsheng Li, Lu Yuan, Yinpeng Chen, Pei Wang, Nuno Vasconcelos |
3292 | Multi-Source Domain Adaptation With Collaborative Learning for Semantic Segmentation | Jianzhong He, Xu Jia, Shuaijun Chen, Jianzhuang Liu |
7481 | Semi-Supervised Domain Adaptation Based on Dual-Level Domain Mixing for Semantic Segmentation | Shuaijun Chen, Xu Jia, Jianzhong He, Yongjie Shi, Jianzhuang Liu |
7452 | What Can Style Transfer and Paintings Do for Model Robustness? | Hubert Lin, Mitchell van Zuijlen, Sylvia C. Pont, Maarten W.A. Wijntjes, Kavita Bala |
6297 | IIRC: Incremental Implicitly-Refined Classification | Mohamed Abdelsalam, Mojtaba Faramarzi, Shagun Sodhani, Sarath Chandar |
5718 | LayoutGMN: Neural Graph Matching for Structural Layout Similarity | Akshay Gadi Patil, Manyi Li, Matthew Fisher, Manolis Savva, Hao Zhang |
399 | CASTing Your Model: Learning To Localize Improves Self-Supervised Representations | Ramprasaath R. Selvaraju, Karan Desai, Justin Johnson, Nikhil Naik |
9917 | Representation Learning via Global Temporal Alignment and Cycle-Consistency | Isma Hadji, Konstantinos G. Derpanis, Allan D. Jepson |
1188 | Sparse Auxiliary Networks for Unified Monocular Depth Prediction and Completion | Vitor Guizilini, Rareș Ambruș, Wolfram Burgard, Adrien Gaidon |
10204 | Single Image Depth Prediction With Wavelet Decomposition | Michaël Ramamonjisoa, Michael Firman, Jamie Watson, Vincent Lepetit, Daniyar Turmukhambetov |
3844 | FAPIS: A Few-Shot Anchor-Free Part-Based Instance Segmenter | Khoi Nguyen, Sinisa Todorovic |
2839 | Bipartite Graph Network With Adaptive Message Passing for Unbiased Scene Graph Generation | Rongjie Li, Songyang Zhang, Bo Wan, Xuming He |
1868 | Semantic Image Matting | Yanan Sun, Chi-Keung Tang, Yu-Wing Tai |
4384 | Three Ways To Improve Semantic Segmentation With Self-Supervised Depth Estimation | Lukas Hoyer, Dengxin Dai, Yuhua Chen, Adrian Köring, Suman Saha, Luc Van Gool |
6055 | Robust Instance Segmentation Through Reasoning About Multi-Object Occlusion | Xiaoding Yuan, Adam Kortylewski, Yihong Sun, Alan Yuille |
5566 | Learning Graphs for Knowledge Transfer With Limited Labels | Pallabi Ghosh, Nirat Saini, Larry S. Davis, Abhinav Shrivastava |
410 | VirTex: Learning Visual Representations From Textual Annotations | Karan Desai, Justin Johnson |
4138 | COMPLETER: Incomplete Multi-View Clustering via Contrastive Prediction | Yijie Lin, Yuanbiao Gou, Zitao Liu, Boyun Li, Jiancheng Lv, Xi Peng |
10838 | Home Action Genome: Cooperative Compositional Action Understanding | Nishant Rai, Haofeng Chen, Jingwei Ji, Rishi Desai, Kazuki Kozuka, Shun Ishizaka, Ehsan Adeli, Juan Carlos Niebles |
2328 | Learning Goals From Failure | Dave Epstein, Carl Vondrick |
5184 | VideoMoCo: Contrastive Video Representation Learning With Temporally Adversarial Examples | Tian Pan, Yibing Song, Tianyu Yang, Wenhao Jiang, Wei Liu |
2487 | Spatial Feature Calibration and Temporal Fusion for Effective One-Stage Video Instance Segmentation | Minghan Li, Shuai Li, Lida Li, Lei Zhang |
8337 | Temporally-Weighted Hierarchical Clustering for Unsupervised Action Segmentation | Saquib Sarfraz, Naila Murray, Vivek Sharma, Ali Diba, Luc Van Gool, Rainer Stiefelhagen |
619 | Context-Aware Biaffine Localizing Network for Temporal Sentence Grounding | Daizong Liu, Xiaoye Qu, Jianfeng Dong, Pan Zhou, Yu Cheng, Wei Wei, Zichuan Xu, Yulai Xie |
7889 | Towards Diverse Paragraph Captioning for Untrimmed Videos | Yuqing Song, Shizhe Chen, Qin Jin |
7231 | Intelligent Carpet: Inferring 3D Human Pose From Tactile Signals | Yiyue Luo, Yunzhu Li, Michael Foshey, Wan Shou, Pratyusha Sharma, Tomás Palacios, Antonio Torralba, Wojciech Matusik |
3484 | Bottom-Up Shift and Reasoning for Referring Image Segmentation | Sibei Yang, Meng Xia, Guanbin Li, Hong-Yu Zhou, Yizhou Yu |
3645 | Topological Planning With Transformers for Vision-and-Language Navigation | Kevin Chen, Junshen K. Chen, Jo Chuang, Marynel Vázquez, Silvio Savarese |
2764 | AGQA: A Benchmark for Compositional Spatio-Temporal Reasoning | Madeleine Grunde-McLaughlin, Ranjay Krishna, Maneesh Agrawala |
4327 | HDR Environment Map Estimation for Real-Time Augmented Reality | Gowri Somanath, Daniel Kurz |
5741 | Fashion IQ: A New Dataset Towards Retrieving Images by Natural Language Feedback | Hui Wu, Yupeng Gao, Xiaoxiao Guo, Ziad Al-Halah, Steven Rennie, Kristen Grauman, Rogerio Feris |
5680 | TPCN: Temporal Point Cloud Networks for Motion Forecasting | Maosheng Ye, Tongyi Cao, Qifeng Chen |
230 | Generalizable Pedestrian Detection: The Elephant in the Room | Irtiza Hasan, Shengcai Liao, Jinpeng Li, Saad Ullah Akram, Ling Shao |