大合集!CVPR2020論文分方向整理: 目標檢測/圖像分割/姿態估計等,附打包下載(持續更新)

CVPR2020在2月24日公佈了所有接受論文ID,相關報道:1470篇!CVPR2020結果出爐,你中了嗎?(附部分論文鏈接/開源代碼/解讀)。自論文ID公佈以來,許多開發者都分享了自己的優秀工作。

從論文ID公佈以來,極市一直在對CVPR進行實時跟進,本文是對CVPR2020論文整理和分類,均有論文鏈接,部分含開源代碼,涵蓋的方向有:目標檢測、目標跟蹤、圖像分割、人臉識別、姿態估計、三維點雲、視頻分析、模型加速、GAN、OCR等方向。

爲了方便大家閱讀,小極已經將全部論文下載並打包。掃描下方二維碼 關注 極市平臺 公衆號,回覆 CVPR2020 即可獲取下載鏈接。同時,可訪問 極市社區,後續論文收錄會在這裏保持更新關注極市平臺,獲取CVPR2020論文合集下載鏈接
聲明:本文爲極市平臺原創整理,未經許可,不得擅自轉載。

此外,我們也會在Github和極市社區上保持更新,歡迎大家關注:
https://github.com/extreme-assistant/cvpr2020/blob/master/CVPR2020.md

目錄

1. 目標檢測

2. 圖像分割

3. 人臉識別

4. 目標跟蹤

5. 三維點雲/三維重建

6. 圖像處理

7. 圖像分類

8. 姿態估計/動作識別

9. 視頻分析

10. OCR

11. GAN

12. 小樣本/零樣本

13. 弱監督/無監督/自監督

14. 行人跟蹤/行人檢測/ReID

15. 神經網絡/模型加速/模型壓縮

16. 超分辨率

17. 視覺常識/數據集/其他



目標檢測

  1. Bridging the Gap Between Anchor-based and Anchor-free Detection via Adaptive Training Sample Selection

    論文地址:https://arxiv.org/abs/1912.02424

    代碼:https://github.com/sfzhang15/ATSS

  2. Few-Shot Object Detection with Attention-RPN and Multi-Relation Detector

    論文地址:https://arxiv.org/abs/1908.01998

  3. AugFPN: Improving Multi-scale Feature Learning for Object Detection

    論文地址:https://arxiv.org/abs/1912.05384

  4. Hit-Detector: Hierarchical Trinity Architecture Search for Object Detection

    論文地址:https://arxiv.org/abs/2003.11818

    代碼:https://github.com/ggjy/HitDet.pytorch

  5. Multi-task Collaborative Network for Joint Referring Expression Comprehension and Segmentation

    論文地址:https://arxiv.org/abs/2003.08813

  6. CentripetalNet: Pursuing High-quality Keypoint Pairs for Object Detection

    論文地址:https://arxiv.org/abs/2003.09119

    代碼:https://github.com/KiveeDong/CentripetalNet



圖像分割

  1. Semi-Supervised Semantic Image Segmentation with Self-correcting Networks

    論文地址:https://arxiv.org/abs/1811.07073

  2. Deep Snake for Real-Time Instance Segmentation

    論文地址:https://arxiv.org/abs/2001.01629

  3. CenterMask : Real-Time Anchor-Free Instance Segmentation

    論文地址:https://arxiv.org/abs/1911.06667

    代碼:https://github.com/youngwanLEE/CenterMask

  4. SketchGCN: Semantic Sketch Segmentation with Graph Convolutional Networks

    論文地址:https://arxiv.org/abs/2003.00678

  5. PolarMask: Single Shot Instance Segmentation with Polar Representation

    論文地址:https://arxiv.org/abs/1909.13226

    代碼:https://github.com/xieenze/PolarMask

  6. xMUDA: Cross-Modal Unsupervised Domain Adaptation for 3D Semantic Segmentation

    論文地址:https://arxiv.org/abs/1911.12676

  7. BlendMask: Top-Down Meets Bottom-Up for Instance Segmentation

    論文地址:https://arxiv.org/abs/2001.00309

  8. Enhancing Generic Segmentation with Learned Region Representations

    論文地址:https://arxiv.org/abs/1911.08564



人臉識別

  1. Towards Universal Representation Learning for Deep Face Recognition

    論文地址:https://arxiv.org/abs/2002.11841

  2. Suppressing Uncertainties for Large-Scale Facial Expression Recognition

    論文地址:https://arxiv.org/abs/2002.10392

    代碼:https://github.com/kaiwang960112/Self-Cure-Network

  3. Face X-ray for More General Face Forgery Detection

    論文地址:https://arxiv.org/pdf/1912.13458.pdf

  4. Pose Agnostic Cross-spectral Hallucination via Disentangling Independent Factors

    論文地址:https://arxiv.org/abs/1909.04365

  5. Deep Spatial Gradient and Temporal Depth Learning for Face Anti-spoofing

    論文地址:https://arxiv.org/abs/2003.08061

    代碼:https://github.com/clks-wzz/FAS-SGTD

  6. Learning Meta Face Recognition in Unseen Domains

    論文地址:https://arxiv.org/abs/2003.07733

    代碼:https://github.com/cleardusk/MFR



目標跟蹤

  1. ROAM: Recurrently Optimizing Tracking Model

    論文地址:https://arxiv.org/abs/1907.12006



三維點雲&重建

  1. PF-Net: Point Fractal Network for 3D Point Cloud Completion

    論文地址:https://arxiv.org/abs/2003.00410

  2. PointAugment: an Auto-Augmentation Framework for Point Cloud Classification

    論文地址:https://arxiv.org/abs/2002.10876

    代碼:https://github.com/liruihui/PointAugment/

  3. Learning multiview 3D point cloud registration

    論文地址:https://arxiv.org/abs/2001.05119

  4. C-Flow: Conditional Generative Flow Models for Images and 3D Point Clouds

    論文地址:https://arxiv.org/abs/1912.07009

  5. RandLA-Net: Efficient Semantic Segmentation of Large-Scale Point Clouds

    論文地址:https://arxiv.org/abs/1911.11236

  6. Total3DUnderstanding: Joint Layout, Object Pose and Mesh Reconstruction for Indoor Scenes from a Single Image

    論文地址:https://arxiv.org/abs/2002.12212

  7. Implicit Functions in Feature Space for 3D Shape Reconstruction and Completion

    論文地址:https://arxiv.org/abs/2003.01456

  8. In Perfect Shape: Certifiably Optimal 3D Shape Reconstruction from 2D Landmarks

    論文地址:https://arxiv.org/pdf/1911.11924.pdf

  9. Attentive Context Normalization for Robust Permutation-Equivariant Learning

    論文地址:https://arxiv.org/abs/1907.02545 Weiwei Sun, Wei Jiang, Eduard Trulls, Andrea Tagliasacchi, Kwang Moo Yi

  10. PQ-NET: A Generative Part Seq2Seq Network for 3D Shapes

    論文地址:https://arxiv.org/abs/1911.10949

  11. SG-NN: Sparse Generative Neural Networks for Self-Supervised Scene Completion of RGB-D Scans

    論文地址:https://arxiv.org/abs/1912.00036

  12. Cascade Cost Volume for High-Resolution Multi-View Stereo and Stereo Matching

    論文地址:https://arxiv.org/abs/1912.06378

    代碼:https://github.com/alibaba/cascade-stereo

  13. Unsupervised Learning of Intrinsic Structural Representation Points

    論文地址:https://arxiv.org/abs/2003.01661

    代碼:https://github.com/NolenChen/3DStructurePoints



圖像處理

  1. Learning to Shade Hand-drawn Sketches

    論文地址:https://arxiv.org/abs/2002.11812

  2. Single Image Reflection Removal through Cascaded Refinement

    論文地址:https://arxiv.org/abs/1911.06634

  3. Generalized ODIN: Detecting Out-of-distribution Image without Learning from Out-of-distribution Data

    論文地址:https://arxiv.org/abs/2002.11297

  4. Deep Image Harmonization via Domain Verification

    論文地址:https://arxiv.org/abs/1911.13239

    代碼:https://github.com/bcmi/Image_Harmonization_Datasets

  5. RoutedFusion: Learning Real-time Depth Map Fusion

    論文地址:https://arxiv.org/pdf/2001.04388.pdf

  6. Neural Contours: Learning to Draw Lines from 3D Shapes

    論文地址:https://arxiv.org/abs/2003.10333

  7. Towards Photo-Realistic Virtual Try-On by Adaptively Generating鈫Preserving Image Content

    論文地址:https://arxiv.org/abs/2003.05863

  8. Reinforced Feature Points: Optimizing Feature Detection and Description for a High-Level Task(圖像處理-圖像特徵匹配)

    論文地址:https://arxiv.org/abs/1912.00623

  9. Correspondence Networks with Adaptive Neighbourhood Consensus(圖像處理-圖像特徵匹配)

    論文地址:https://arxiv.org/abs/2003.12059

  10. Normalized and Geometry-Aware Self-Attention Network for Image Captioning(圖像處理-圖像字幕)

    論文地址:https://arxiv.org/abs/2003.08897


圖像分類

  1. Self-training with Noisy Student improves ImageNet classification

    論文地址:https://arxiv.org/abs/1911.04252

  2. Image Matching across Wide Baselines: From Paper to Practice

    論文地址:https://arxiv.org/abs/2003.01587

  3. Towards Robust Image Classification Using Sequential Attention Models

    論文地址:https://arxiv.org/abs/1912.02184

  4. Learning in the Frequency Domain

    論文地址:https://arxiv.org/abs/2002.12416

  5. Learning from Web Data with Memory Module

    論文地址:https://arxiv.org/abs/1906.12028

  6. Making Better Mistakes: Leveraging Class Hierarchies with Deep Networks

    論文地址:https://arxiv.org/abs/1912.09393



### 姿態估計/動作識別
  1. VIBE: Video Inference for Human Body Pose and Shape Estimation

    論文地址:https://arxiv.org/abs/1912.05656

    代碼:https://github.com/mkocabas/VIBE

  2. Distribution-Aware Coordinate Representation for Human Pose Estimation

    論文地址:https://arxiv.org/abs/1910.06278

    代碼:https://github.com/ilovepose/DarkPose

  3. 4D Association Graph for Realtime Multi-person Motion Capture Using Multiple Video Cameras

    論文地址:https://arxiv.org/abs/2002.12625

  4. Optimal least-squares solution to the hand-eye calibration problem

    論文地址:https://arxiv.org/abs/2002.10838

  5. D3VO: Deep Depth, Deep Pose and Deep Uncertainty for Monocular Visual Odometry

    論文地址:https://arxiv.org/abs/2003.01060

  6. Multi-Modal Domain Adaptation for Fine-Grained Action Recognition

    論文地址:https://arxiv.org/abs/2001.09691

  7. Distribution Aware Coordinate Representation for Human Pose Estimation

    論文地址:https://arxiv.org/abs/1910.06278

  8. The Devil is in the Details: Delving into Unbiased Data Processing for Human Pose Estimation

    論文地址:https://arxiv.org/abs/1911.07524

  9. PVN3D: A Deep Point-wise 3D Keypoints Voting Network for 6DoF Pose Estimation

    論文地址:https://arxiv.org/abs/1911.04231

  10. Action Segmentation with Joint Self-Supervised Temporal Domain Adaptation

    論文地址:https://arxiv.org/abs/2003.02824

  11. G2L-Net: Global to Local Network for Real-time 6D Pose Estimation with Embedding Vector Features

    論文地址:https://arxiv.org/abs/2003.11089

  12. Deep Image Spatial Transformation for Person Image Generation

    論文地址:https://arxiv.org/abs/2003.00696

    代碼:https://github.com/RenYurui/ Global-Flow-Local-Attention



視頻分析

  1. Rethinking Zero-shot Video Classification: End-to-end Training for Realistic Applications

    論文地址:https://arxiv.org/abs/2003.01455

    代碼:https://github.com/bbrattoli/ZeroShotVideoClassification

  2. Say As You Wish: Fine-grained Control of Image Caption Generation with Abstract Scene Graphs

    論文地址:https://arxiv.org/abs/2003.00387

  3. Fine-grained Video-Text Retrieval with Hierarchical Graph Reasoning

    論文地址:https://arxiv.org/abs/2003.00392

  4. Object Relational Graph with Teacher-Recommended Learning for Video Captioning

    論文地址:https://arxiv.org/abs/2002.11566

  5. Zooming Slow-Mo: Fast and Accurate One-Stage Space-Time Video Super-Resolution

    論文地址:https://arxiv.org/abs/2002.11616

  6. Blurry Video Frame Interpolation

    論文地址:https://arxiv.org/abs/2002.12259

  7. Hierarchical Conditional Relation Networks for Video Question Answering

    論文地址:https://arxiv.org/abs/2002.10698

  8. Action Modifiers:Learning from Adverbs in Instructional Video

    論文地址:https://arxiv.org/abs/1912.06617

  9. Visual Grounding in Video for Unsupervised Word Translation

    論文地址:https://arxiv.org/abs/2003.05078

    代碼:https://github.com/gsig/visual-grounding

  10. MaskFlownet: Asymmetric Feature Matching with Learnable Occlusion Mask(視頻分析-光流估計)

    論文地址:https://arxiv.org/abs/2003.10955

    代碼:https://github.com/microsoft/MaskFlownet

  11. Use the Force, Luke! Learning to Predict Physical Forces by Simulating Effects(視頻預測)

    論文地址:https://arxiv.org/abs/2003.12045

    代碼:https://ehsanik.github.io/forcecvpr2020



OCR

  1. ABCNet: Real-time Scene Text Spotting with Adaptive Bezier-Curve Network

    論文地址:https://arxiv.org/abs/2002.10200

    代碼:https://github.com/Yuliang-Liu/bezier_curve_text_spotting,https://github.com/aim-uofa/adet

  2. Iterative Answer Prediction with Pointer-Augmented Multimodal Transformers for TextVQA

    論文地址:https://arxiv.org/abs/1911.06258



GAN

  1. Your Local GAN: Designing Two Dimensional Local Attention Mechanisms for Generative Models

    論文地址:https://arxiv.org/abs/1911.12287

    代碼:https://github.com/giannisdaras/ylg

  2. MSG-GAN: Multi-Scale Gradient GAN for Stable Image Synthesis

    論文地址:https://arxiv.org/abs/1903.06048

  3. Robust Design of Deep Neural Networks against Adversarial Attacks based on Lyapunov Theory

    論文地址:https://arxiv.org/abs/1911.04636

  4. PSGAN: Pose and Expression Robust Spatial-Aware GAN for Customizable Makeup Transfer

    論文地址:https://arxiv.org/abs/1909.06956



小樣本/零樣本

  1. Improved Few-Shot Visual Classification

    論文地址:https://arxiv.org/pdf/1912.03432.pdf

  2. Meta-Transfer Learning for Zero-Shot Super-Resolution

    論文地址:https://arxiv.org/abs/2002.12213

  3. Instance Credibility Inference for Few-Shot Learning

    論文地址:https://arxiv.org/abs/2003.11853

    代碼:https://github.com/Yikai-Wang/ICI-FSL



弱監督/無監督/自監督

  1. Rethinking the Route Towards Weakly Supervised Object Localization

    論文地址:https://arxiv.org/abs/2002.11359

  2. NestedVAE: Isolating Common Factors via Weak Supervision

    論文地址:https://arxiv.org/abs/2002.11576

  3. Unsupervised Reinforcement Learning of Transferable Meta-Skills for Embodied Navigation

    論文地址:https://arxiv.org/abs/1911.07450

  4. Disentangling Physical Dynamics from Unknown Factors for Unsupervised Video Prediction

    論文地址:https://arxiv.org/abs/2003.01460

  5. ClusterFit: Improving Generalization of Visual Representations

    論文地址:https://arxiv.org/abs/1912.03330

  6. Auto-Encoding Twin-Bottleneck Hashing

    論文地址:https://arxiv.org/abs/2002.11930

  7. Learning Representations by Predicting Bags of Visual Words

    論文地址:https://arxiv.org/abs/2002.12247

  8. A Characteristic Function Approach to Deep Implicit Generative Modeling

    論文地址:https://arxiv.org/abs/1909.07425

  9. Unsupervised Learning of Intrinsic Structural Representation Points

    論文地址:https://arxiv.org/abs/2003.01661

    代碼:https://github.com/NolenChen/3DStructurePoints



行人跟蹤/行人檢測/ReID

  1. Cross-modality Person re-identification with Shared-Specific Feature Transfer

    論文地址:https://arxiv.org/abs/2002.12489

  2. Social-STGCNN: A Social Spatio-Temporal Graph Convolutional Neural Network for Human Trajectory Prediction

    論文地址:https://arxiv.org/abs/2002.11927

  3. The Garden of Forking Paths: Towards Multi-Future Trajectory Prediction

    論文地址:https://arxiv.org/abs/1912.06445



神經網絡/模型壓縮/模型加速

  1. GhostNet: More Features from Cheap Operations

    論文地址:https://arxiv.org/abs/1911.11907

    代碼:https://github.com/iamhankai/ghostnet

  2. Watch your Up-Convolution: CNN Based Generative Deep Neural Networks are Failing to Reproduce Spectral

    論文地址:https://arxiv.org/abs/2003.01826

  3. GPU-Accelerated Mobile Multi-view Style Transfer

    論文地址:https://arxiv.org/abs/2003.00706

  4. Bundle Adjustment on a Graph Processor

    論文地址:https://arxiv.org/abs/2003.03134

    代碼:https://github.com/joeaortiz/gbp

  5. Watch your Up-Convolution: CNN Based Generative Deep Neural Networks are Failing to Reproduce Spectral

    論文地址:https://arxiv.org/abs/2003.01826

  6. Holistically-Attracted Wireframe Parsing

    論文地址:https://arxiv.org/abs/2003.01663

  7. AdderNet: Do We Really Need Multiplications in Deep Learning?

    論文地址:https://arxiv.org/abs/1912.13200

  8. CARS: Contunuous Evolution for Efficient Neural Architecture Search

    論文地址:https://arxiv.org/abs/1909.04977

    代碼:https://github.com/huawei-noah/CARS

  9. Π-nets: Deep Polynomial Neural Networksv

    論文地址:https://arxiv.org/abs/2003.03828

  10. Explaining Knowledge Distillation by Quantifying the Knowledge

    論文地址:https://arxiv.org/abs/2003.03622



超分辨率

  1. Zooming Slow-Mo: Fast and Accurate One-Stage Space-Time Video Super-Resolution

    論文地址:https://arxiv.org/abs/2002.11616

  2. Closed-loop Matters: Dual Regression Networks for Single Image Super-Resolution

    論文地址:https://arxiv.org/abs/2003.07018

    代碼:https://github.com/guoyongcs/DRN



視覺常識/其他

  1. Visual Commonsense R-CNN

    論文地址:https://arxiv.org/abs/2002.12204

    代碼:https://github.com/Wangt-CN/VC-R-CNN

  2. Scalable Uncertainty for Computer Vision with Functional Variational Inference

    論文地址:https://arxiv.org/abs/2003.03396

  3. Deep Representation Learning on Long-tailed Data: A Learnable Embedding Augmentation Perspective

    論文地址:https://arxiv.org/abs/2002.10826

  4. Representations, Metrics and Statistics For Shape Analysis of Elastic Graphs

    論文地址:https://arxiv.org/abs/2003.00287

  5. Filter Grafting for Deep Neural Networks

    論文地址:https://arxiv.org/abs/2001.05868

    代碼:https://github.com/fxmeng/filter-grafting.git

  6. 12-in-1: Multi-Task Vision and Language Representation Learning

    論文地址:https://arxiv.org/abs/1912.02315

  7. Towards Learning a Generic Agent for Vision-and-Language Navigation via Pre-training

    論文地址:https://arxiv.org/abs/2002.10638

    代碼:https://github.com/weituo12321/PREVALENT

  8. Unbiased Scene Graph Generation from Biased Training

    論文地址:https://arxiv.org/abs/2002.11949

  9. Towards Visually Explaining Variational Autoencoders

    論文地址:https://arxiv.org/abs/1911.07389

  10. BBN: Bilateral-Branch Network with Cumulative Learning for Long-Tailed Visual Recognition

    論文地址:http://www.weixiushen.com/publication/cvpr20_BBN.pdf

    代碼:https://github.com/Megvii-Nanjing/BBN

  11. High Frequency Component Helps Explain the Generalization of Convolutional Neural Networks

    論文地址:https://arxiv.org/abs/1905.13545

  12. SAM: The Sensitivity of Attribution Methods to Hyperparameters

    論文地址:http://s.anhnguyen.me/sam_cvpr2020.pdf

    代碼:https://github.com/anguyen8/sam

  13. Π− nets: Deep Polynomial Neural Networks

    論文地址:https://arxiv.org/abs/2003.03828

  14. Towards Backward-Compatible Representation Learning

    論文地址:https://arxiv.org/abs/2003.11942

  15. On Translation Invariance in CNNs: Convolutional Layers can Exploit Absolute Spatial Location

    論文地址:https://arxiv.org/abs/2003.07064

  16. KeypointNet: A Large-scale 3D Keypoint Dataset Aggregated from Numerous Human Annotations(數據集)

    論文地址:https://arxiv.org/abs/2002.12687

發表評論
所有評論
還沒有人評論,想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.
相關文章