PointNeXt: Revisiting PointNet++ with Improved Training and Scaling Strategies

Expanding Language-Image Pretrained Models for General Video Recognition

TinyViT: Fast Pretraining Distillation for Small Vision Transformers

MiniViT: Compressing Vision Transformers with Weight Multiplexing

Cyclic Differentiable Architecture Search

Multi-Scale 2D Temporal Adjacent Networks for Moment Localization with Natural Language

Rank #1 in HACS Temporal Action Localization Challenge

AutoFormerV2: Searching the Search Space of Vision Transformer

Minghao Chen, Kan Wu, Bolin Ni, Houwen Peng*, Bei Liu, Jianlong Fu, Hongyang Chao, Haibin Ling

Probing Inter-modality: Visual Parsing with Self-Attention for Vision-and-Language Pre-training

Hongwei Xue, Yupan Huang, Bei Liu, Houwen Peng, Jianlong Fu, Houqiang Li, Jiebo Luo

Learning to Track Objects from Unlabeled Videos

Jilai Zheng, Chao Ma, Houwen Peng, Xiaokang Yang

Rethinking and Improving Relative Position Encoding for Vision Transformer

Kan Wu, Houwen Peng*, Minghao Chen, Jianlong Fu, Hongyang Chao

AutoFormer: Searching Transformers for Visual Recognition

Minghao Chen, Houwen Peng*, Jianlong Fu, Haibin Ling

Learning Spatio-Temporal Transformer for Visual Tracking

Bin Yan, Houwen Peng*, Jianlong Fu, Dong Wang, Huchuan Lu
Rank #1 in VOT-2021 Challenge RGB-D Track

LightTrack: Finding Lightweight Neural Networks for Object Tracking via One-Shot Architecture Search

Bin Yan, Houwen Peng, Kan Wu, Dong Wang, Jianlong Fu, Huchuan Lu

One-Shot Neural Ensemble Architecture Search by Diversity-Guided Search Space Shrinking

Minghao Chen, Jianlong Fu, Haibin Ling

Cream of the Crop: Distilling Prioritized Paths For One-Shot Neural Architecture Search

Houwen Peng, Hao Du, Hongyuan Yu, Qi Li, Jing Liao, Jianlong Fu

Ocean: Object-aware Anchor-free Tracking

Rank #2 in VOT-2020 Challenge Short-term and Real-Time Tracks

A Transductive Approach for Semi-Supervised Video Object Segmentation

Learning 2D Temporal Localization Networks for Moment Localization with Natural Language

Rank #1 in HACS Temporal Action Localization Challenge

Deeper and Wider Siamese Networks for Real-time Visual Tracking

Zhipeng Zhang, Houwen Peng*
Rank #1 in VOT-2019 Challenge RGB-D Track

AI Coach: Deep Human Pose Estimation and Analysis for Personalized Athletic Training Assistance

Multi-view Multi-instance Learning based on Joint Sparse Representation and Multi-view Dictionary Learning

Illumination Estimation based on Bilayer Sparse Coding

Salient Object Detection via Structured Matrix Decomposition

Predicting Image Memorability by Multi-view Adaptive Regression

RGBD Salient Object Detection: A Benchmark and Algorithms

Salient Object Detection via Low-rank and Structured Sparse Matrix Decomposition

Awards and Honors


Area Chair / Senior PC for
  • ACM International Conference on Multimedia (MM), 2021, 2022
  • AAAI Conference on Artificial Intelligence (AAAI), 2022
Reviewer / Program Committee for
  • International Conference on Learning Representations (ICLR), 2021, 2022
  • International Conference on Machine Learning (ICML), 2021, 2022
  • AAAI Conference on Artificial Intelligence (AAAI), 2019, 2020, 2021, 2022
  • Advances in Neural Information Processing Systems (NeurIPS), 2020, 2021
  • IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016, 2019, 2020, 2021, 2022
  • IEEE International Conference on Computer Vision (ICCV), 2017, 2019, 2021
  • European Conference on Computer Vision (ECCV), 2018, 2020
  • Winter Conference on Applications of Computer Vision (WACV), 2021, 2022
  • IEEE International Conference on Robotics and Automation (ICRA), 2013, 2015, 2020
  • IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
  • IEEE Transactions on Image Processing (TIP)
  • IEEE Transactions on Multimedia (TMM)
  • IEEE Transactions on Circuits and Systems for Video Technology (TCSVT)
  • Pattern Recognition (PR)