Publications

Please visit my Google Scholar profile to check out my up-to-date publication list.

# indicates equal contributions; * indicates corresponding authors.

2024

  1. SPA-VL: A Comprehensive Safety Preference Alignment Dataset for Vision Language Model
    Yongting Zhang, Lu Chen, Guodong Zheng, and 10 more authors
    Arxiv, 2024
  2. Assessment of Multimodal Large Language Models in Alignment with Human Values
    Zhelun Shi, Zhipin Wang, Hongxing Fan, and 7 more authors
    Arxiv, 2024
  3. CodeAttack: Revealing Safety Generalization Challenges of Large Language Models via Code Completion
    Qibing Ren, Chang Gao, Jing Shao, and 4 more authors
    ACL, 2024
  4. Towards Tracing Trustworthiness Dynamics: Revisiting Pre-training Period of Large Language Models
    Chen Qian, Jie Zhang, Wei Yao, and 5 more authors
    ACL, 2024
  5. SALAD-Bench: A Hierarchical and Comprehensive Safety Benchmark for Large Language Models
    Lijun Li, Bowen Dong, Ruohui Wang, and 5 more authors
    ACL, 2024
  6. From GPT-4 to Gemini and Beyond: Assessing the Landscape of MLLMs on Generalizability, Trustworthiness and Causality through Four Modalities
    Chaochao Lu, Chen Qian, Guodong Zheng, and 33 more authors
    Technical Report, 2024
  7. PsySafe: A Comprehensive Framework for Psychological-based Attack, Defense, and Evaluation of Multi-agent System Safety
    Zaibin Zhang, Yongting Zhang, Lijun Li, and 6 more authors
    ACL, 2024

2023

  1. ChEF: A Comprehensive Evaluation Framework for Standardized Assessment of Multimodal Large Language Models
    Zhelun Shi*, Zhipin Wang*, Hongxing Fan*, and 4 more authors
    Arxiv, 2023
  2. LAMM: Language-Assisted Multi-Modal Instruction-Tuning Dataset, Framework, and Benchmark
    Zhenfei Yin*, Jiong Wang*, JianJian Cao*, and 9 more authors
    NeurIPS, 2023

2022

  1. 1st Place Solutions for RxR-Habitat Vision-and-Language Navigation Competition (CVPR 2022)
    Dong An, Zun Wang, Yangguang Li, and 5 more authors
    CVPR, 2022
  2. ERGO: Event Relational Graph Transformer for Document-level Event Causality Identification
    Meiqi Chen, Yixin Cao, Kunquan Deng, and 4 more authors
    Arxiv, 2022
  3. Democratizing Contrastive Language-Image Pre-training: A CLIP Benchmark of Data, Model, and Supervision
    Yufeng Cui, Lichen Zhao, Feng Liang, and 2 more authors
    Arxiv, 2022
  4. X-Learner: Learning Cross Sources and Tasks for Universal Visual Representation
    Yinan He, Gengshi Huang, Siyu Chen, and 7 more authors
    ECCV, 2022
  5. Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm
    Yangguang Li, Feng Liang, Lichen Zhao, and 5 more authors
    In International Conference on Learning Representations, Mar 2022
  6. MMEKG: Multi-modal Event Knowledge Graph towards Universal Representation across Modalities
    Yubo Ma, Zehao Wang, Mukai Li, and 8 more authors
    In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, May 2022
  7. Prompt for Extraction? PAIE: Prompting Argument Interaction for Event Argument Extraction
    Yubo Ma, Zehao Wang, Yixin Cao, and 4 more authors
    In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), May 2022
  8. ST-Adapter: Parameter-Efficient Image-to-Video Transfer Learning for Action Recognition
    Junting Pan, Ziyi Lin, Xiatian Zhu, and 2 more authors
    NeurIPS, May 2022
  9. Few-shot Forgery Detection via Guided Adversarial Interpolation
    Haonan Qiu, Siyu Chen, Bei Gan, and 4 more authors
    Arxiv, May 2022
  10. Task-Balanced Distillation for Object Detection
    Ruining Tang, Zhenyu Liu, Yangguang Li, and 6 more authors
    PR, May 2022
  11. RePre: Improving Self-Supervised Vision Transformer with Reconstructive Pre-training
    Luya Wang, Feng Liang, Yangguang Li, and 3 more authors
    In Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, Jul 2022
  12. SNCSE: Contrastive Learning for Unsupervised Sentence Embedding with Soft Negative Samples
    Hao Wang, Yangguang Li, Zhen Huang, and 3 more authors
    ICIC, Jul 2022
  13. Bamboo: Building Mega-Scale Vision Dataset Continually with Human-Machine Synergy
    Yuanhan Zhang, Qinghong Sun, Yichun Zhou, and 7 more authors
    Arxiv, Jul 2022
  14. Benchmarking Omni-Vision Representation through the Lens of Visual Realms
    Yuanhan Zhang, Zhenfei Yin, Jing Shao, and 1 more author
    ECCV, Jul 2022
  15. Robust Face Anti-Spoofing with Dual Probabilistic Modeling
    Yuanhan Zhang, Yichao Wu, Zhenfei Yin, and 2 more authors
    Arxiv, Jul 2022

2021

  1. Actor-Context-Actor Relation Network for Spatio-Temporal Action Localization
    Junting Pan, Siyu Chen, Mike Zheng Shou, and 3 more authors
    In 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Jun 2021
  2. ForgeryNet - Face Forgery Analysis Challenge 2021: Methods and Results
    Yinan He, Lu Sheng, Jing Shao, and 19 more authors
    CoRR, Jun 2021
  3. ForgeryNet: A Versatile Benchmark for Comprehensive Forgery Analysis
    Yinan He, Bei Gan, Siyu Chen, and 6 more authors
    In 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Jun 2021
  4. A Simple Long-Tailed Recognition Baseline via Vision-Language Model
    Teli Ma, Shijie Geng, Mengmeng Wang, and 5 more authors
    CoRR, Jun 2021
  5. Few-Shot Domain Expansion for Face Anti-Spoofing
    Bowen Yang, Jing Zhang, Zhenfei Yin, and 1 more author
    CoRR, Jun 2021
  6. BlockQNN: Efficient Block-Wise Neural Network Architecture Generation
    Zhao Zhong, Zichen Yang, Boyang Deng, and 4 more authors
    IEEE Transactions on Pattern Analysis and Machine Intelligence, Jul 2021

2020

  1. 1st place solution for AVA-Kinetics Crossover in AcitivityNet Challenge 2020
    Siyu Chen, Junting Pan, Guanglu Song, and 6 more authors
    CoRR, Jul 2020
  2. High-Quality Video Generation from Static Structural Annotations
    Lu Sheng, Junting Pan, Jiaming Guo, and 2 more authors
    International Journal of Computer Vision, Nov 2020
  3. Morphing and Sampling Network for Dense Point Cloud Completion
    Minghua Liu, Lu Sheng, Sheng Yang, and 2 more authors
    In Proceedings of the AAAI Conference on Artificial Intelligence, Apr 2020
  4. CelebA-Spoof: Large-Scale Face Anti-spoofing Dataset with Rich Annotations
    Yuanhan Zhang, Zhenfei Yin, Yidong Li, and 4 more authors
    In Computer Vision – ECCV 2020, Apr 2020
  5. Learning Connectivity of Neural Networks from a Topological Perspective
    Kun Yuan, Quanquan Li, Jing Shao, and 1 more author
    In Computer Vision – ECCV 2020, Apr 2020
  6. Powering One-Shot Topological NAS with Stabilized Share-Parameter Proxy
    Ronghao Guo, Chen Lin, Chuming Li, and 4 more authors
    In Computer Vision – ECCV 2020, Apr 2020
  7. Thinking in Frequency: Face Forgery Detection by Mining Frequency-Aware Clues
    Yuyang Qian, Guojun Yin, Lu Sheng, and 2 more authors
    In Computer Vision – ECCV 2020, Apr 2020
  8. PV-NAS: Practical Neural Architecture Search for Video Recognition
    Zihao Wang, Chen Lin, Lu Sheng, and 2 more authors
    CoRR, Apr 2020

2019

  1. Unsupervised Bi-directional Flow-based Video Generation from one Snapshot
    Lu Sheng, Junting Pan, Jiaming Guo, and 3 more authors
    CoRR, Apr 2019
  2. Improving Referring Expression Grounding With Cross-Modal Attention-Guided Erasing
    Xihui Liu, Zihao Wang, Jing Shao, and 2 more authors
    In 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Jun 2019
  3. Learning to Predict Layout-to-image Conditional Convolutions for Semantic Image Synthesis
    Xihui Liu, Guojun Yin, Jing Shao, and 2 more authors
    In Advances in Neural Information Processing Systems, Jun 2019
  4. Video Generation From Single Semantic Label Map
    Junting Pan, Chengyu Wang, Xu Jia, and 4 more authors
    In 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Jun 2019
  5. CAMP: Cross-Modal Adaptive Message Passing for Text-Image Retrieval
    Zihao Wang, Xihui Liu, Hongsheng Li, and 4 more authors
    In 2019 IEEE/CVF International Conference on Computer Vision (ICCV), Oct 2019
  6. Context and Attribute Grounded Dense Captioning
    Guojun Yin, Lu Sheng, Bin Liu, and 3 more authors
    In 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Jun 2019
  7. Semantics Disentangling for Text-To-Image Generation
    Guojun Yin, Bin Liu, Lu Sheng, and 3 more authors
    In 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Jun 2019

2018

  1. Multi-Label Image Classification via Knowledge Distillation from Weakly-Supervised Detection
    Yongcheng Liu, Lu Sheng, Jing Shao, and 3 more authors
    In Proceedings of the 26th ACM international conference on Multimedia, Jun 2018
  2. Localization Guided Learning for Pedestrian Attribute Recognition
    Pengze Liu, Xihui Liu, Junjie Yan, and 1 more author
    In British Machine Vision Conference 2018, BMVC 2018, Sep 2018
  3. Exploring Disentangled Feature Representation Beyond Face Identification
    Yu Liu, Fangyin Wei, Jing Shao, and 3 more authors
    In 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Jun 2018
  4. Improving Deep Visual Representation for Person Re-identification by Global and Local Image-language Association
    Dapeng Chen, Hongsheng Li, Xihui Liu, and 4 more authors
    In Computer Vision – ECCV 2018, Jun 2018
  5. Show, Tell and Discriminate: Image Captioning by Self-retrieval with Partially Labeled Data
    Xihui Liu, Hongsheng Li, Jing Shao, and 2 more authors
    In Computer Vision – ECCV 2018, Jun 2018
  6. Transductive Centroid Projection for Semi-supervised Large-Scale Recognition
    Yu Liu, Guanglu Song, Jing Shao, and 2 more authors
    In Computer Vision – ECCV 2018, Jun 2018
  7. Zoom-Net: Mining Deep Feature Interactions for Visual Relationship Recognition
    Guojun Yin, Lu Sheng, Bin Liu, and 4 more authors
    In Computer Vision – ECCV 2018, Jun 2018
  8. Avatar-Net: Multi-scale Zero-Shot Style Transfer by Feature Decoration
    Lu Sheng, Ziyi Lin, Jing Shao, and 1 more author
    In 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Jun 2018
  9. Practical Block-Wise Neural Network Architecture Generation
    Zhao Zhong, Junjie Yan, Wei Wu, and 2 more authors
    In 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Jun 2018

2017

  1. Orientation Invariant Feature Embedding and Spatial Temporal Regularization for Vehicle Re-identification
    Zhongdao Wang, Luming Tang, Xihui Liu, and 7 more authors
    In 2017 IEEE International Conference on Computer Vision (ICCV), Jun 2017
  2. HydraPlus-Net: Attentive Deep Features for Pedestrian Analysis
    Xihui Liu, Haiyu Zhao, Maoqing Tian, and 5 more authors
    In 2017 IEEE International Conference on Computer Vision (ICCV), Oct 2017
  3. Crowded Scene Understanding by Deeply Learned Volumetric Slices
    Jing Shao, Chen Change Loy, Kai Kang, and 1 more author
    IEEE Transactions on Circuits and Systems for Video Technology, Mar 2017
  4. Learning Scene-Independent Group Descriptors for Crowd Understanding
    Jing Shao, Chen Change Loy, and Xiaogang Wang
    IEEE Transactions on Circuits and Systems for Video Technology, Jun 2017
  5. Spindle Net: Person Re-identification with Human Body Region Guided Feature Decomposition and Fusion
    Haiyu Zhao, Maoqing Tian, Shuyang Sun, and 5 more authors
    In 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Jul 2017

2016

  1. Slicing Convolutional Neural Network for Crowd Video Understanding
    Jing Shao, Chen Change Loy, Kai Kang, and 1 more author
    In 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Jun 2016

2015

  1. Deeply learned attributes for crowded scene understanding
    Jing Shao, Kai Kang, Chen Change Loy, and 1 more author
    In 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Jun 2015

2014

  1. Scene-Independent Group Profiling in Crowd
    Jing Shao, Chen Change Loy, and Xiaogang Wang
    In 2014 IEEE Conference on Computer Vision and Pattern Recognition, Jun 2014