Yang Liu (刘阳)

IEEE Member, CCF Member, CAAI Member, CSIG Member
Email: liuy856@mail.sysu.edu.cn
Office: School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, China

I am currently a research associate professor at HCP Lab, School of Computer Science and Engineering, Sun Yat-sen University, working with Prof. Liang Lin. From 2019-2021, I worked as a posdoctoral fellow at School of Computer Science and Engineering, Sun Yat-sen University. I obtained my Ph.D degree of Telecommunications and Information Systems in June 2019 at Phoenix-eye (XD145) Lab, School of Telecommunications Engineering, Xidian University, advised by Prof. Zhaoyang Lu, Prof. Jing Li and Prof. Tao Yang. I serve as the reviewer of IEEE T-PAMI, T-IP, T-NNLS, T-MM, T-CSVT, CVPR, ICCV, AAAI, ACM MM, ECCV, etc.
[Chinese Version]

  • Computer Vision: Multi-modal Reasoning, Robust and Efficient Spatial-temporal Representation Learning
  • Machine Learning: Causality Inference, Self-supervised Learning, Transfer Learning
  • Urban Computing: Spatial-temporal Prediction, Traffic Signal Control

  • CCF ChinaSoft 2023 Robotic Big Model and Embodied Intelligence Challenge, Third Prize, 2023.
  • The Third Guangdong Province Young Computer Science Academic Show, First Prize, 2023.
  • National Scholarship for PhD Students, 2018.
  • Excellent PhD Students, 2018.
  • National English Competition, Third Prize, 2013.
  • National Mathematical Modeling Competition, Second Prize, 2012.


News

  • 2024-04: The book of multimodal large model (《多模态大模型:新一代人工智能技术范式》) is published!
  • 2023-12: I won the third prize of CCF ChinaSoft 2023 Robotic Big Model and Embodied Intelligence Challenge!
  • 2023-11: One first-author T-IP paper is selected as the ESI Hot Cited Paper!
  • 2023-11: One first-author T-IP paper is selected as the ESI Highly Cited Paper!
  • 2023-10: One accepted ACM MM 2023 paper is recommended as an oral!
  • 2023-10: An invention patent has been granted.
  • 2023-10: I have been selected as a member of the CSIG Visual Big Data Committee.
  • 2023-07: One paper is accepted by ACM MM 2023!
  • 2023-07: Two papers are accepted by ICCV 2023!
  • 2023-06: One paper is accepted by IEEE Transactions on Pattern Analysis and Machine Intelligence!
  • 2023-06: I won the first prize in the Third Guangdong Province Young Computer Science Academic Show.
  • 2023-08: One paper is accepted by PRCV 2023.
  • 2023-06: One paper is accepted by ICIP 2023.
  • 2023-04: One paper is accepted by IJCAI 2023!
  • 2023-03: The open-source framework for causal discovery and visual-linguistic reasoning Causal-VLReasoning is online!
  • 2023-03: I become a member of CAAI (Chinese Association for Artificial Intelligence).
  • 2023-03: I become a member of CCF (China Computer Federation).
  • 2023-03: I am invited as the Program Committee (PC) Member for the Computer Graphics International (CGI) 2023.
  • 2023-03: One paper is accepted by Information Sciences.
  • 2022-12: One review paper about causal reasoning is reported in social media by Machine Intelligence Research.
  • 2022-11: I received the funding from Guangdong Basic and Applied Basic Research Foundation 2023.
  • 2022-11: One review paper for causal reasoning is published online by Machine Intelligence Research.
  • 2022-11: I am invited as the Program Committee (PC) Member for the Computational Visual Media Conference 2023.
  • 2022-11: I am invited as the reviewer of IEEE/CVF Conference on Computer Vision and Pattern Recognition 2023.
  • 2022-09: I become a member of IEEE Industrial Electronics Society.
  • 2022-08: My TCGL paper published in IEEE T-IP is invited to be exhibited as the poster in Valse 2022.
  • 2022-08: One review paper about causal reasoning is accepted by Machine Intelligence Research.
  • 2022-08: I am invited as the Program Committee (PC) Member for the AAAI 2023.
  • 2022-06: I become a member of CSIG.
  • 2022-05: One paper is accepted by IEEE Transactions on Industrial Informatics.
  • 2022-04: I received the funding from Science and Technology Projects in Guangzhou.
  • 2022-03: One paper is accepted by CVPR 2022 as an oral presentation.
  • 2022-01: One paper is accepted by IEEE Transactions on Image Processing. Code is available.
  • 2022-01: One paper is accepted by ICASSP 2022.
  • 2022-01: The code for TCGL is available.
  • 2021-12: I am invited as the reviewer of Chinese Conference on Pattern Recognition and Computer Vision (PRCV) 2022.
  • 2021-11: I am invited as the reviewer of IEEE/CVF Conference on Computer Vision and Pattern Recognition 2022.
  • 2021-10: I start working as a research associate professor at Sun-Yat-Sen University.
  • 2021-05: One paper is accepted by IEEE Transactions on Image Processing. Code is available.
  • 2021-04: The code for SAKDN is available.
  • 2021-02: I become a Member of the IEEE.
  • 2020-12: I am invitied as the reviewing expert for NSFC.
  • 2020-11: I received the funding from National Natural Science Foundation of Guangdong Province of China 2021.
  • 2020-09: I received the funding from National Natural Science Foundation of China 2021.
  • 2020-06: I received the funding from China Postdoctoral Science Foundation 2020.
  • 2020-03: I received the funding from Fundamental Research of the Central Universities 2020.
  • 2020-01: My Ph.D Thesis (psw:ithd) is available now.
  • 2019-10: Codes and datasets for IEEE TIP is available.
  • 2019-10: One paper is accepted by IEEE Transactions on Image Processing.
  • 2019-08: I start working as a postdoctoral fellow at Sun-Yat-Sen University.
  • 2019-06: I obtain my Ph.D degree from Xidian University.

Toolbox and Benchmark

  1. CausalVLR: A Toolbox and Benchmark for Visual-Linguistic Causal Reasoning
    Yang Liu, Weixing Chen, Guanbin Li, Liang Lin
    GitHub stars GitHub forks GitHub issues Research topic: Causal Reasoning, Visual-Linguistic Reasoning, Open-source Framework and Benchmark

Demo of Embodied AI

  1. Embodied Interactive Robot
  2. Embodied Interactive Agent for the Coffee Scene

Book

  1. Multimodal Large Models: A New Generation of Artificial Intelligence Technology Paradigm
    多模态大模型:新一代人工智能技术范式
    Publishing House of Electronics Industry (PHE), ISBN 978-7-121-47547-4, 2024.
    电子工业出版社,ISBN 978-7-121-47547-4, 2024.
    Yang Liu, Liang Lin
    GitHub stars GitHub forks GitHub issues

Selected Papers

  1. Cross-Modal Causal Relational Reasoning for Event-Level Visual Question Answering
    Yang Liu, Guanbin Li, Liang Lin
    IEEE Transactions on Pattern Analysis and Machine Intelligence (T-PAMI), 2023. (JCR Q1, CCF-A)
    GitHub stars GitHub forks GitHub issues Research topic: Causal Reasoning, Video Question Answering
  2. MEIA: Towards Realistic Multimodal Interaction and Manipulation for Embodied Robots
    Yang Liu, Xinshuai Song, Kaixuan Jiang, Weixing Chen, Jingzhou Luo, Guanbin Li, Liang Lin
    Research topic: Embodied AI, Agent, Robotics
  3. CausalGPT: Illuminating Faithfulness and Causality for Knowledge-based Reasoning with LLMs
    Ziyi Tang, Ruilin Wang, Weixing Chen, Yongsen Zheng, Yang Liu, Keze Wang, Tianshui Chen, Liang Lin
    GitHub stars GitHub forks GitHub issues Research topic: Causal Reasoning, Chain-of-Thought
  4. Cross-Modal Causal Intervention for Medical Report Generation
    Weixing Chen, Yang Liu, Ce Wang, Jiarui Zhu, Shen Zhao, Guanbin Li, Cheng-Lin Liu, Liang Lin
    [Code & Dataset] [BibTex]

    GitHub stars GitHub forks GitHub issues Research topic: Causal Reasoning, Medical Report Generation
  5. Visual Causal Scene Refinement for Video Question Answering
    Yushen Wei*, Yang Liu*, Hong Yan, Guanbin Li, Liang Lin
    ACM International Conference on Multimedia (ACM MM), 2023. (* indicates co-first author) (CCF-A, Oral)
    GitHub stars GitHub forks GitHub issues Research topic: Causal Discovery, Cross-Modal Question Reasoning
  6. SkeletonMAE: Graph-based Masked Autoencoder for Skeleton Sequence Pre-training
    Hong Yan, Yang Liu, Yushen Wei, Zhen Li, Guanbin Li, Liang Lin
    IEEE/CVF International Conference on Computer Vision (ICCV), 2023. (CCF-A)
    GitHub stars GitHub forks GitHub issues Research topic: Self-supervised Learning, Skeleton Action Recognition
  7. Enhanced Soft Label for Semi-Supervised Semantic Segmentation
    Jie Ma, Chuan Wang, Yang Liu, Liang Lin, Guanbin Li
    IEEE/CVF International Conference on Computer Vision (ICCV), 2023. (CCF-A)
    GitHub stars GitHub forks GitHub issues Research topic: Semantic Segmentationg, Semi-Supervised Learning
  8. DenseLight: Efficient Control for Large-scale Traffic Signals with Dense Feedback
    Junfan Lin, Yuying Zhu, Lingbo Liu, Yang Liu, Guanbin Li, Liang Lin
    International Joint Conference on Artificial Intelligence (IJCAI), 2023. (CCF-A)
    GitHub stars GitHub forks GitHub issues Research topic: Traffic Signal Control, Reinforcement Learning from Human Feedback (RLHF)
  9. Self-Supervised Contrastive Learning for Audio-Visual Action Recognition
    Yang Liu, Ying Tan, Haoyuan Lan
    IEEE International Conference on Image Processing (ICIP), 2023.
    Research topic: Audio-visual Self-supervised Representation Learning
  10. VCD: Visual Causality Discovery for Cross-Modal Question Reasoning
    Yang Liu, Ying Tan, Jingzhou Luo, Weixing Chen
    Chinese Conference on Pattern Recognition and Computer Vision (PRCV), 2023.
    Research topic: Causal Inference, Visual Question Answering
  11. Urban Regional Function Guided Traffic Flow Prediction
    Kuo Wang, Lingbo Liu, Yang Liu, Guanbin Li, Liang Lin
    Information Sciences (INS), 2023. (JCR Q1, CCF-B)
    Research topic: Traffic Flow Prediction, Spatial-temporal Representation Learning
  12. Dual adversarial adaptation for cross-device real-world image super-resolution
    Xiaoqian Xu, Pengxu Wei, Weikai Chen, Yang Liu, Mingzhi Mao, Liang Lin, Guanbin Li
    IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022. (CCF-A, Oral)
    GitHub stars GitHub forks GitHub issues Research topic: Image Super-resolution
  13. Causal Reasoning Meets Visual Representation Learning: A Prospective Study
    Yang Liu, Yushen Wei, Hong Yan, Guanbin Li, Liang Lin
    Machine Intelligence Research (MIR), 2022.
    GitHub stars GitHub forks GitHub issues Research topic: Causal Reasoning, Spatial-temporal Representation Learning
  14. Hybrid-Order Representation Learning for Electricity Theft Detection
    Yuying Zhu, Yang Zhang, Lingbo Liu, Yang Liu, Guanbin Li, Mingzhi Mao, Liang Lin
    IEEE Transactions on Industrial Informatics (T-II), 2022. (JCR Q1)
    GitHub stars GitHub forks GitHub issues Research topic: Spatial-temporal Anomaly Detection
  15. Cross-modal knowledge distillation for Vision-to-Sensor action recognition
    Jianyuan Ni, Raunak Sarbajna, Yang Liu, Anne HH Ngu, Yan Yan
    IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022. (CCF-B)
    Research topic: Multi-modal Spatial-temporal Knowledge Transfer
  16. TCGL: Temporal Contrastive Graph for Self-supervised Video Representation Learning
    Yang Liu, Keze Wang, Lingbo Liu, Haoyuan Lan, Liang Lin
    IEEE Transactions on Image Processing (T-IP), 2022. (ESI Highly Cited Paper & ESI Hot Cited Paper, JCR Q1, CCF-A)
    GitHub stars GitHub forks GitHub issues Research topic: Self-supervised Spatial-temporal Representation Learning
  17. Semantics-aware Adaptive Knowledge Distillation for Sensor-to-Vision Action Recognition
    Yang Liu, Keze Wang, Guanbin Li, Liang Lin
    IEEE Transactions on Image Processing (T-IP), 2021. (JCR Q1, CCF-A)
    GitHub stars GitHub forks GitHub issues Research topic: Multi-modal Spatial-temporal Knowledge Transfer
  18. Deep Image-to-Video Adaptation and Fusion Networks for Action Recognition
    Yang Liu, Zhaoyang Lu, Jing Li, Tao Yang, Chao Yao
    IEEE Transactions on Image Processing (T-IP), 2020. (JCR Q1, CCF-A)
    GitHub stars GitHub forks GitHub issues Research topic: Multi-modal Spatial-temporal Knowledge Transfer
  19. /
  20. Hierarchically Learned View-Invariant Representations for Cross View Action Recognition
    Yang Liu, Zhaoyang Lu, Jing Li, Tao Yang
    IEEE Transactions on Circuits and Systems for Video Technology (T-CSVT), 2019. (JCR Q1, CCF-B)
    GitHub stars GitHub forks GitHub issues Research topic: Multi-view Spatial-temporal Knowledge Transfer
  21. Global Temporal Representation based CNNs for Infrared Action Recognition
    Yang Liu, Zhaoyang Lu, Jing Li, Tao Yang, Chao Yao
    IEEE Signal Processing Letters (SPL), 2018.
    GitHub stars GitHub forks GitHub issues Research topic: Heterogenous Spatial-temporal Representation Learning
  22. Transferable feature representation for visible-to-infrared cross-dataset human action recognition
    Yang Liu, Zhaoyang Lu, Jing Li, Chao Yao, Yanzi Deng
    Complexity, 2018.
    GitHub stars GitHub forks GitHub issues Research topic: Heterogenous Spatial-temporal Representation Learning
  23. Combining multiple features for cross-domain face sketch recognition
    Yang Liu, Jing Li, ZhaoYang Lu, Tao Yang, ZiJian Liu
    Chinese Conference on Biometric Recognition (CCBR), 2016.
    Research topic: Heterogenous Face-sketch Representation Learning

PhD Dissertation

  1. Cross-domain Human Action Recognition via Transfer Learning (基于迁移学习的跨域人体行为识别研究)
    PhD Dissertation (博士学位论文), Xidian University (西安电子科技大学), June 30, 2019.
    Yang Liu
    Supervisor:Prof. Zhaoyang Lu

Academic services

    Reviewer for Journals
  • Advanced Science
  • IEEE Transactions on Pattern Analysis and Machine Intelligence
  • IEEE Transactions on Image Processing
  • IEEE Transactions on Neural Networks and Learning Systems
  • IEEE Transactions on Cybernetics
  • IEEE Transactions on Multimedia
  • IEEE Transactions on Circuits and Systems for Video Technology
  • IEEE Transactions on Human-Machine Systems
  • IEEE Signal Processing Letters
  • IEEE Robotics and Automation Letters
  • IET Computer Vision
  • Pattern Recognition
  • Pattern Recognition Letters
  • Machine Vision and Applications
  • Journal of Visual Communication and Image Representation
  • Visual Computer
    Area Chair (AC)/Program Committee (PC)/Reviewer for Conferences
  • IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
  • IEEE/CVF International Conference on Computer Vision (ICCV)
  • European Conference on Computer Vision (ECCV)
  • ACM Multimedia (ACM MM)
  • AAAI Conference on Artificial Intelligence (AAAI)
  • IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
  • Computational Visual Media Conference (CVM)
  • Computer Graphics International (CGI)
  • Chinese Conference on Pattern Recognition and Computer Vision (PRCV)
  • ACM international joint conference on pervasive and ubiquitous computing (UbiComp)
  • IEEE International Semantic Web Conference (ISWC)
  • IEEE International Symposium on Circuits and Systems (ISCAS)
  • International AAAI Conference on Web and Social Media (ICWSM)
  • CCF BigData Conference (CCF BigData)
    Reviewer for Fundings
  • National Natural Science Foundation of China (NSFC)