Jiali Chen      

Currently, Jiali Chen is the second-year direct Ph.D. student at Key Laboratory of Big Data and Intelligent Robot of South China University of Technology (SCUT), supervised by Prof. Yi Cai. He works closely with Dr. Jiayuan Xie at Hong Kong Polytechnic University (PolyU). Before that, He also obtained the B.E. degree in Department of Software Engineering from South China University of Technology (SCUT) in 2023. His research interests revolve around Causal Inference, Multimodal Reasoning and Vision & Language.

Feel free to contact me if you're interested in discussing or seeking potential collaborations.

Email  /  Google Scholar  /  Github

News

  • [2025/01]   1 paper is accepted by NAACL 2025!
  • [2024/07]   2 papers are accepted by ACM MM 2024!
  • [2024/03]   1 paper is accepted by TCSVT 2024.
  • [2024/02]   1 paper is accepted by TIP 2024.
  • [2023/08]   1 paper is accepted by ACM MM 2023!

  • Education

    South China University of Technology (SCUT), China
    Honours Degree in Software Engineering      • Sep. 2019 - Jun. 2023
    Excellent Degree Dissertations of SCUT in 2023.

    South China University of Technology (SCUT), China
    Key Laboratory of Big Data and Intelligent Robot    • Sep. 2023 - Present
    Supervisor: Prof. Yi Cai

    Selected Publication [Google Scholar]
    Classic4Children: Adapting Chinese Literary Classics for Children with Large Language Model
    Jiali Chen, Xusen Hei, Yuqi Xue, Zihan Wu, Jiayuan Xie, Yi Cai
    Findings the Nations of the Americas Chapter of the ACL, NAACL 2025
    [Paperlink], [Code]
    Area: Large Language Model, Text Style

    We highlight children’s reading preferences: vivid character portrayals, concise narrative structure and appropriate readability are essential in adapting Chinese literary classics for children

    Learning to Correction: Explainable Feedback Generation for Visual Commonsense Reasoning Distractor
    Jiali Chen, Xusen Hei, Yuqi Xue, Yuancheng Wei, Jiayuan Xie, Yi Cai, Qing Li
    ACM Multimedia, ACM MM 2024
    [Paperlink], [Code]
    Area: Large Multimodal Model, New Benchmark

    We present the first work to investigate the error correction capabilities of large multimodal models (LMMs), construct a new benchmark and introduce the feedback generation task for evaluation. I would like to extend my heartfelt gratitude to my girlfriend, Ms. Wen, for inspiring the idea behind this paper.

    Deconfounded Emotion Guidance Sticker Selection with Causal Inference
    Jiali Chen, Yi Cai, Ruohang Xu, Jiexin Wang, Jiayuan Xie, Qing Li
    ACM Multimedia, ACM MM 2024
    [Paperlink]
    Area: Bias, Causal Inference, Sticker Selection

    This paper presents a Causal Knowledge-Enhanced Sticker Selection model that addresses spurious correlations in sticker selection by using a causal graph and a knowledge-enhanced approach.

    Knowledge-Guided Cross-Topic Visual Question Generation
    Hongfei Liu, Guohua Wang, Jiayuan Xie, Jiali Chen, Wenhao Fang, Yi Cai
    International Conference on Computational Linguistics, COLING 2024
    [Paperlink]
    Area: Knowledge, Cross-Topic, Text Generation

    We propose a knowledge-guided cross-topic visual question generation (KC-VQG) model to extract unseen topic-related information for question generation.

    Video Question Generation for Dynamic Changes
    Jiayuan Xie*, Jiali Chen*, Zhenghao Liu, Qingbao Huang, Yi Cai, Qing Li
    IEEE Transactions on Circuits and Systems for Video Technology, TCSVT 2024
    [Paperlink], [Code]
    Area: Video Understanding, Text Generation

    We introduce D-VQG, a difference-aware video question generation model that aims to generate questions about temporal differences in the video.

    Knowledge-Augmented Visual Question Answering with Natural Language Explanation
    IEEE Transactions on Image Processing, TIP 2024
    [Paperlink], [Code]
    Area: VQA, Multimodal Reasoning

    We introduce KICNLE, which generates consistent answer and explanation with external knowledge.

    Deconfounded Visual Question Generation with Causal Inference
    Jiali Chen, Zhenjun Guo, Jiayuan Xie, Yi Cai, Qing Li
    ACM Multimedia, ACM MM 2023
    [Paperlink], [Code]
    Area: Bias, Causal Inference, Visual Question Generation

    This study first introduces a causal perspective on VQG and adopts the causal graph to analyze spurious correlations among variables. We propose KECVQG mitigates the impact of spurious correlations for VQG.

    Academic Service

  • Conference Reviewer:   ACL, NAACL, EMNLP, ACM MM, KDD, WSDM
  • Journal Reviewer:   IEEE TPAMI, IEEE TIP

  • Honors & Scholarships

  • Principle's Scholarship of SCUT,  2024.
  • First Prize of the 17th National College Student Software Contest,  2024.
  • Excellent Degree Dissertations of SCUT (Bachelor Degree),  2023. (Extended version has been accepted by TIP 2024)
  • Global AI Challenge for Building E&M Facilities Golden Award,  2023.


  • Last updated on Jan, 2025

    This website template borrowed from [here]