Junyi Li (李军毅)

I am a Ph.D. student jointly supervised by Prof. Jian-Yun Nie from DIRO, Université de Montréal, and Prof. Xin Zhao from GSAI, Renmin University of China. I have a broad interest in natural language processing, with an emphasis on natural language generation (NLG), especially generation based on large language models (LLMs).

Email: junyi.li at umontreal dot ca / lijunyi at ruc dot edu dot cn

CV  /  Google Scholar  /  GitHub  /  Zhihu

Zhihu Posts
Publications

* Corresponding author
† Equal contribution

Preprint

A Survey of Large Language Models
Wayne Xin Zhao, Kun Zhou†, Junyi Li†, Tianyi Tang, Xiaolei Wang, Yupeng Hou, Yingqian Min, Beichen Zhang, Junjie Zhang, Zican Dong, Yifan Du, Chen Yang, Yushuo Chen, Zhipeng Chen, Jinhao Jiang, Ruiyang Ren, Yifan Li, Xinyu Tang, Zikang Liu, Peiyu Liu, Jian-Yun Nie, Ji-Rong Wen
pdf / code

GlyphDiffusion: Text Generation as Image Generation
Junyi Li, Wayne Xin Zhao*, Jian-Yun Nie, Ji-Rong Wen
pdf / code

The Dawn After the Dark: An Empirical Study on Factuality Hallucination in Large Language Models
Junyi Li, Jie Chen, Ruiyang Ren, Xiaoxue Cheng, Wayne Xin Zhao, Jian-Yun Nie, Ji-Rong Wen
arXiv, 2024
pdf / code

2024

ChainLM: Empowering Large Language Models with Improved Chain-of-Thought Prompting
Xiaoxue Cheng, Junyi Li, Wayne Xin Zhao, Ji-Rong Wen
The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING), 2024
pdf / code

BAMBOO: A Comprehensive Benchmark for Evaluating Long Text Modeling Capacities of Large Language Models
Zican Dong, Tianyi Tang, Junyi Li, Wayne Xin Zhao, Ji-Rong Wen
The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING), 2024
pdf / code

Pretrained Language Models Based Text Generation: A Survey
Junyi Li†, Tianyi Tang†, Wayne Xin Zhao*, Jian-Yun Nie, Ji-Rong Wen
ACM Computing Surveys, 2024
pdf

2023

HaluEval: A Large-Scale Hallucination Evaluation Benchmark for Large Language Models
Junyi Li†, Xiaoxue Cheng†, Wayne Xin Zhao, Jian-Yun Nie, Ji-Rong Wen
The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
pdf / code

Learning to Imagine: Visually-Augmented Natural Language Generation
Tianyi Tang, Yushuo Chen, Yifan Du, Junyi Li, Wayne Xin Zhao*, Ji-Rong Wen
The 61st Annual Meeting of the Association for Computational Linguistics (ACL), 2023
pdf / code

MVP: Multi-task Supervised Pre-training for Natural Language Generation
Tianyi Tang, Junyi Li, Wayne Xin Zhao*, Ji-Rong Wen
Findings of The 61st Annual Meeting of the Association for Computational Linguistics (ACL), 2023
pdf / code

Zero-shot Visual Question Answering with Language Model Feedback
Yifan Du, Junyi Li, Tianyi Tang, Wayne Xin Zhao*, Ji-Rong Wen
Findings of The 61st Annual Meeting of the Association for Computational Linguistics (ACL), 2023
pdf / code

The Web Can Be Your Oyster for Improving Language Models
Junyi Li, Tianyi Tang, Wayne Xin Zhao*, Jingyuan Wang, Jian-Yun Nie, Ji-Rong Wen
Findings of The 61st Annual Meeting of the Association for Computational Linguistics (ACL), 2023
pdf / code

2022

ELMER: A Non-Autoregressive Pre-trained Language Model for Efficient and Effective Text Generation
Junyi Li, Tianyi Tang, Wayne Xin Zhao*, Jian-Yun Nie, Ji-Rong Wen
The 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
pdf / code

TextBox 2.0: A Text Generation Library with Pre-trained Language Models
Tianyi Tang†, Junyi Li†, Zhipeng Chen†, Yiwen Hu, Zhuohao Yu, Wenxun Dai, Wayne Xin Zhao*, Jian-Yun Nie, Ji-Rong Wen
The 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022, System Demonstration
pdf / code

Context-Tuning: Learning Contextualized Prompts for Natural Language Generation
Tianyi Tang, Junyi Li, Wayne Xin Zhao*, Ji-Rong Wen
The 29th International Conference on Computational Linguistics (COLING), 2022
pdf / code

A Survey of Vision-Language Pre-Trained Models
Yifan Du†, Zikang Liu†, Junyi Li, Wayne Xin Zhao*
The 31st International Joint Conference on Artificial Intelligence (IJCAI), 2022, Survey Track
pdf

ElitePLM: An Empirical Study on General Language Ability Evaluation of Pretrained Language Models
Junyi Li, Tianyi Tang, Zheng Gong, Lixin Yang, Zhuohao Yu, Zhipeng Chen, Jingyuan Wang, Wayne Xin Zhao*, Ji-Rong Wen
The North American Chapter of the Association for Computational Linguistics (NAACL), 2022
pdf / code

Learning to Transfer Prompts for Text Generation
Junyi Li, Tianyi Tang, Jian-Yun Nie, Ji-Rong Wen, Wayne Xin Zhao*
The North American Chapter of the Association for Computational Linguistics (NAACL), 2022
pdf / code

2021

Generating Long and Coherent Text with Multi-Level Generative Adversarial Networks
Tianyi Tang, Junyi Li, Wayne Xin Zhao*, Ji-Rong Wen
The 5th APWeb-WAIM International Joint Conference on Web and Big Data (APWeb-WAIM), 2021
pdf / code

TextBox: A Unified, Modularized, and Extensible Framework for Text Generation
Junyi Li†, Tianyi Tang†, Gaole He, Jinhao Jiang, Xiaoxuan Hu, Puzhao Xie, Zhipeng Chen, Zhuohao Yu, Wayne Xin Zhao*, Ji-Rong Wen
The 59th Annual Meeting of the Association for Computational Linguistics (ACL), 2021, System Demonstration
pdf / code

Few-shot Knowledge Graph-to-Text Generation with Pretrained Language Models
Junyi Li, Tianyi Tang, Wayne Xin Zhao*, Ji-Rong Wen
Findings of The 59th Annual Meeting of the Association for Computational Linguistics (ACL), 2021
pdf / code

Pretrained Language Model for Text Generation: A Survey
Junyi Li†, Tianyi Tang†, Wayne Xin Zhao*, Ji-Rong Wen
The 30th International Joint Conference on Artificial Intelligence (IJCAI), 2021, Survey Track
pdf

Knowledge-based Review Generation by Coherence Enhanced Text Planning
Junyi Li, Wayne Xin Zhao*, Zhicheng Wei, Nicholas Jing Yuan, Ji-Rong Wen
The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2021
pdf / code

2020

Knowledge-Enhanced Personalized Review Generation with Capsule Graph Neural Network
Junyi Li, Siqing Li, Wayne Xin Zhao*, Gaole He, Zhicheng Wei, Nicholas Jing Yuan, Ji-Rong Wen
The 29th ACM International Conference on Information and Knowledge Management (CIKM), 2020
pdf / code

Mining Implicit Entity Preference from User-Item Interaction Data for Knowledge Graph Completion via Adversarial Learning
Gaole He, Junyi Li, Wayne Xin Zhao*, Peiju Liu, Ji-Rong Wen
International World Wide Web Conference (WWW), 2020
pdf / code

2019

Generating Long and Informative Reviews with Aspect-Aware Coarse-to-Fine Decoding
Junyi Li, Wayne Xin Zhao*, Ji-Rong Wen, Yang Song
Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL), 2019
pdf / code

Open Source Projects

(Most of my research work is open-source. Here are some of my favorite projects!)
  • TextBox
    A unified, comprehensive and efficient framework for reproducing and developing text generation algorithms, covering more than 20 base models and nearly 10 benchmarks.
  • HaluEval
    A hallucination evaluation benchmark for large language models. HaluEval includes 5,000 general user queries with ChatGPT responses and 30,000 task-specific examples from three tasks, i.e., question answering, knowledge-grounded dialogue, and text summarization.

Professional Services

  • Reviewer
    • Journal: TALLIP, ACM Computing Surveys, Computational Intelligence
    • Conference: AAAI 2021-23, IJCAI 2021-23, KDD 2021, EMNLP 2022, COLING 2022, ACL 2023
  • Chair
    • CSSNLP 2020 (Co-Chair)

Selected Awards and Honors

  • National Scholarship for Graduate Students (top 2% of students), Ministry of Education of P.R. China, 2021
  • SIGIR Student Travel Grant (CIKM 2020)
  • National Scholarship for Graduate Students (top 2% of students), Ministry of Education of P.R. China, 2019
  • China Undergraduate Mathematical Contest in Modeling, Second Prize in Beijing Contest District, 2016

Education

  • Ph.D. student in Artificial Intelligence, Renmin University of China & Université de Montréal, 2020-present
  • M.Sc. in Computer Application, Renmin University of China, 2018-2020
  • B.Sc. in Computer Science, Renmin University of China, 2014-2018