Posts by Collection

portfolio

publications

Seen to Unseen: Exploring Compositional Generalization of Multi-Attribute Controllable Dialogue Generation

Published in ACL 2023 Main Conference, 2023

Exploring compositional generalization of multi-attribute controllable dialogue generation.

Recommended citation: Weihao Zeng, Lulu Zhao, Keqing He, Ruotong Geng, Jingang Wang, Wei Wu, Weiran Xu. (2023). "Seen to Unseen: Exploring Compositional Generalization of Multi-Attribute Controllable Dialogue Generation." ACL 2023 Main Conference.
Download Paper

7B Model and 8K Examples: Emerging Reasoning with Reinforcement Learning is Both Effective and Efficient

Published in Preprint, 2025

Demonstrating that emerging reasoning with reinforcement learning is both effective and efficient using a 7B model and 8K examples.

Recommended citation: Weihao Zeng*, Yuzhen Huang*, Wei Liu, Keqing He, Qian Liu, Zejun Ma, Junxian He. (2025). "7B Model and 8K Examples: Emerging Reasoning with Reinforcement Learning is Both Effective and Efficient." Preprint.
Download Paper

talks

teaching