SimpleRL-Zoo and B-STaR: Improving reasoning performance and efficiency through reinforcement learning

Date:

Invited talk on SimpleRL-Zoo and B-STaR: Improving reasoning performance and efficiency through reinforcement learning.