RL and GAN for Sentence Generation and Chat-bot

(img)

  • The traditional approach is to minimize cross-entropy, which is equivalent to maximum likelihood estimation
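
To make the equivalence concrete, here is a minimal sketch in plain Python. It assumes a toy vocabulary of three tokens and made-up per-step logits; the helper names (`softmax`, `cross_entropy`, `sentence_nll`, `one_hot`) are illustrative and not taken from the lecture. With a one-hot target, the cross-entropy at each step reduces to the negative log-probability of the reference token, so summing it over a sentence gives the negative log-likelihood.

```python
import math

def softmax(logits):
    # Numerically stable softmax over a list of floats.
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def one_hot(index, size):
    # Target distribution that puts all mass on the reference token.
    return [1.0 if j == index else 0.0 for j in range(size)]

def cross_entropy(target_dist, predicted_dist):
    # H(target, predicted) = -sum_j target_j * log(predicted_j)
    return -sum(t * math.log(p)
                for t, p in zip(target_dist, predicted_dist) if t > 0)

def sentence_nll(step_logits, reference_ids):
    # Negative log-likelihood of the reference sentence under the model.
    return -sum(math.log(softmax(logits)[tok])
                for logits, tok in zip(step_logits, reference_ids))

# Hypothetical toy data: two decoding steps, vocabulary of size 3.
vocab_size = 3
step_logits = [[2.0, 0.5, -1.0], [0.1, 1.5, 0.3]]
reference_ids = [0, 1]

total_ce = sum(cross_entropy(one_hot(tok, vocab_size), softmax(logits))
               for logits, tok in zip(step_logits, reference_ids))
nll = sentence_nll(step_logits, reference_ids)

# The two quantities agree up to floating-point error.
print(abs(total_ce - nll) < 1e-9)
```

Minimizing `total_ce` over the training set is therefore the same optimization problem as maximizing the likelihood of the reference sentences.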

RL for Sentence Generation

Maximum Likelihood vs. Reinforcement Learning (implementation)
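
One common way to describe the implementation difference, sketched here under assumptions, is that maximum likelihood raises the log-probability of the reference token, while a REINFORCE-style policy gradient raises the log-probability of a token sampled from the model, scaled by a reward. The code below covers a single decoding step with a softmax output and writes the gradient with respect to the logits by hand. The function names and the reward values are hypothetical and not taken from the lecture.

```python
import math

def softmax(logits):
    # Numerically stable softmax over a list of floats.
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def log_prob_grad(logits, token):
    # d/d logits of log softmax(logits)[token] = one_hot(token) - probs
    probs = softmax(logits)
    return [(1.0 if j == token else 0.0) - p for j, p in enumerate(probs)]

def ml_gradient(logits, reference_token):
    # Maximum likelihood: ascend log p(reference token).
    return log_prob_grad(logits, reference_token)

def rl_gradient(logits, sampled_token, reward):
    # Policy gradient (single-sample estimate): ascend
    # reward * log p(sampled token). The reward is treated as a constant.
    return [reward * g for g in log_prob_grad(logits, sampled_token)]

# Hypothetical logits for one decoding step over a 3-token vocabulary.
logits = [2.0, 0.5, -1.0]

ml_grad = ml_gradient(logits, reference_token=0)

# If the sampled token equals the reference and the reward is 1,
# the two update directions coincide.
rl_same = rl_gradient(logits, sampled_token=0, reward=1.0)

# A negative reward pushes the sampled token's logit down instead.
rl_neg = rl_gradient(logits, sampled_token=1, reward=-0.5)
```

In this framing, the maximum likelihood update is the special case where every training pair is a reference pair with reward 1, so an existing cross-entropy training loop can be reused by sampling responses from the model and weighting each example's loss by its reward.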
