Overview
This course covers the following learning outcomes: understanding Error, Risk, and Minimum Risk Training in Neural Networks for NLP; grasping the fundamentals of Reinforcement Learning; learning about Policy Gradient, REINFORCE, and Value-based Reinforcement Learning; and exploring methods for stabilizing Reinforcement Learning.
The course teaches practical skills such as implementing Policy Gradient/REINFORCE, assigning credit for rewards, adding and calculating baselines, and estimating value functions, all in the context of Neural Networks for NLP.
The course is taught through lectures that combine theoretical explanations of Error, Risk, Minimum Risk Training, and Reinforcement Learning with practical applications and examples.
The course is intended for students and professionals interested in Neural Networks for Natural Language Processing, particularly those who want to deepen their understanding of Minimum Risk Training and Reinforcement Learning techniques in this field.
Syllabus
Intro
Problem 1: Exposure Bias
Problem 2: Disregard for Evaluation Metrics
Error
Problem: Argmax is Non-differentiable
Sampling for Risk
Adding Temperature
What is Reinforcement Learning?
Why Reinforcement Learning in NLP?
Supervised MLE
Self Training
Policy Gradient/REINFORCE
Credit Assignment for Rewards
Problems w/ Reinforcement Learning
Adding a Baseline
Calculating Baselines
Increasing Batch Size
Warm-start
When to Use Reinforcement Learning?
Action-Value Function
Estimating Value Functions
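Several syllabus topics above (Adding Temperature, Policy Gradient/REINFORCE, Adding a Baseline, Calculating Baselines) can be sketched together in a few lines of code. The example below is not taken from the course materials; it is a minimal, self-contained illustration on a toy bandit problem rather than an NLP task, and all function names and hyperparameters are illustrative assumptions:

```python
import math
import random

def softmax(logits, temperature=1.0):
    # Higher temperature flattens the distribution; lower sharpens it
    # (the "Adding Temperature" idea for controlling sampling).
    z = [l / temperature for l in logits]
    m = max(z)
    exps = [math.exp(v - m) for v in z]
    total = sum(exps)
    return [e / total for e in exps]

def sample(probs, rng):
    # Draw an action index from a categorical distribution.
    u = rng.random()
    cum = 0.0
    for i, p in enumerate(probs):
        cum += p
        if u < cum:
            return i
    return len(probs) - 1

def reinforce_bandit(rewards, steps=5000, lr=0.1, seed=0):
    """REINFORCE with a running-average baseline on a toy bandit.

    rewards[a] is the (deterministic) reward for action a; the policy
    is a softmax over one logit per action.
    """
    rng = random.Random(seed)
    logits = [0.0] * len(rewards)
    baseline = 0.0
    for _ in range(steps):
        probs = softmax(logits)
        a = sample(probs, rng)
        r = rewards[a]
        advantage = r - baseline           # baseline reduces gradient variance
        baseline += 0.05 * (r - baseline)  # running average of observed rewards
        # Gradient of log pi(a) w.r.t. logit_i is (1[i == a] - probs[i]).
        for i in range(len(logits)):
            grad = (1.0 if i == a else 0.0) - probs[i]
            logits[i] += lr * advantage * grad
    return softmax(logits)

# The learned policy should concentrate on the highest-reward action (index 2).
probs = reinforce_bandit([0.1, 0.3, 1.0])
```

In a sequence-generation setting, the single reward here would be replaced by a sentence-level score (e.g. an evaluation metric on the sampled output), and the baseline by a learned value estimate, as in the later syllabus items.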
Taught by
Graham Neubig