CIS 5210 - Module 8 - Reinforcement Learning
Listen now
Description
This episode explores reinforcement learning and its relationship to MDPs. Also mentioned: exploration v. exploitation, multi-arm bandits, model-free learning, q-learning. Disclosure: This episode was generated using NotebookLM by uploading Professor Chris Callison-Burch's lecture notes and slides.
More Episodes
This episode explores MDPs, covering stochastic environments, transition functions, reward functions, policies, value iteration, policy iteration, expected utility, finite vs. infinite horizons, discount factors, etc. Disclosure: This episode was generated using NotebookLM by uploading Professor...
Published 10/05/24
Published 10/05/24
This episode explores knowledge-based agents in AI, covering knowledge bases, inference, propositional logic, theorem proving, logical equivalence, resolution, conjunctive normal form (CNF), proof by contradiction, and distributed knowledge representation and reasoning. Disclosure: This episode...
Published 09/29/24