CS234 Reinforcement Learning
🤷

Team members
Team topic
Reinforcement learning

어쩌다 강화학습 ("Somehow, Reinforcement Learning")

🤷
A wiki by three people who somehow ended up studying reinforcement learning

Authors

고영희(꼬영)
Ewha Womans University, Graduate School, Statistics
양지승(지승)
Ajou University, Department of Industrial Engineering
홍지우(루)
Ewha Womans University, Computer Science and Engineering major

κ°•μ˜ μ†Œκ°œ

CS234: Reinforcement Learning
A reinforcement learning course from the Stanford Artificial Intelligence Laboratory

WIKIPEDIA

Lecture 1_Introduction
Lecture 2_Given a Model of the World
Lecture 3_Model-Free Policy Evaluation
Lecture 4_Model-Free Control
Lecture 5_Value Function Approximation
Lecture 6_CNNs and Deep Q Learning
Lecture 7_Imitation Learning in Large State Spaces