Reinforcement Learning Introduction

팡요랩

https://www.youtube.com/@pang-yolab252

https://github.com/minyoungjun/Pang-yo

 

신인류

https://youtu.be/oykC6Y8UCpY?feature=shared

 

혁펜하임

https://youtu.be/cvctS4xWSaU?feature=shared

https://github.com/7201krap/KNU_RA

 

 

Reinforcement learning review notes 1: Concept of RL
https://sophia-su.tistory.com/125

Reinforcement learning review notes 2: Dummy Q-learning algorithm
https://sophia-su.tistory.com/126

Reinforcement learning review notes 3: Exploit & Exploration
https://sophia-su.tistory.com/127

Reinforcement learning review notes 4: Discounted future reward
https://sophia-su.tistory.com/128

Reinforcement learning review notes 5: Stochastic World
https://sophia-su.tistory.com/129

Multi-armed bandit problem

https://medium.com/noodle-ds/multi-armed-bandit-problem-3b97aa7ee906

 

reinforcement learning (1/4): overview, dynamic programming, monte carlo
https://richard-warren.github.io/blog/rl_intro_1/

reinforcement learning (2/4): value function approximation
https://richard-warren.github.io/blog/rl_intro_2/

reinforcement learning (3/4): temporal difference learning
https://richard-warren.github.io/blog/rl_intro_3/

reinforcement learning (4/4): policy gradient
https://richard-warren.github.io/blog/rl_intro_4/

 

[Reinforcement Learning] Summary of SARSA and DQN concepts

https://mengu.tistory.com/139

 

Boltzmann machine

https://helpingstar.github.io/dl/other_network/

 

 

 
