Pang-yo Lab (팡요랩)
https://www.youtube.com/@pang-yolab252
https://github.com/minyoungjun/Pang-yo
신인류
https://youtu.be/oykC6Y8UCpY?feature=shared
혁펜하임
https://youtu.be/cvctS4xWSaU?feature=shared
https://github.com/7201krap/KNU_RA
Reinforcement Learning Review Notes 1: Concept of RL
https://sophia-su.tistory.com/125
Reinforcement Learning Review Notes 2: Dummy Q-learning algorithm
https://sophia-su.tistory.com/126
Reinforcement Learning Review Notes 3: Exploit & Exploration
https://sophia-su.tistory.com/127
Reinforcement Learning Review Notes 4: Discounted future reward
https://sophia-su.tistory.com/128
Reinforcement Learning Review Notes 5: Stochastic World
https://sophia-su.tistory.com/129
Multi-armed bandit problem
https://medium.com/noodle-ds/multi-armed-bandit-problem-3b97aa7ee906
reinforcement learning (1/4): overview, dynamic programming, monte carlo
https://richard-warren.github.io/blog/rl_intro_1/
reinforcement learning (2/4): value function approximation
https://richard-warren.github.io/blog/rl_intro_2/
reinforcement learning (3/4): temporal difference learning
https://richard-warren.github.io/blog/rl_intro_3/
reinforcement learning (4/4): policy gradient
https://richard-warren.github.io/blog/rl_intro_4/
[Reinforcement Learning] Summary of SARSA and DQN Concepts
Boltzmann machine
https://helpingstar.github.io/dl/other_network/