ML

ML

  • 분류 전체보기 (10)
    • GDSC_yonsei (1)
    • ML session (3)
    • Reinforcement Learning (5)
  • 홈
  • 태그
  • 방명록
RSS 피드
로그인
로그아웃 글쓰기 관리

ML

컨텐츠 검색

태그

ubuntu 22.10 reinforcement learning dataloader cuDNN Ricahrd S. Sutton resnet Cliff Problem ML BGD BartoSutton Sarsa Monte Carlo Attention is all you need imagenet pretrained model Richard S. Sutton Zero-shot Transfer Batch Gradient Descent DataStructure Q-Learning

최근글

댓글

공지사항

아카이브

Dynamic Programming(1)

  • 4. Dynamic Programming

    Dynamic Programming (Model-based approach) 1. Prediction Problem (Policy evaluation) Treat Bellman equation like an update rule Could have just used a baisc linear solver but it doesn't scaled iterative DP approach applied 2. Control Problem (Policy improvement) Policy improvement theorem : If changing an action once improves the value, changing it every time will give us a better policy Policy ..

    2023.01.20
이전
1
다음
티스토리
© 2018 TISTORY. All rights reserved.

티스토리툴바