ML

ML

  • 분류 전체보기 (10)
    • GDSC_yonsei (1)
    • ML session (3)
    • Reinforcement Learning (5)
  • 홈
  • 태그
  • 방명록
RSS 피드
로그인
로그아웃 글쓰기 관리

ML

컨텐츠 검색

태그

ubuntu 22.10 DataStructure Batch Gradient Descent ML pretrained model Monte Carlo cuDNN dataloader imagenet reinforcement learning Q-Learning BartoSutton BGD Cliff Problem Ricahrd S. Sutton resnet Sarsa Attention is all you need Richard S. Sutton Zero-shot Transfer

최근글

댓글

공지사항

아카이브

Dynamic Programming(1)

  • 4. Dynamic Programming

    Dynamic Programming (Model-based approach) 1. Prediction Problem (Policy evaluation) Treat Bellman equation like an update rule Could have just used a baisc linear solver but it doesn't scaled iterative DP approach applied 2. Control Problem (Policy improvement) Policy improvement theorem : If changing an action once improves the value, changing it every time will give us a better policy Policy ..

    2023.01.20
이전
1
다음
티스토리
© 2018 TISTORY. All rights reserved.

티스토리툴바