Sl.No | Chapter Name | MP4 Download |
---|---|---|
1 | Tutorial 1 - Probability Basics 1 | Download |
2 | Tutorial 1-Probability basics2 | Download |
3 | Tutorial 2-Linear algebra-1 | Download |
4 | Tutorial 2-Linear algebra-2 | Download |
5 | Introduction to RL | Download |
6 | RL Framework and applications | Download |
7 | Introduction to Immediate RL | Download |
8 | Bandit Optimalities | Download |
9 | Value function based methods | Download |
10 | UCB 1 | Download |
11 | Concentration Bounds | Download |
12 | UCB 1 Theorem | Download |
13 | PAC Bounds | Download |
14 | Median Elimination | Download |
15 | Thompson Sampling | Download |
16 | Policy Search | Download |
17 | REINFORCE | Download |
18 | Contextual Bandits | Download |
19 | Full RL Introduction | Download |
20 | Returns, Value Functions and MDPs | Download |
21 | MDP Modelling | Download |
22 | Bellman Equation | Download |
23 | Bellman Optimality Equation | Download |
24 | Cauchy Sequence and Green's Equation | Download |
25 | Banach Fixed Point Theorem | Download |
26 | Convergence Proof | Download |
27 | Lpi Convergence | Download |
28 | Value Iteration | Download |
29 | Policy Iteration | Download |
30 | Dynamic Programming | Download |
31 | Monte Carlo | Download |
32 | Control in Monte Carlo | Download |
33 | Off Policy MC | Download |
34 | UCT | Download |
35 | TD(0) | Download |
36 | TD(0) Control | Download |
37 | Q-Learning | Download |
38 | Afterstate | Download |
39 | Eligibility Traces | Download |
40 | Backward View of Eligibility Traces | Download |
41 | Eligibility Trace Control | Download |
42 | Thompson Sampling Recap | Download |
43 | Function Approximation | Download |
44 | Linear Parameterization | Download |
45 | State Aggregation Methods | Download |
46 | Function Approximation and Eligibility Traces | Download |
47 | LSTD and LSTDQ | Download |
48 | LSPI and Fitted Q | Download |
49 | DQN and Fitted Q-Iteration | Download |
50 | Policy Gradient Approach | Download |
51 | Actor Critic and REINFORCE | Download |
52 | REINFORCE (cont'd) | Download |
53 | Policy Gradient with Function Approximation | Download |
54 | Hierarchical Reinforcement Learning | Download |
55 | Types of Optimality | Download |
56 | Semi Markov Decision Processes | Download |
57 | Options | Download |
58 | Learning with Options | Download |
59 | Hierarchical Abstract Machines | Download |
60 | MAXQ | Download |
61 | MAXQ Value Function Decomposition | Download |
62 | Option Discovery | Download |
63 | POMDP Introduction | Download |
64 | Solving POMDP | Download |
Sl.No | Chapter Name | English |
---|---|---|
1 | Tutorial 1 - Probability Basics 1 | Download Verified |
2 | Tutorial 1-Probability basics2 | Download Verified |
3 | Tutorial 2-Linear algebra-1 | Download Verified |
4 | Tutorial 2-Linear algebra-2 | Download Verified |
5 | Introduction to RL | Download Verified |
6 | RL Framework and applications | Download Verified |
7 | Introduction to Immediate RL | Download Verified |
8 | Bandit Optimalities | Download Verified |
9 | Value function based methods | Download Verified |
10 | UCB 1 | Download Verified |
11 | Concentration Bounds | Download Verified |
12 | UCB 1 Theorem | Download Verified |
13 | PAC Bounds | Download Verified |
14 | Median Elimination | Download Verified |
15 | Thompson Sampling | Download Verified |
16 | Policy Search | Download Verified |
17 | REINFORCE | Download Verified |
18 | Contextual Bandits | Download Verified |
19 | Full RL Introduction | Download Verified |
20 | Returns, Value Functions and MDPs | Download Verified |
21 | MDP Modelling | Download Verified |
22 | Bellman Equation | Download To be verified |
23 | Bellman Optimality Equation | Download Verified |
24 | Cauchy Sequence and Green's Equation | Download Verified |
25 | Banach Fixed Point Theorem | Download Verified |
26 | Convergence Proof | Download Verified |
27 | Lpi Convergence | Download Verified |
28 | Value Iteration | Download Verified |
29 | Policy Iteration | Download Verified |
30 | Dynamic Programming | Download Verified |
31 | Monte Carlo | Download Verified |
32 | Control in Monte Carlo | Download Verified |
33 | Off Policy MC | Download Verified |
34 | UCT | Download Verified |
35 | TD(0) | Download Verified |
36 | TD(0) Control | Download Verified |
37 | Q-Learning | Download Verified |
38 | Afterstate | Download Verified |
39 | Eligibility Traces | Download Verified |
40 | Backward View of Eligibility Traces | Download Verified |
41 | Eligibility Trace Control | Download To be verified |
42 | Thompson Sampling Recap | Download To be verified |
43 | Function Approximation | Download To be verified |
44 | Linear Parameterization | Download To be verified |
45 | State Aggregation Methods | Download To be verified |
46 | Function Approximation and Eligibility Traces | Download To be verified |
47 | LSTD and LSTDQ | Download To be verified |
48 | LSPI and Fitted Q | Download To be verified |
49 | DQN and Fitted Q-Iteration | Download To be verified |
50 | Policy Gradient Approach | Download To be verified |
51 | Actor Critic and REINFORCE | Download To be verified |
52 | REINFORCE (cont'd) | Download To be verified |
53 | Policy Gradient with Function Approximation | Download To be verified |
54 | Hierarchical Reinforcement Learning | Download Verified |
55 | Types of Optimality | Download Verified |
56 | Semi Markov Decision Processes | Download Verified |
57 | Options | Download Verified |
58 | Learning with Options | Download Verified |
59 | Hierarchical Abstract Machines | Download Verified |
60 | MAXQ | Download To be verified |
61 | MAXQ Value Function Decomposition | Download To be verified |
62 | Option Discovery | Download To be verified |
63 | POMDP Introduction | Download To be verified |
64 | Solving POMDP | Download To be verified |
Sl.No | Language | Book link |
---|---|---|
1 | English | Not Available |
2 | Bengali | Not Available |
3 | Gujarati | Not Available |
4 | Hindi | Not Available |
5 | Kannada | Not Available |
6 | Malayalam | Not Available |
7 | Marathi | Not Available |
8 | Tamil | Not Available |
9 | Telugu | Not Available |