Modules / Lectures


Sl.No Chapter Name MP4 Download
1Introduction to Parallel ProgrammingDownload
2Parallel Architectures and Programming ModelsDownload
3PipeliningDownload
4Superpipelining and VLIWDownload
5Memory LatencyDownload
6Cache and Temporal LocalityDownload
7Cache, Memory bandwidth and Spatial LocalityDownload
8Intuition for Shared and Distributed Memory architecturesDownload
9Shared and Distributed Memory architecturesDownload
10Interconnection networks in Distributed Memory architecturesDownload
11OpenMP: A parallel Hello World ProgramDownload
12Program with Single threadDownload
13Program Memory with Multiple threads and Multi-taskingDownload
14Context SwitchingDownload
15OpenMP: Basic thread functionsDownload
16OpenMP: About OpenMPDownload
17Shared Memory Consistency Models and the Sequential Consistency ModelDownload
18Race ConditionsDownload
19OpenMP: Scoping variables and some race conditionsDownload
20OpenMP: thread private variables and more constructsDownload
21Computing sum: first attempt at parallelizationDownload
22Manual distribution of work and critical sectionsDownload
23Distributing for loops and reductionDownload
24Vector-Vector operations (Dot product)Download
25Matrix-Vector operations (Matrix-Vector Multiply)Download
26Matrix-Matrix operations (Matrix-Matrix Multiply)Download
27Introduction to tasksDownload
28Task queues and task executionDownload
29Accessing variables in tasksDownload
30Completion of tasks and scoping variables in tasksDownload
31Recursive task spawning and pitfallsDownload
32Understanding LU FactorizationDownload
33Parallel LU FactorizationDownload
34LocksDownload
35Advanced Task handlingDownload
36Matrix Multiplication using tasksDownload
37The OpenMP Shared Memory Consistency ModelDownload
38Applications finite element methodDownload
39Applications deep learningDownload
40Introduction to MPI and basic callsDownload
41MPI calls to send and receive dataDownload
42MPI calls for broadcasting dataDownload
43MPI non blocking callsDownload
44Application distributed histogram updationDownload
45MPI collectives and MPI broadcastDownload
46MPI gathering and scattering collectivesDownload
47MPI reduction and alltoall collectivesDownload
48Discussion on MPI collectives designDownload
49Characteriziation of interconnectsDownload
50Linear arrays 2D mesh and torusDownload
51d dimensional torusDownload
52HypercubeDownload
53Trees and cliquesDownload
54Hockney modelDownload
55Broadcast and Reduce with recursive doublingDownload
56Scatter and Gather with recursive doublingDownload
57Reduce scatter and All gather with recursive doublingDownload
58Discussion of message sizes in analysisDownload
59Revisiting Reduce scatter on 2D meshDownload
60Reduce scatter and Allreduce on the HypercubeDownload
61Alltoall on the HypercubeDownload
62Lower boundsDownload
63Pipeline based algorithm for AllreduceDownload
64An improved algorithm for Alltoall on the Hypercube using E cube routingDownload
65Pipeline based algorithm for BroadcastDownload
66Introduction to parallel graph algorithmsDownload
67Breadth First Search BFS using matrix algebraDownload
68BFS Shared memory parallelization using OpenMPDownload
69Distributed memory settings and data distributionDownload
70Distributed BFS algorithmDownload
71Performance considerationsDownload
72Prims AlgorithmDownload
73OpenMP based shared memory parallelization for MSTDownload
74MPI based distributed memory parallelization for MSTDownload
75Sequential Algorithm Adaptation from PrimsDownload
76Parallelization Strategy for Prims algorithmDownload
77Dry run with the parallel strategyDownload
78Johnsons algorithm with 1D data distributionDownload
79Speedup analysis on a grid graphDownload
80Floyds algorithm for all pair shortest pathsDownload
81Floyds algorithm with 2D data distributionDownload
82Adaptation to transitive closuresDownload
83Parallelization strategy for connected componentsDownload
84Analysis for parallel connected componentsDownload

Sl.No Chapter Name English
1Introduction to Parallel ProgrammingPDF unavailable
2Parallel Architectures and Programming ModelsPDF unavailable
3PipeliningPDF unavailable
4Superpipelining and VLIWPDF unavailable
5Memory LatencyPDF unavailable
6Cache and Temporal LocalityPDF unavailable
7Cache, Memory bandwidth and Spatial LocalityPDF unavailable
8Intuition for Shared and Distributed Memory architecturesPDF unavailable
9Shared and Distributed Memory architecturesPDF unavailable
10Interconnection networks in Distributed Memory architecturesPDF unavailable
11OpenMP: A parallel Hello World ProgramPDF unavailable
12Program with Single threadPDF unavailable
13Program Memory with Multiple threads and Multi-taskingPDF unavailable
14Context SwitchingPDF unavailable
15OpenMP: Basic thread functionsPDF unavailable
16OpenMP: About OpenMPPDF unavailable
17Shared Memory Consistency Models and the Sequential Consistency ModelPDF unavailable
18Race ConditionsPDF unavailable
19OpenMP: Scoping variables and some race conditionsPDF unavailable
20OpenMP: thread private variables and more constructsPDF unavailable
21Computing sum: first attempt at parallelizationPDF unavailable
22Manual distribution of work and critical sectionsPDF unavailable
23Distributing for loops and reductionPDF unavailable
24Vector-Vector operations (Dot product)PDF unavailable
25Matrix-Vector operations (Matrix-Vector Multiply)PDF unavailable
26Matrix-Matrix operations (Matrix-Matrix Multiply)PDF unavailable
27Introduction to tasksPDF unavailable
28Task queues and task executionPDF unavailable
29Accessing variables in tasksPDF unavailable
30Completion of tasks and scoping variables in tasksPDF unavailable
31Recursive task spawning and pitfallsPDF unavailable
32Understanding LU FactorizationPDF unavailable
33Parallel LU FactorizationPDF unavailable
34LocksPDF unavailable
35Advanced Task handlingPDF unavailable
36Matrix Multiplication using tasksPDF unavailable
37The OpenMP Shared Memory Consistency ModelPDF unavailable
38Applications finite element methodPDF unavailable
39Applications deep learningPDF unavailable
40Introduction to MPI and basic callsPDF unavailable
41MPI calls to send and receive dataPDF unavailable
42MPI calls for broadcasting dataPDF unavailable
43MPI non blocking callsPDF unavailable
44Application distributed histogram updationPDF unavailable
45MPI collectives and MPI broadcastPDF unavailable
46MPI gathering and scattering collectivesPDF unavailable
47MPI reduction and alltoall collectivesPDF unavailable
48Discussion on MPI collectives designPDF unavailable
49Characteriziation of interconnectsPDF unavailable
50Linear arrays 2D mesh and torusPDF unavailable
51d dimensional torusPDF unavailable
52HypercubePDF unavailable
53Trees and cliquesPDF unavailable
54Hockney modelPDF unavailable
55Broadcast and Reduce with recursive doublingPDF unavailable
56Scatter and Gather with recursive doublingPDF unavailable
57Reduce scatter and All gather with recursive doublingPDF unavailable
58Discussion of message sizes in analysisPDF unavailable
59Revisiting Reduce scatter on 2D meshPDF unavailable
60Reduce scatter and Allreduce on the HypercubePDF unavailable
61Alltoall on the HypercubePDF unavailable
62Lower boundsPDF unavailable
63Pipeline based algorithm for AllreducePDF unavailable
64An improved algorithm for Alltoall on the Hypercube using E cube routingPDF unavailable
65Pipeline based algorithm for BroadcastPDF unavailable
66Introduction to parallel graph algorithmsPDF unavailable
67Breadth First Search BFS using matrix algebraPDF unavailable
68BFS Shared memory parallelization using OpenMPPDF unavailable
69Distributed memory settings and data distributionPDF unavailable
70Distributed BFS algorithmPDF unavailable
71Performance considerationsPDF unavailable
72Prims AlgorithmPDF unavailable
73OpenMP based shared memory parallelization for MSTPDF unavailable
74MPI based distributed memory parallelization for MSTPDF unavailable
75Sequential Algorithm Adaptation from PrimsPDF unavailable
76Parallelization Strategy for Prims algorithmPDF unavailable
77Dry run with the parallel strategyPDF unavailable
78Johnsons algorithm with 1D data distributionPDF unavailable
79Speedup analysis on a grid graphPDF unavailable
80Floyds algorithm for all pair shortest pathsPDF unavailable
81Floyds algorithm with 2D data distributionPDF unavailable
82Adaptation to transitive closuresPDF unavailable
83Parallelization strategy for connected componentsPDF unavailable
84Analysis for parallel connected componentsPDF unavailable
Sl.No Chapter Name Hindi
1Introduction to Parallel ProgrammingDownload
2Parallel Architectures and Programming ModelsDownload
3PipeliningDownload
4Superpipelining and VLIWDownload
5Memory LatencyDownload
6Cache and Temporal LocalityDownload
7Cache, Memory bandwidth and Spatial LocalityDownload
8Intuition for Shared and Distributed Memory architecturesDownload
9Shared and Distributed Memory architecturesDownload
10Interconnection networks in Distributed Memory architecturesDownload
11OpenMP: A parallel Hello World ProgramDownload
12Program with Single threadDownload
13Program Memory with Multiple threads and Multi-taskingDownload
14Context SwitchingDownload
15OpenMP: Basic thread functionsDownload
16OpenMP: About OpenMPDownload
17Shared Memory Consistency Models and the Sequential Consistency ModelDownload
18Race ConditionsDownload
19OpenMP: Scoping variables and some race conditionsDownload
20OpenMP: thread private variables and more constructsDownload
21Computing sum: first attempt at parallelizationDownload
22Manual distribution of work and critical sectionsDownload
23Distributing for loops and reductionDownload
24Vector-Vector operations (Dot product)Download
25Matrix-Vector operations (Matrix-Vector Multiply)Download
26Matrix-Matrix operations (Matrix-Matrix Multiply)Download
27Introduction to tasksDownload
28Task queues and task executionDownload
29Accessing variables in tasksDownload
30Completion of tasks and scoping variables in tasksDownload
31Recursive task spawning and pitfallsDownload
32Understanding LU FactorizationDownload
33Parallel LU FactorizationDownload
34LocksDownload
35Advanced Task handlingDownload
36Matrix Multiplication using tasksDownload
37The OpenMP Shared Memory Consistency ModelDownload
38Applications finite element methodNot Available
39Applications deep learningNot Available
40Introduction to MPI and basic callsNot Available
41MPI calls to send and receive dataNot Available
42MPI calls for broadcasting dataNot Available
43MPI non blocking callsNot Available
44Application distributed histogram updationNot Available
45MPI collectives and MPI broadcastNot Available
46MPI gathering and scattering collectivesNot Available
47MPI reduction and alltoall collectivesNot Available
48Discussion on MPI collectives designNot Available
49Characteriziation of interconnectsNot Available
50Linear arrays 2D mesh and torusNot Available
51d dimensional torusNot Available
52HypercubeNot Available
53Trees and cliquesNot Available
54Hockney modelNot Available
55Broadcast and Reduce with recursive doublingNot Available
56Scatter and Gather with recursive doublingNot Available
57Reduce scatter and All gather with recursive doublingNot Available
58Discussion of message sizes in analysisNot Available
59Revisiting Reduce scatter on 2D meshNot Available
60Reduce scatter and Allreduce on the HypercubeNot Available
61Alltoall on the HypercubeNot Available
62Lower boundsNot Available
63Pipeline based algorithm for AllreduceNot Available
64An improved algorithm for Alltoall on the Hypercube using E cube routingNot Available
65Pipeline based algorithm for BroadcastNot Available
66Introduction to parallel graph algorithmsNot Available
67Breadth First Search BFS using matrix algebraNot Available
68BFS Shared memory parallelization using OpenMPNot Available
69Distributed memory settings and data distributionNot Available
70Distributed BFS algorithmNot Available
71Performance considerationsNot Available
72Prims AlgorithmNot Available
73OpenMP based shared memory parallelization for MSTNot Available
74MPI based distributed memory parallelization for MSTNot Available
75Sequential Algorithm Adaptation from PrimsNot Available
76Parallelization Strategy for Prims algorithmNot Available
77Dry run with the parallel strategyNot Available
78Johnsons algorithm with 1D data distributionNot Available
79Speedup analysis on a grid graphNot Available
80Floyds algorithm for all pair shortest pathsNot Available
81Floyds algorithm with 2D data distributionNot Available
82Adaptation to transitive closuresNot Available
83Parallelization strategy for connected componentsNot Available
84Analysis for parallel connected componentsNot Available


Sl.No Language Book link
1EnglishDownload
2BengaliNot Available
3GujaratiNot Available
4HindiNot Available
5KannadaNot Available
6MalayalamNot Available
7MarathiNot Available
8TamilNot Available
9TeluguNot Available