Module 10: "Design of Shared Memory Multiprocessors"
  Lecture 20: "Performance of Coherence Protocols"
 

Design issues

Dragon example
  • Take the following sequence
    • P0 reads x, P1 reads x, P1 writes x, P0 reads x, P2 reads x, P3 writes x
    • P0 generates BusRd; the shared line remains low (no other cached copy exists), so P0 puts the line in E state
    • P1 generates BusRd, shared line is asserted by P0, P1 puts line in Sc state, P0 also transitions to Sc state
    • P1 generates BusUpd, P0 asserts shared line, P1 takes the line to Sm state, P0 applies update but remains in Sc
    • P0 reads from cache, no state transition
    • P2 generates BusRd, P0 and P1 assert shared line, P1 sources the line on bus, P2 puts line in Sc state, P1 remains in Sm state, P0 remains in Sc state
    • P3 generates BusRd followed by BusUpd, P0, P1, P2 assert shared line, P1 sources the line on bus, P3 puts line in Sm state, line in P1 goes to Sc state, lines in P0 and P2 remain in Sc state, all processors update line
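The walkthrough above can be reproduced with a minimal state-machine sketch of the Dragon transitions (class and method names are my own; data values, updates, and memory writebacks are not modeled):

```python
# Minimal sketch of Dragon per-line state transitions (E, Sc, Sm, M as in
# the lecture). Tracks only states, not data; one line, four processors.

class DragonLine:
    def __init__(self, nprocs):
        self.state = [None] * nprocs            # None = line not cached

    def sharers(self, p):
        return [q for q, s in enumerate(self.state) if q != p and s]

    def read(self, p):
        if self.state[p]:                       # read hit: no transition
            return
        if self.sharers(p):                     # BusRd, shared line asserted
            self.state[p] = "Sc"
            for q in self.sharers(p):           # owner sources the line on bus
                self.state[q] = {"E": "Sc", "M": "Sm"}.get(self.state[q], self.state[q])
        else:                                   # BusRd, shared line low
            self.state[p] = "E"

    def write(self, p):
        if self.state[p] is None:               # write miss: BusRd, then BusUpd
            self.read(p)
        if self.state[p] in ("E", "M"):
            self.state[p] = "M"                 # silent E -> M, no bus traffic
        elif self.sharers(p):                   # BusUpd, shared line asserted
            for q in self.sharers(p):
                if self.state[q] == "Sm":       # old owner hands off ownership
                    self.state[q] = "Sc"        # (Sc copies just apply the update)
            self.state[p] = "Sm"                # writer becomes the owner
        else:
            self.state[p] = "M"                 # BusUpd, shared line low

line = DragonLine(4)
for op, p in [("r", 0), ("r", 1), ("w", 1), ("r", 0), ("r", 2), ("w", 3)]:
    line.read(p) if op == "r" else line.write(p)
print(line.state)                               # ['Sc', 'Sc', 'Sc', 'Sm']
```

Running the lecture's access sequence leaves P0, P1, P2 in Sc and P3 in Sm, matching the final step of the example.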

Design issues

  • Can we eliminate the Sm state?
    • Yes, provided that on every BusUpd memory is also updated; then the Sc state suffices (this essentially boils down to a standard MSI-style update protocol)
    • However, updating the cache may be faster than updating memory. Applying an update occupies the cache data banks, preventing the processor from accessing the cache during that time; to avoid degrading performance, extra cache ports may be needed
  • Is it necessary to launch a bus transaction on an eviction of a line in Sc state?
    • It may help if this was the last remaining copy in Sc state
    • If another cache holds the line in Sm state, that line could go back to M, saving subsequent unnecessary BusUpd transactions (although the shared wire already solves this: the next BusUpd sees the shared line low and takes the line to M)
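The first point above can be made concrete with a sketch of the write transition in the Sm-free variant. This is a standalone fragment, not a full protocol model; the function name and the toy memory flag are hypothetical, and only the write path is shown:

```python
# Sketch of the write path once Sm is eliminated: every BusUpd also writes
# memory, so memory is always up to date and a shared writer can stay in Sc.

def write_transition(state, other_copies_exist, memory):
    """Return (new_state, bus_action) for a processor write to a cached line."""
    if state in ("E", "M"):
        return "M", None                 # silent upgrade, no bus traffic
    if other_copies_exist:
        memory["stale"] = False          # BusUpd must update memory as well
        return "Sc", "BusUpd"            # no Sm needed: nobody owes a writeback
    return "M", "BusUpd"                 # shared line low: take the line to M

mem = {"stale": False}
print(write_transition("Sc", True, mem))     # ('Sc', 'BusUpd')
```

The cost of this simplification is exactly the one noted above: each BusUpd now pays the memory-update latency on the critical path.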

General issues

  • Thus far we have assumed an atomic bus where transactions are not interleaved
    • In reality, high-performance buses are pipelined and multiple transactions are in progress at the same time
    • How do you reason about coherence?
  • Thus far we have assumed that the processor has only one level of cache
    • How to extend the coherence protocol to multiple levels of cache?
    • Normally, the cache coherence protocols we have discussed thus far execute only at the outermost level of the cache hierarchy
    • A simpler but different protocol runs within the hierarchy to maintain coherence
  • We will revisit these questions soon

Evaluating protocols

  • In message passing machines the design of the message layer plays an important role
  • Similarly, cache coherence protocols are central to the design of a shared memory multiprocessor
  • The protocol performance depends on an array of parameters
  • Experience and intuition help in determining good design points
  • Otherwise designers use workload-driven simulations for cost/performance analysis
    • Goal is to decide where to spend money, time and energy
    • The simulators model the underlying multiprocessor in enough detail to capture correct performance trends as one explores the parameter space
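The kind of trade-off such simulations expose can be sketched with a toy parameter sweep. Every constant and the square-root miss-rate rule below are invented purely for illustration; a real study would replay workload traces through a detailed multiprocessor simulator:

```python
# Toy workload-driven sweep: compare bus traffic of an update vs. an
# invalidation protocol across cache sizes. The cost model is deliberately
# crude and all numbers are made up; only the methodology is the point.

def bus_traffic(cache_kb, protocol, shared_writes, misses_at_64kb):
    misses = misses_at_64kb * (64 / cache_kb) ** 0.5   # toy sqrt miss-rate rule
    if protocol == "update":
        return misses + shared_writes                  # one BusUpd per shared write
    return misses + 0.5 * shared_writes                # toy re-miss penalty

for kb in (64, 256, 1024):
    for proto in ("update", "invalidate"):
        print(kb, proto, round(bus_traffic(kb, proto, 500, 1000), 1))
```

Even a crude model like this shows the shape of the analysis: sweep the parameter space, watch where the curves cross, and spend design effort near the crossover points.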