Module 10: "Design of Shared Memory Multiprocessors"
  Lecture 19: "Sequential Consistency and Cache Coherence Protocols"
 

Back to shared bus

  • Centralized shared bus makes it easy to support SC
    • Writes and reads are all serialized in a total order through the bus transaction ordering
    • If a read returns the value of a previous write, that write is guaranteed to be complete because its bus transaction has completed
    • The write order seen by all processors is the same in a write through system because every write causes a transaction and hence is visible to all in the same order
    • In a nutshell, every processor sees the same total bus order for all memory operations, and therefore any bus-based SMP with write through caches is SC (a small litmus test illustrating this guarantee appears after this list)
  • What about a multiprocessor with writeback cache?
    • In practice no SMP uses a write through protocol because of its high bandwidth demand
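
  To make the guarantee concrete, here is the classic store-buffering litmus test sketched in C. The names (x, y, r0, r1, p0, p1) are illustrative, not from the lecture; C11 sequentially consistent atomics are used so the sketch is correct even on non-SC hardware, while on a bus-based SC machine plain loads and stores would already behave this way.

    /* Store-buffering litmus test: under sequential consistency the outcome
       r0 == 0 && r1 == 0 is impossible, because whichever write is serialized
       first on the bus must be seen by the other processor's read. */
    #include <stdatomic.h>
    #include <pthread.h>
    #include <stdio.h>

    atomic_int x = 0, y = 0;
    int r0, r1;

    void *p0(void *arg) {
        (void)arg;
        atomic_store(&x, 1);   /* write serialized on the bus ...        */
        r0 = atomic_load(&y);  /* ... before this read can be serviced   */
        return NULL;
    }

    void *p1(void *arg) {
        (void)arg;
        atomic_store(&y, 1);
        r1 = atomic_load(&x);
        return NULL;
    }

    int main(void) {
        pthread_t t0, t1;
        pthread_create(&t0, NULL, p0, NULL);
        pthread_create(&t1, NULL, p1, NULL);
        pthread_join(t0, NULL);
        pthread_join(t1, NULL);
        printf("r0=%d r1=%d\n", r0, r1);  /* "r0=0 r1=0" never appears under SC */
        return 0;
    }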

Snoopy protocols

  • No change to processor or cache
    • Just extend the cache controller with snoop logic and exploit the bus
  • We will focus on writeback caches only
    • Possible states of a cache line: Invalid (I), Shared (S), Modified or dirty (M), Clean exclusive (E), Owned (O); not every processor supports all five states (a small sketch of these states appears after this list)
    • E state is equivalent to M in the sense that the line has write permission, but in E state the line is not yet modified and the copy in memory is identical to the one in the cache; if someone else requests the line, memory will provide it
    • O state is similar to S state, except that memory is not responsible for servicing requests to the line; the owner must supply the line (just as in M state)
    • Stores really read memory (as opposed to writing it): a store miss must first fetch the line before it can be modified
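
  A minimal sketch in C of the five line states and the access permissions each grants; the enum and helper names are made up for illustration and are not part of any real controller.

    /* Hypothetical encoding of the five snoopy cache line states (MOESI). */
    typedef enum {
        INVALID,    /* I: no valid copy in this cache                        */
        SHARED,     /* S: clean copy, possibly also in other caches          */
        EXCLUSIVE,  /* E: only copy, clean, memory still up to date          */
        OWNED,      /* O: possibly dirty and shared; this cache, not memory,
                       must supply the line to requesters                    */
        MODIFIED    /* M: only copy, dirty, memory is stale                  */
    } line_state_t;

    /* Read permission: any valid state.  Write permission: only M and E
       (a write in E silently transitions the line to M). */
    static int can_read(line_state_t s)  { return s != INVALID; }
    static int can_write(line_state_t s) { return s == MODIFIED || s == EXCLUSIVE; }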

Stores

  • Look at stores a little more closely
    • There are three situations at the time a store issues: the line is not in the cache (I state), the line is in the cache in S or O state, or the line is in the cache in M or E state (the resulting bus actions are sketched after this list)
    • If the line is in I state, the store generates a read-exclusive request on the bus and gets the line in M state
    • If the line is in S or O state, that means the processor only has read permission for that line; the store generates an upgrade request on the bus and the upgrade acknowledgment gives it the write permission (this is a data-less transaction)
    • If the line is in M or E state, no bus transaction is generated; the cache already has write permission for the line (this is the case of a write hit; previous two are write misses)
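
  Continuing the line_state_t sketch above, the three cases translate into the following hypothetical bus-action choice in the cache controller; BUS_READX and BUS_UPGRADE are illustrative names for the read-exclusive and upgrade transactions.

    typedef enum { BUS_NONE, BUS_READX, BUS_UPGRADE } bus_action_t;

    /* Bus transaction issued when a store accesses a line in state s. */
    bus_action_t action_on_store(line_state_t s) {
        switch (s) {
        case INVALID:               /* write miss: fetch the line with
                                       write permission                    */
            return BUS_READX;
        case SHARED:
        case OWNED:                 /* data present, but read permission
                                       only: data-less upgrade             */
            return BUS_UPGRADE;
        case EXCLUSIVE:
        case MODIFIED:              /* write hit: no bus transaction       */
            return BUS_NONE;
        }
        return BUS_READX;           /* not reached; silences the compiler  */
    }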

Invalidation vs. update

  • Two main classes of protocols:
    • Invalidation-based and update-based
    • Dictates what action the snooping caches take when a write appears on the bus (the two responses are contrasted in the sketch after this list)
    • Invalidation-based protocols invalidate sharers when a write miss (upgrade or readX) appears on the bus
    • Update-based protocols update the sharer caches with new value on a write: requires write transactions (carrying just the modified bytes) on the bus even on write hits (not very attractive with writeback caches)
    • Advantage of update-based protocols: sharers continue to hit in the cache, while in invalidation-based protocols sharers miss the next time they access the line
    • Advantage of invalidation-based protocols: only write misses go on bus (suited for writeback caches) and subsequent stores to the same line are cache hits
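
  The snoop-side difference between the two classes, again continuing the line_state_t sketch; the byte-patching interface of snoop_write_update is a simplification made up for this example.

    /* Invalidation-based: on seeing another processor's readX/upgrade,
       simply drop the local copy; the next local access will miss. */
    void snoop_write_invalidate(line_state_t *s) {
        *s = INVALID;
    }

    /* Update-based: patch the modified bytes into the local copy; the line
       stays readable, so subsequent local loads still hit. */
    void snoop_write_update(line_state_t *s, unsigned char *line,
                            const unsigned char *new_bytes,
                            int offset, int len) {
        if (*s != INVALID) {
            for (int i = 0; i < len; i++)
                line[offset + i] = new_bytes[i];
            *s = SHARED;
        }
    }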

Which one is better?

  • Difficult to answer
    • Depends on program behavior and hardware cost
  • When is an update-based protocol good?
    • Sharing patterns where values produced by one processor are read by many (large-scale producer/consumer); see the sketch at the end of this section
    • Otherwise it would just waste bus bandwidth doing useless updates
  • When is an invalidation-based protocol good?
    • Sequence of multiple writes to a cache line
    • Saves intermediate write transactions
  • Also think about the overhead of initiating small updates for every write in update protocols
    • Invalidation-based protocols are much more popular
    • Some systems support both, or a hybrid that chooses based on the dynamically observed sharing pattern of a cache line
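
  Two made-up access patterns illustrating the trade-off. In accumulate, an invalidation-based protocol pays one read-exclusive or upgrade and then hits locally, while an update-based protocol would put every increment on the bus; in the producer/consumer pair, updates keep every consumer's copy valid, while invalidations force each consumer to miss after every production.

    /* (a) Many writes to one line before anyone else reads it:
           favours invalidation. */
    void accumulate(int *counter) {
        for (int i = 0; i < 1000; i++)
            (*counter)++;            /* all local hits after the first write */
    }

    /* (b) One producer, many consumers of the same word:
           favours update. */
    volatile int flag;
    void produce(int v) { flag = v; }
    int  consume(void)  { return flag; }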