I have a certain basic understanding of how CPUs are made up with functional blocks, stages in a pipeline, why instructions are decoded and dispatched, and a naive understanding of cache hierarchy.
The memory hierarchy (including caches and main memory) can consume as much as 50% of an embedded system power. This power is very application dependent, and tuning caches for a given application is a ...