Circuits for Low Power Bus Traffic Encoding

Spring 2003 EE241 Mid-Semester Project Report

Yatish Patel, Yury Markovskiy

yatish@cs, yurym@cs

Introduction

One proposed method for reducing energy consumed by on-chip buses is to send frequently transmitted values less frequently. This is accomplished by inserting a transmitter on one side of the bus and a receiver circuit on the other side. These circuits effectively reduce the number of bit transitions by caching previously sent values and if a data entry is in the cache, only the cache index is sent across the bus.

Status

We have put together a layout for the bus traffic encouder and performed a netlist extraction with full parasitics. The layout is shown in figure 1. The netlist was then modified to allow us to measure energy consumed by individual components.

Layout picture

One of the first surprising findings was that the majority of energy was consumed by the control circuitry, i.e. enable and clock lines that run to all CAM cells. Therefore, optimizations to control circuits were our first priority. Specifically, we have made the following modification so far. The main CAM cell originally was designed such that it required 5 control lines (shift, enA, enA_bar, enB, enB_bar). The storage element was redesigned to require only 1 control signal (simple level sensitive latch built with 6 transistors). A schematic of the CAM cell follows.

CAM cell schematic

The graph below shows the amount of energy consumed by our circuit to process 50 cycles of gcc benchmark. The first column is the result prior to any modification and the second column is after modification. It is important to note that the layout was not "compressed" after unnecessary elements were removed. In other words, once we create a new layout each cell can be even smaller, requiring less energy than these results show. Energy Comparision

Currently, we are formulating the strategy to minimize the energy consumption even further. We plan to address the following questions and determine their effect on the overall performance.

Paper Summaries

The encoder we are working with caches a number of most recently transmitted words in its CAM array. We attempted to locate literature that presents implementation of low-power storage and word matching structures. The elements that we are particularly interested in are implementation of decoders, storage cells, reduction of capacitance driven on word and bit lines.