CS 252 (Spring '01): Project Suggestions

CS252: Spring 2001 Project Suggestions <Under construction!>

This document is divided into several sections associated with projects.

The IRAM Project

The IRAM project has a few topics that need investigation. VIRAM is a 100M transistor plus microprocessor for portable multimedia applications the is being constructued at Berkeley. It should be sent to be manufactured this Spring.

Build a performance simulator for VIRAM-1

Suggested byChristoforos Kozyrakis (kozyraki@cs.berkeley.edu)

VIRAM has an instruction set simular to show proper execution of programs, but no performance information. Instead, it produces a trace that can be feed into another program that calculates the number of clock cycles for that program. The original performance simulator was written early in the process, before we had settled on many key parameters, and that student graduated a while ago. The suggestion is to start with a clean slate and build in most of the key parameters thereby considerably simplifying (and speeding up) the performance simulator. Thus this does not need to interpret instructions, just calculate clock cycles. We would still like to vary a few paramters, such as the number of elements executed each clock cycle ("number of lanes"). The VIRAM architecture is online, although you should talk to Kozyrakis before getting started. The old Performance Simulation Manual is also online and available if you are on the Berkeley campus.

Port Berkeley Multimedia Workload to VIRAM

Suggested by Kathy Yelick (yelick@cs.berkeley.edu)

The Berkeley multimedia workload was developed in order to facilitate studies on architectural support for multimedia. These 16 kernels were written orginally in C and then tuned by hand for all existing SIMD multimedia architectures. VIRAM is a vector computer designed for multimedia, and it comes with a vectorzing compiler. The first step, after reading the papers above, would be to evaluate try the compiler on these kernels. The next step would be to code these by hand to see the performance improvement over compiled code and to compare to the computers with SIMD extentions. Metrics could include clock cycles, time, code size, power, percent vectorization, and relative performance of compiled vs. hand tuned code. As C does not have language support to express some of the DSP primitives (e.g., saturating arithmetic), you might also suggest what changes would be needed at the language level to be able to express these kernels. Christoforos Kozyrakis (kozyraki@cs.berkeley.edu)has C and VIRAM hand-tuned versions of other media kernels as a place to start on the IRAM version. VIRAM architecture is online, although you should talk to Prof. Yelick and Kozyrakis before getting started.

The ISTORE Project

The ISTORE project is investigating the integration of processors (intelligence) into the storage systems of large-scale servers. An ISTORE system consists of a traditional front-end CPU or SMP, plus multiple so-called "Intelligent Disks" (IDISKs, disks with integrated processors) interconnected via a fast crossbar-switched network. It very much follows the suggestions for new research directions by Jim Gray and John Hennessy found in the reading list. ISTORE-1 is being constructed this semester, and we hope to be operational in April. Rather than wait for that to be finished, a 8 node cluster of Cobalt PCs is being aquired for use in this classs. The following are some ISTORE-related projects. There are several people who may be able to help with these projects; talk to Aaron Brown first(abrown@cs.berkeley.edu) if you're interested in one of the following projects.

Availability Benchmarks

Suggested by Aaron Brown (abrown@cs.berkeley.edu)

USENIX

Tools for Availability Benchmarks. The idea is to modify device drivers to be able to precisely insert common failures at the device driver level in software to simulate hardware failures. Aaron Brown can point you at common failures for disks and networks. These are similar to failure breakpoints, but one issue is what mechanisms make sense to specify when a failure should occur. There are probably two projects here: one for networks and one for disks. Linux drivers would be the most likely target.
Availability Benchmarks on other systems. The Cobalt PCs have a failover mechanism for software servers which consists of shipping data over another Ethernet link to a shadow PC, which can then take over for the master if the master fails. It is called StaQware Cluster. One application is Cobalts Intershop Commerce software which makes it easy to set up a product selling internet site. The idea would be to get it running and then force faults to happen and watching how the system behaves. This data would be plotted against a workload to see what happens, similar to the RAID failure paper.

Maintainability Benchmarks

Suggested by Aaron Brown (abrown@cs.berkeley.edu)

Databases. Commercial databases can generally be downloaded for free for a 30 day basis. Try downloading several, set them up in some standard TPC configuration such as TPC-C, and then see how easy/difficult it is to add a node to the cluster, or add a warehouse to the database. Does this vary by database product? Part of this project will be to think about metrics that you can use

Operating systems. For Linux and FreeBSD, how long does it take and how hard is it to switch the 'personality' of a node, say from one web server to another or from a web server to a database server? How hard is it to add nodes to scale up the capabilities of an application?

Tools for Network Instrospection

Suggested by Eric Anderson (eanders@cs.berkeley.edu)

Making NFS More Robust

Suggested by Aaron Brown (abrown@cs.berkeley.edu)

Visual Google (Voogle? Goggle?)

Survey the products that support "3 tier applications" How dynamic are they? How well would they run on ISTORE? One starting point might be Cobalts Intershop Commerce software.

Back to CS252 page

Maintained by Dave Patterson (pattrsn@cs.berkeley.edu ). Last modified 20 Janurary 2001.