InstructorSteven Swanson
Office: EBU3B 3212 Office Hours: TBA UCSD homepage |
This course will cover advanced topics in processor microarchitecture. We will cover both the "latest and greatest" as well as the "oldies but goodies" in both commercial processors and architecture research. We will learn answers to questions like:
The basic format for the class will be: Read papers and discuss. There will also be a mid-sized project.
Note that 40% of you grade is determined by preparing for and participating in class.
Paper summaries | 20% | You will summarize each paper we read in class. Summaries are due 20 minutes before class begins. No exceptions. This means there is no reason to be late for class to complete your summary. |
Class participation | 20% | This class is discussion driven, so must come prepared to discuss the material |
Project | 30% | There will be a mid-sized project. |
In class presentations | 30% | In lieue of exams, each of you prepare and present two presentations on topics we will cover. |
We will read roughly two papers per class. Some days listed below have more than that, we'll thin them out depending on class interest.
YOU MUST BRING A PRINTED COPY OF EACH PAPER TO CLASS!
Items in the schedule more that one week in the future are subject to
change. Check back for updates for the assigned readings, etc.
Deadlines for homeworks/projecsts that
I will post the slides for most lectures. Since the slides contain material I am not allowed to distribute publically, they are password protected. I have posted the username and password to the web board.
Date | Topic | Readings | Slides | Due | Notes |
---|---|---|---|---|---|
Tuesday, January 6 | Administrivia and overview | slides , slides | |||
Thursday, January 8 | Historical perspectives | The history of the microcomputer-invention and evolution, , Proceedings of the IEEE 83(12):1601-1608, Dec 1995 link. Additional readings if you are interested: A 4096-bit dynamic MOS RAM, , Solid-State Circuits Conference. Digest of Technical Papers. 1972 IEEE International XV: 10-11, Feb 1972 link. A three transistor-cell, 1024-bit, 500 NS MOS RAM, , Solid-State Circuits Conference. Digest of Technical Papers. 1970 IEEE International XIII: 42-43, Feb 1970 link. Design of ion-implanted MOSFET's with very small physical dimensions, , Solid-State Circuits, IEEE Journal of 9(5): 256-268, Oct 1974 link. The future of wires, , Proceedings of the IEEE 89(4):490-504, Apr 2001 link. |
slides , slides , slides , slides | ||
Tuesday, January 13 | Historical perspectives | Parallel operation in the control data 6600, , :5-12, 1995 link. Additional readings if you are interested: Design of a Computer -- The Control Data 6600, , link. Considerations in Computer Design - Leading up to the Control Data 6600, , , 1963 link. IBM's 360 and early 370 systems, MIT Press, 1991. |
slides , slides , slides | ||
Thursday, January 15 | Historical perspectives | The CRAY-1 computer system, , Commun. ACM 21(1):63-72, 1978 link. Additional readings if you are interested: An analysis of the Cray-1 computer, , ISCA '78: Proceedings of the 5th annual symposium on Computer architecture, New York, NY, USA, 1978, pages 101-106 link. Tarantula: a vector extension to the alpha architecture, , Computer Architecture, 2002. Proceedings. 29th Annual International Symposium on:281-292, 2002 link. |
slides | ||
Tuesday, January 20 | Unconventional OOO exeuction | HPS, a new microarchitecture: rationale and introduction, , MICRO 18: Proceedings of the 18th annual workshop on Microprogramming, New York, NY, USA, 1985, pages 103-108 link. Additional readings if you are interested: Critical issues regarding HPS, a high performance microarchitecture, , SIGMICRO Newsl. 16(4):109-116, 1985 link. First version of a data flow procedure language, , Programming Symposium, Proceedings Colloque sur la Programmation, London, UK, 1974, pages 362-376. Monsoon: an explicit token-store architecture, , Computer Architecture, 1990. Proceedings., 17th Annual International Symposium on:82-91, May 1990 link. |
slides , slides | Project 1-1; | |
Thursday, January 22 | Unconventional OOO exeuction | . Tartan: evaluating spatial computation for whole program execution, , ASPLOS-XII: Proceedings of the 12th international conference on Architectural support for programming languages and operating systems, New York, NY, USA, 2006, pages 163-174 link. Additional readings if you are interested: Spatial computation, , ASPLOS-XI: Proceedings of the 11th international conference on Architectural support for programming languages and operating systems, New York, NY, USA, 2004, pages 14-26 link. NanoFabrics: spatial computing using molecular electronics, , ISCA '01: Proceedings of the 28th annual international symposium on Computer architecture, New York, NY, USA, 2001, pages 178-191 link. |
slides , slides | ||
Tuesday, January 27 | Unconventional OOO exeuction | Composable Lightweight Processors, , MICRO '07: Proceedings of the 40th Annual IEEE/ACM International Symposium on Microarchitecture, Washington, DC, USA, 2007, pages 381-394 link. Additional readings if you are interested: Universal Mechanisms for Data-Parallel Architectures, , MICRO 36: Proceedings of the 36th annual IEEE/ACM International Symposium on Microarchitecture, Washington, DC, USA, 2003, page 303 link. A design space evaluation of grid processor architectures, , Microarchitecture, 2001. MICRO-34. Proceedings. 34th ACM/IEEE International Symposium on: 40-51, Dec. 2001 link. |
slides | ||
Thursday, January 29 | Reliability | Techniques to Reduce the Soft Error Rate of a High-Performance Microprocessor, , ISCA '04: Proceedings of the 31st annual international symposium on Computer architecture, Washington, DC, USA, 2004, page 264 link. Additional readings if you are interested: Concurrent error detection using watchdog processors-a survey, , Computers, IEEE Transactions on 37(2):160-174, Feb. 1988 link. The risk of data corruption in microprocessor-based systems, , Fault-Tolerant Computing, 1993. FTCS-23. Digest of Papers., The Twenty-Third International Symposium on:576-585, Jun 1993 link. AR-SMT: a microarchitectural approach to fault tolerance in microprocessors, , Fault-Tolerant Computing, 1999. Digest of Papers. Twenty-Ninth Annual International Symposium on:84-91, 1999 link. IBM's S/390 G5 microprocessor design, , Micro, IEEE 19(2):12-23, Mar/Apr 1999 link. |
slides | ||
Tuesday, February 3 | Reliability | DIVA: a reliable substrate for deep submicron microarchitecture design, , MICRO 32: Proceedings of the 32nd annual ACM/IEEE international symposium on Microarchitecture, Washington, DC, USA, 1999, pages 196-207 link. |
slides , slides | ||
Thursday, February 5 | Circuit-level microarchitectural issues | Razor: a low-power pipeline based on circuit-level timing speculation, , Microarchitecture, 2003. MICRO-36. Proceedings. 36th Annual IEEE/ACM International Symposium on: 7-18, Dec. 2003 link. |
slides | ||
Tuesday, February 10 | Circuit-level microarchitectural issues | Optimum Power/Performance Pipeline Depth, , MICRO 36: Proceedings of the 36th annual IEEE/ACM International Symposium on Microarchitecture, Washington, DC, USA, 2003, page 117 link. |
slides | ||
Thursday, February 12 | Multi-threading | A scalable approach to thread-level speculation, , ISCA '00: Proceedings of the 27th annual international symposium on Computer architecture, New York, NY, USA, 2000, pages 1-12 link. Additional readings if you are interested: Speculative Versioning Cache, , IEEE Trans. Parallel Distrib. Syst. 12(12):1305-1317, 2001 link. |
slides , slides | ||
Tuesday, February 17 | Specialized architectures | CryptoManiac: a fast flexible architecture for secure communication, , ISCA '01: Proceedings of the 28th annual international symposium on Computer architecture, New York, NY, USA, 2001, pages 110-119 link. Additional readings if you are interested: Evaluating the Imagine Stream Architecture, , ISCA '04: Proceedings of the 31st annual international symposium on Computer architecture, Washington, DC, USA, 2004, page 14. |
slides , slides | ||
Thursday, February 19 | Program analysis | Limits of control flow on parallelism, , SIGARCH Comput. Archit. News 20(2):46-57, 1992 link. Additional readings if you are interested: Phase tracking and prediction, , SIGARCH Comput. Archit. News 31(2):336-349, 2003 link. The intrinsic bandwidth requirements of ordinary programs, , ASPLOS-VII: Proceedings of the seventh international conference on Architectural support for programming languages and operating systems, New York, NY, USA, 1996, pages 105-114 link. Limits on multiple instruction issue, , SIGARCH Comput. Archit. News 17(2):290-302, 1989 link. Limits of instruction-level parallelism, , ASPLOS-IV: Proceedings of the fourth international conference on Architectural support for programming languages and operating systems, New York, NY, USA, 1991, pages 176-188 link. |
slides | ||
Tuesday, February 24 | Highly-dynamic execution | PipeRench implementation of the instruction path coprocessor, , MICRO 33: Proceedings of the 33rd annual ACM/IEEE international symposium on Microarchitecture, New York, NY, USA, 2000, pages 147-158 link. |
slides | ||
Thursday, February 26 | Power | Temperature-aware microarchitecture: Modeling and implementation, , ACM Trans. Archit. Code Optim. 1(1):94-125, 2004 link. |
slides | ||
Tuesday, March 2 | Case Studies | Core 2 article 2 Core 2 article 3 Itanium Processor Microarchitecture, , IEEE Micro 20(5):24-43, 2000 link. EPIC: Explicitly Parallel Instruction Computing, , Computer 33(2):37-45, Feb 2000 link. |
slides | ||
Thursday, March 4 | Case studies | The Alpha 21264 microprocessor architecture, , Computer Design: VLSI in Computers and Processors, 1998. ICCD '98. Proceedings. International Conference on:90-95, Oct 1998 link. |
slides | ||
Tuesday, March 9 | |||||
Thursday, March 11 | Potporri | Decoupled access/execute computer architectures, , ACM Trans. Comput. Syst. 2(4):289-308, 1984 link. |
slides , slides | ||
6:00pm, Thursday, March 19 | TBA | Project 1-2; |