SF – CS2013 Version

Systems Fundamentals (SF)

The underlying hardware and software infrastructure upon which applications are constructed is collectively described by the term “computer systems.” Computer systems broadly span the sub-disciplines of operating systems, parallel and distributed systems, communications networks, and computer architecture. Traditionally, these areas are taught in a non-integrated way through independent courses. However, these sub-disciplines increasingly share important common fundamental concepts within their respective cores. These concepts include computational paradigms, parallelism, cross-layer communications, state and state transitions, resource allocation and scheduling, and so on. This knowledge area is designed to present an integrative view of these fundamental concepts in a unified, albeit simplified, fashion, providing a common foundation for the different specialized mechanisms and policies appropriate to each particular domain area.

 

SF. Systems Fundamentals. [18 Core-Tier1 hours, 9 Core-Tier2 hours]

                                          Core-Tier1   Core-Tier2   Includes
                                          hours        hours        Electives
  SF/Computational Paradigms              3                         N
  SF/Cross-Layer Communications           3                         N
  SF/State and State Machines             6                         N
  SF/Parallelism                          3                         N
  SF/Evaluation                           3                         N
  SF/Resource Allocation and Scheduling                2            N
  SF/Proximity                                         3            N
  SF/Virtualization and Isolation                      2            N
  SF/Reliability through Redundancy                    2            N
  SF/Quantitative Evaluation                                        Y

 

SF/Computational Paradigms

[3 Core-Tier1 hours]

The view presented here is of a system's multiple representations across layers, from hardware building blocks to application components, and of the parallelism available in each representation. [Cross-reference PD/Parallelism Fundamentals]

Topics:

  • Basic building blocks and components of a computer (gates, flip-flops, registers, interconnections; Datapath + Control + Memory)
  • Hardware as a computational paradigm: Fundamental logic building blocks; Logic expressions, minimization, sum of product forms
  • Application-level sequential processing: single thread
  • Simple application-level parallel processing: request level (web services/client-server/distributed), single thread per server, multiple threads with multiple servers
  • Basic concept of pipelining, overlapped processing stages
  • Basic concept of scaling: going faster vs. handling larger problems

 

Learning Outcomes:

  1. List commonly encountered patterns of how computations are organized. [Familiarity]
  2. Describe the basic building blocks of computers and their role in the historical development of computer architecture. [Familiarity]
  3. Articulate the differences between single thread vs. multiple thread, single server vs. multiple server models, motivated by real world examples (e.g., cooking recipes, lines for multiple teller machines and couples shopping for food). [Familiarity]
  4. Articulate the concept of strong vs. weak scaling, i.e., how performance is affected by scale of the problem vs. scale of the resources to solve the problem. This can be motivated by simple, real-world examples. [Familiarity]
  5. Design a simple logic circuit using the fundamental building blocks of logic design. [Usage]
  6. Use tools for capture, synthesis, and simulation to evaluate a logic design. [Usage]
  7. Write a simple sequential program and a simple parallel version of the same program. [Usage]
  8. Evaluate performance of simple sequential and parallel versions of a program with different problem sizes, and be able to describe the speed-ups achieved. [Assessment]
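As an illustration of outcomes 7 and 8, the following sketch shows one computation written sequentially and in a simple partition-and-combine parallel style. All names are illustrative; a course would substitute its own language and toolchain.

```python
# Sketch: the same computation written sequentially and in a simple
# data-parallel style (partition the input, process chunks concurrently,
# combine the partial results).
from concurrent.futures import ThreadPoolExecutor

def sequential_sum(data):
    """Single thread: one pass over the whole input."""
    total = 0
    for x in data:
        total += x
    return total

def parallel_sum(data, workers=4):
    """Partition the input, sum the chunks concurrently, combine.
    (A thread pool keeps the sketch portable; CPU-bound work in Python
    would normally use a process pool to obtain real speedup.)"""
    size = max(1, len(data) // workers)
    chunks = [data[i:i + size] for i in range(0, len(data), size)]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return sum(pool.map(sum, chunks))

data = list(range(1000))
assert sequential_sum(data) == parallel_sum(data) == 499500
```

Comparing the two versions on growing problem sizes, per outcome 8, is then a matter of timing each call.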

 

SF/Cross-Layer Communications

[Cross-reference NC/Introduction, OS/Operating Systems Principles]

[3 Core-Tier1 hours]

Topics:

  • Programming abstractions, interfaces, use of libraries
  • Distinction between Application and OS services, Remote Procedure Call
  • Application-Virtual Machine Interaction
  • Reliability

 

Learning Outcomes:

  1. Describe how computing systems are constructed of layers upon layers, based on separation of concerns, with well-defined interfaces, hiding details of low layers from the higher layers. [Familiarity]
  2. Describe how the hardware, VM, OS, and application are additional layers of interpretation/processing. [Familiarity]
  3. Describe the mechanisms of how errors are detected, signaled back, and handled through the layers. [Familiarity]
  4. Construct a simple program using methods of layering, error detection and recovery, and reflection of error status across layers. [Usage]
  5. Find bugs in a layered program by using tools for program tracing, single stepping, and debugging. [Usage]
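Outcomes 3 and 4 can be sketched in a few lines: each layer exposes a narrow interface, translates lower-layer failures into its own error type, and the top layer recovers. The layer and error names below are illustrative, not from any particular library.

```python
# Sketch of error reflection across layers: failures are detected low,
# signaled upward in each layer's own terms, and handled at the top.

class StorageError(Exception): pass      # lowest layer's error type
class ServiceError(Exception): pass      # middle layer's error type

def storage_read(key, store):
    """Lowest layer: raw lookup; signals failure with StorageError."""
    if key not in store:
        raise StorageError(f"no record for {key!r}")
    return store[key]

def service_get(key, store):
    """Middle layer: hides storage details, re-signals in its own terms."""
    try:
        return storage_read(key, store)
    except StorageError as err:
        raise ServiceError(f"lookup failed: {err}") from err

def app_get(key, store, default=None):
    """Top layer: recovers from the failure with a default value."""
    try:
        return service_get(key, store)
    except ServiceError:
        return default

store = {"a": 1}
assert app_get("a", store) == 1
assert app_get("missing", store, default=0) == 0
```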

 

SF/State and State Machines

[6 Core-Tier1 hours]

[Cross-reference AL/Basic Computability and Complexity, OS/State and State diagrams, NC/Protocols]

Topics:

  • Digital vs. Analog/Discrete vs. Continuous Systems
  • Simple logic gates, logical expressions, Boolean logic simplification
  • Clocks, State, Sequencing
  • Combinational Logic, Sequential Logic, Registers, Memories
  • Computers and Network Protocols as examples of State Machines

 

Learning Outcomes:

  1. Describe computations as a system characterized by a known set of configurations, with transitions from one unique configuration (state) to another (state). [Familiarity]
  2. Describe the distinction between systems whose output is only a function of their input (Combinational) and those with memory/history (Sequential). [Familiarity]
  3. Describe a computer as a state machine that interprets machine instructions. [Familiarity]
  4. Explain how a program or network protocol can also be expressed as a state machine, and that alternative representations for the same computation can exist. [Familiarity]
  5. Develop state machine descriptions for simple problem statement solutions (e.g., traffic light sequencing, pattern recognizers). [Usage]
  6. Derive time-series behavior of a state machine from its state machine representation. [Assessment]
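The traffic-light problem from outcome 5 makes a compact worked example: a state machine is just a set of states plus a transition function, and the time-series behavior of outcome 6 falls out by repeated application. The representation below is one of several equivalent ones.

```python
# Sketch: traffic-light sequencing as an explicit state machine.

TRANSITIONS = {          # state -> next state on each clock tick
    "green": "yellow",
    "yellow": "red",
    "red": "green",
}

def step(state):
    """One transition of the machine."""
    return TRANSITIONS[state]

def run(initial, ticks):
    """Derive the time-series behavior from the state machine description."""
    trace = [initial]
    state = initial
    for _ in range(ticks):
        state = step(state)
        trace.append(state)
    return trace

assert run("green", 4) == ["green", "yellow", "red", "green", "yellow"]
```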

 

SF/Parallelism

[3 Core-Tier1 hours]

[Cross-reference: PD/Parallelism Fundamentals]

Topics:

  • Sequential vs. parallel processing
  • Parallel programming vs. concurrent programming
  • Request parallelism vs. task parallelism
  • Client-Server/Web Services, Thread (Fork-Join), Pipelining
  • Multicore architectures and hardware support for synchronization

 

Learning Outcomes:

  1. For a given program, distinguish between its sequential and parallel execution, and the performance implications thereof. [Familiarity]
  2. Demonstrate on an execution timeline that parallel events and operations can take place simultaneously (i.e., at the same time). Explain how work can be performed in less elapsed time if this parallelism can be exploited. [Familiarity]
  3. Explain other uses of parallelism, such as for reliability/redundancy of execution. [Familiarity]
  4. Define the differences between the concepts of Instruction Parallelism, Data Parallelism, Thread Parallelism/Multitasking, Task/Request Parallelism. [Familiarity]
  5. Write more than one parallel program (e.g., one simple parallel program in more than one parallel programming paradigm; a simple parallel program that manages shared resources through synchronization primitives; a simple parallel program that performs simultaneous operations on partitioned data through task parallelism (e.g., parallel search); a simple parallel program that performs step-by-step pipeline processing through message passing). [Usage]
  6. Use performance tools to measure speed-up achieved by parallel programs in terms of both problem size and number of resources. [Assessment]
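One of the program styles in outcome 5, step-by-step pipeline processing through message passing, can be sketched with threads and queues; the stage functions here are arbitrary placeholders.

```python
# Sketch of a two-stage pipeline: each stage consumes messages from its
# input queue, transforms them, and passes them downstream.
import threading
import queue

DONE = object()  # sentinel marking the end of the message stream

def stage(fn, inbox, outbox):
    """One pipeline stage, run in its own thread."""
    while True:
        item = inbox.get()
        if item is DONE:
            outbox.put(DONE)   # propagate shutdown downstream
            return
        outbox.put(fn(item))

q1, q2, q3 = queue.Queue(), queue.Queue(), queue.Queue()
threading.Thread(target=stage, args=(lambda x: x + 1, q1, q2)).start()
threading.Thread(target=stage, args=(lambda x: x * 2, q2, q3)).start()

for x in [1, 2, 3]:
    q1.put(x)
q1.put(DONE)

results = []
while (item := q3.get()) is not DONE:
    results.append(item)
assert results == [4, 6, 8]   # (x + 1) * 2 for each input
```

Because the stages overlap in time, a long stream of inputs finishes in less elapsed time than running both transformations back-to-back on one thread.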

 

SF/Evaluation

[3 Core-Tier1 hours]

[Cross-reference PD/Parallel Performance]

Topics:

  • Performance figures of merit
  • Workloads and representative benchmarks, and methods of collecting and analyzing performance figures of merit
  • CPI (cycles per instruction) equation as a tool for understanding tradeoffs in the design of instruction sets, processor pipelines, and memory system organizations
  • Amdahl’s Law: the part of the computation that cannot be sped up limits the effect of the parts that can
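The CPI equation and Amdahl's Law lend themselves to a small worked example. The instruction mix and cycle counts below are made-up illustrations, not measurements of any real machine.

```python
# CPI = sum over instruction classes of (fraction of mix * cycles per
# instruction of that class); Amdahl's Law bounds overall speedup by the
# part of the computation that cannot be sped up.

def cpi(instruction_mix):
    """instruction_mix: list of (fraction, cycles) pairs."""
    return sum(frac * cycles for frac, cycles in instruction_mix)

def amdahl_speedup(parallel_fraction, n):
    """Overall speedup when a fraction p of the work is sped up n times."""
    p = parallel_fraction
    return 1.0 / ((1.0 - p) + p / n)

# Example mix: 50% ALU ops at 1 cycle, 30% loads at 4, 20% branches at 2.
assert round(cpi([(0.5, 1), (0.3, 4), (0.2, 2)]), 2) == 2.1

# Even with a million processors, a 90%-parallel program tops out near 10x.
assert round(amdahl_speedup(0.9, 1_000_000), 1) == 10.0
```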

 

Learning Outcomes:

  1. Explain how the components of system architecture contribute to improving its performance. [Familiarity]
  2. Describe Amdahl’s law and discuss its limitations. [Familiarity]
  3. Design and conduct a performance-oriented experiment. [Usage]
  4. Use software tools to profile and measure program performance. [Assessment]

 

SF/Resource Allocation and Scheduling

[2 Core-Tier2 hours]

Topics:

  • Kinds of resources (e.g., processor share, memory, disk, net bandwidth)
  • Kinds of scheduling (e.g., first-come, priority)
  • Advantages of fair scheduling, preemptive scheduling

 

Learning Outcomes:

  1. Define how finite computer resources (e.g., processor share, memory, storage and network bandwidth) are managed by their careful allocation to existing entities. [Familiarity]
  2. Describe the scheduling algorithms by which resources are allocated to competing entities, and the figures of merit by which these algorithms are evaluated, such as fairness. [Familiarity]
  3. Implement simple scheduling algorithms. [Usage]
  4. Evaluate alternative scheduler implementations using figures of merit. [Assessment]
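Two of the scheduling disciplines named above can be compared on a simple figure of merit, mean waiting time. The model below is deliberately minimal: non-preemptive, all jobs arrive at time zero, and the job tuples are illustrative.

```python
# Sketch: first-come-first-served vs. priority scheduling of jobs given
# as (name, burst_time, priority) tuples, all arriving at time 0.

def mean_wait(jobs):
    """Mean time each job waits before it starts running."""
    wait, clock = 0, 0
    for _name, burst, _prio in jobs:
        wait += clock
        clock += burst
    return wait / len(jobs)

def fcfs(jobs):
    """First-come, first-served: run in arrival order."""
    return list(jobs)

def priority(jobs):
    """Priority scheduling: lower number = higher priority."""
    return sorted(jobs, key=lambda j: j[2])

jobs = [("a", 8, 3), ("b", 2, 1), ("c", 4, 2)]
assert mean_wait(fcfs(jobs)) == (0 + 8 + 10) / 3       # 6.0
assert mean_wait(priority(jobs)) == (0 + 2 + 6) / 3    # ~2.67
```

Running the short, high-priority jobs first cuts the mean wait; fairness across long-running jobs is the competing figure of merit.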

 

SF/Proximity

[3 Core-Tier2 hours]

[Cross-reference: AR/Memory Management, OS/Virtual Memory]

Topics:

  • Speed of light and computers (one foot per nanosecond vs. one GHz clocks)
  • Latencies in computer systems: memory vs. disk vs. across-the-network latencies
  • Caches and the effects of spatial and temporal locality on performance in processors and systems
  • Caches and cache coherency in databases, operating systems, distributed systems, and computer architecture
  • Introduction to the processor memory hierarchy and the formula for average memory access time

 

Learning Outcomes:

  1. Explain the importance of locality in determining performance. [Familiarity]
  2. Describe why things that are close in space take less time to access. [Familiarity]
  3. Calculate average memory access time and describe the tradeoffs in memory hierarchy performance in terms of capacity, miss/hit rate, and access time. [Assessment]
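The average memory access time formula from the topics list is AMAT = hit time + miss rate × miss penalty, applied recursively down the hierarchy. The latencies and miss rates below are illustrative round numbers.

```python
# AMAT for a memory hierarchy level; the miss penalty of one level is
# the AMAT of the level below it.

def amat(hit_time, miss_rate, miss_penalty):
    return hit_time + miss_rate * miss_penalty

# Single level: L1 hits in 1 ns, misses 5% of the time, memory at 100 ns.
assert round(amat(1, 0.05, 100), 3) == 6.0

# Inserting an L2 (10 ns hit, 20% local miss rate) shrinks the effective
# L1 miss penalty, so overall AMAT drops from 6 ns to 2.5 ns.
l2 = amat(10, 0.20, 100)          # 30.0 ns seen by an L1 miss
assert round(amat(1, 0.05, l2), 3) == 2.5
```

The same arithmetic exposes the capacity vs. hit-rate vs. access-time tradeoffs named in outcome 3: a bigger, slower cache helps only if the miss-rate reduction outweighs the added hit time.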

 

SF/Virtualization and Isolation

[2 Core-Tier2 hours]

Topics:

  • Rationale for protection and predictable performance
  • Levels of indirection, illustrated by virtual memory for managing physical memory resources
  • Methods for implementing virtual memory and virtual machines

 

Learning Outcomes:

  1. Explain why it is important to isolate and protect the execution of individual programs and environments that share common underlying resources. [Familiarity]
  2. Describe how the concept of indirection can create the illusion of a dedicated machine and its resources even when physically shared among multiple programs and environments. [Familiarity]
  3. Measure the performance of two application instances running on separate virtual machines, and determine the effect of performance isolation. [Assessment]
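The level of indirection named in the topics list can be sketched with a toy single-level page table: two programs use the same virtual address yet touch different physical memory. The page size and frame numbers are illustrative.

```python
# Sketch of indirection via a per-process page table mapping virtual
# page numbers to physical frames.

PAGE_SIZE = 4096

def translate(page_table, vaddr):
    """Virtual address -> physical address via the page table."""
    vpn, offset = divmod(vaddr, PAGE_SIZE)
    if vpn not in page_table:
        raise MemoryError(f"page fault at vpn {vpn}")  # illustrative signal
    return page_table[vpn] * PAGE_SIZE + offset

proc_a = {0: 7}   # process A: virtual page 0 -> physical frame 7
proc_b = {0: 3}   # process B: same virtual page, different frame

assert translate(proc_a, 100) == 7 * 4096 + 100
assert translate(proc_b, 100) == 3 * 4096 + 100   # isolation via indirection
```

Neither process can name the other's frames, which is the protection argument of outcome 1 in miniature.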

SF/Reliability through Redundancy

[2 Core-Tier2 hours]

Topics:

  • Distinction between bugs and faults
  • Redundancy through check and retry
  • Redundancy through redundant encoding (error correcting codes, CRC, FEC)
  • Duplication/mirroring/replicas
  • Other approaches to fault tolerance and availability

 

Learning Outcomes:

  1. Explain the distinction between program errors, system errors, and hardware faults (e.g., bad memory) and exceptions (e.g., attempt to divide by zero). [Familiarity]
  2. Articulate the distinction between detecting, handling, and recovering from faults, and the methods for their implementation. [Familiarity]
  3. Describe the role of error correcting codes in providing error checking and correction techniques in memories, storage, and networks. [Familiarity]
  4. Apply simple algorithms for exploiting redundant information for the purposes of data correction. [Usage]
  5. Compare different error detection and correction methods for their data overhead, implementation complexity, and relative execution time for encoding, detecting, and correcting errors. [Assessment]
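The duplication/mirroring/replicas bullet and outcome 4 can be illustrated with the simplest redundant encoding: store three replicas and correct a single corrupted copy by per-position majority vote. This is a teaching sketch with a 3x data overhead, not a practical code.

```python
# Sketch of redundancy through replication: majority vote at each byte
# position corrects any single bad replica.
from collections import Counter

def encode(data):
    """Triple the data: three independent replicas."""
    return [list(data), list(data), list(data)]

def decode(replicas):
    """Majority vote per position; tolerates one corrupted replica."""
    out = []
    for triple in zip(*replicas):
        out.append(Counter(triple).most_common(1)[0][0])
    return bytes(out)

stored = encode(b"hello")
stored[1][0] = 0x00            # corrupt one byte of one replica
assert decode(stored) == b"hello"
```

Error-correcting codes such as Hamming codes or CRC-protected retransmission achieve the same correction guarantees at far lower overhead, which is exactly the comparison outcome 5 asks for.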

SF/Quantitative Evaluation

[Elective]

Topics:

  • Analytical tools to guide quantitative evaluation
  • Order of magnitude analysis (Big O notation)
  • Analysis of slow and fast paths of a system
  • Events and their effect on performance (e.g., instruction stalls, cache misses, page faults)
  • Understanding layered systems, workloads, and platforms, their implications for performance, and the challenges they represent for evaluation
  • Microbenchmarking pitfalls

 

Learning Outcomes:

  1. Explain the circumstances in which a given system performance metric is useful. [Familiarity]
  2. Explain the inadequacies of benchmarks as a measure of system performance. [Familiarity]
  3. Use limit studies or simple calculations to produce order-of-magnitude estimates for a given performance metric in a given context. [Usage]
  4. Conduct a performance experiment on a layered system to determine the effect of a system parameter on a figure of system performance. [Assessment]
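Outcome 3's limit studies amount to back-of-the-envelope arithmetic: pick a resource that bounds the computation and divide. The sketch below estimates a lower bound on the time to sum a large array, assuming it is memory-bandwidth bound; the bandwidth figure is an assumed round number, not a measurement.

```python
# Sketch of a limit study: no implementation of the sum, however clever,
# can beat the time needed to move the data across the memory bus once.

BYTES_PER_DOUBLE = 8
ASSUMED_BANDWIDTH = 20e9        # 20 GB/s -- an illustrative machine

def lower_bound_seconds(n_elements):
    """Best case: every byte crosses the memory bus exactly once."""
    return n_elements * BYTES_PER_DOUBLE / ASSUMED_BANDWIDTH

# One billion doubles: roughly 0.4 s is the floor on this machine.
estimate = lower_bound_seconds(1e9)
assert 0.1 < estimate < 1.0
```

A microbenchmark that reports times far below such a bound is usually measuring a cache-resident or optimized-away workload, one of the pitfalls named in the topics list.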