Final Sample Questions
The final will be cumulative, though it will be weighted more towards the second half of the class. So why not check out:
This bank of questions is taken from prior midterms and finals. The course changes from year to year, so some of the questions may refer to concepts we did not emphasize this year, and some concepts we did emphasize this year may not be represented here.
The final will 3 hours long. It will be open-note, open-book, open-computer, semiopen-network, using rules very similar to those in the midterm.
DATAREP-24. Integer representation
Write the value of the variable or expression in each problem, using signed decimal representation.
For example, if we gave you:
int i = 0xA;
int j = 0xFFFFFFFF;
you would write A) 10 B) -1
QUESTION DATAREP-24A. int i = 0xFFFF;
(You may write this either
in decimal or as an expression using a power of 2)
QUESTION DATAREP-24B. short s = 0xFFFF;
(You may write this
either in decimal or as an expression using a power of 2)
QUESTION DATAREP-24C. unsigned u = 1 \<\< 10;
QUESTION DATAREP-24D. From WeensyOS: unsigned long l = PTE_P \| PTE_U;
QUESTION DATAREP-24E. int j = ~0;
QUESTION DATAREP-24F. From WeensyOS: sizeof(x86_64_pagetable);
QUESTION DATAREP-24G. Given this structure:
struct s {
char c;
short s;
long l;
};
struct s* ps;
This expression: sizeof(ps);
QUESTION DATAREP-24H. Using the structure above: sizeof(\*ps);
QUESTION DATAREP-24I. unsigned char u = 0xABC;
QUESTION DATAREP-24J. signed char c = 0xABC;
DATAREP-25. Data representation
In gdb, you observe the following values for a set of memory locations.
0x100001020: 0xa0 0xb1 0xc2 0xd3 0xe4 0xf5 0x06 0x17
0x100001028: 0x28 0x39 0x4a 0x5b 0x6c 0x7d 0x8e 0x9f
0x100001030: 0x89 0x7a 0x6b 0x5c 0x4d 0x3e 0x2f 0x10
0x100001038: 0x01 0xf2 0xe3 0xd4 0xc5 0xb6 0xa7 0x96
For each C expression below, write its value in hexadecimal. For example, if we gave you:
char *cp = (char*) 0x100001020; cp[0] =
the answer would be 0xa0
.
Assume the following structure and union declarations and variable definitions.
struct _s1 {
int i;
long l;
short s;
};
struct _s2 {
char c[4];
int i;
struct _s1 s;
};
union _u {
char c[8];
int i;
long l;
short s;
};
char* cp = (char*) 0x100001020;
struct _s1* s1 = (struct _s1*) 0x100001020;
struct _s2* s2 = (struct _s2*) 0x100001020;
union _u* u = (union _u*) 0x100001020;
QUESTION DATAREP-25A. cp[4] =
QUESTION DATAREP-25B. cp + 7 =
QUESTION DATAREP-25C. s1 + 1 =
QUESTION DATAREP-25D. s1->i =
QUESTION DATAREP-25E. sizeof(s1) =
QUESTION DATAREP-25F. &s2->s =
QUESTION DATAREP-25G. &u->s =
QUESTION DATAREP-25H. s1->l =
QUESTION DATAREP-25I. s2->s.s =
QUESTION DATAREP-25J. u->l =
ASM-13. Assembly language
Consider the following four assembly functions.
|
|
|
|
(Note: The %sil
register is the lowest-order byte of register
%rsi
, just as %al
is the lowest-order byte of %rax
and
%r8b
is the lowest-order byte of %r8
.)
QUESTION ASM-13A. Which two of the assembly functions perform the exact same task?
QUESTION ASM-13B. What is that task? You can describe it briefly, or give the name of the corresponding C library function.
QUESTION ASM-13C. Explain how the other two functions differ from each other.
IO-11. Caching
QUESTION IO-11A. If it takes 200ns to access main memory, which of the following two caches will produce a lower average access time?
- A cache with a 10ns access time that produces a 90% hit rate
- A cache with a 20ns access time that produces a 98% hit rate
QUESTION IO-11B. Let’s say that you have a direct-mapped cache with
four slots. A page with page number N
must reside in the slot numbered
N % 4
. What is the best hit rate this could achieve given the
following sequence of page accesses?
3 6 7 5 3 2 1 1 1 8
QUESTION IO-11C. What is the best hit rate a fully-associative four-slot cache could achieve for that sequence of page accesses? (A fully-associative cache may put any page in any slot. You may assume you know the full reference stream in advance.)
QUESTION IO-11D. What hit rate would the fully-associative four-slot cache achieve if it used the LRU eviction policy?
KERN-1. Virtual memory
QUESTION KERN-1A. What is the x86-64 page size? Circle all that apply.
- 4096 bytes
- 64 cache lines
- 256 words
0x1000
bytes- 216 bits
- None of the above
The following questions concern the sizes of page tables. Answer the questions in units of pages. For instance, the page tables in WeensyOS each contained one level-1 page table page, one level-2 page table page, one level-3 page table page, and two level-4 page table pages, for a total size of 5 pages per page table.
QUESTION KERN-1B. What is the maximum size (in pages) of an x86-64 page table (page tables only, not destination pages)? You may write an expression rather than a number.
QUESTION KERN-1C. What is the minimum size (in pages) of an x86-64 page table that would allow a process to access 221 distinct physical addresses?
The 64-bit x86-64 architecture is an extension of the 32-bit x86 architecture, which used 32-bit virtual addresses and 32-bit physical addresses. Extensions to this architecture increased both these limits.
- Physical Address Extensions (PAE) allow 32-bit machines to access up to 252 bytes of physical memory (which is about 4000000 GB). That is, virtual addresses are 32 bits, and physical addresses are 52 bits.
- The x86-64 architecture evolves the x86 architecture to a 64-bit word size. x86-64 pointers are 64 bits wide instead of 32. However, only 48 of those bits are meaningful: the upper 16 bits of each virtual address are ignored. Thus, virtual addresses are 48 bits. As with PAE, physical addresses are 52 bits.
QUESTION KERN-1D. Which of these two machines would support a higher number of concurrent processes?
- x86-32 with PAE with 100 GB of physical memory.
- x86-64 with 20 GB of physical memory.
QUESTION KERN-1E. Which of these two machines would support a higher maximum number of threads per process?
- x86-32 with PAE with 100 GB of physical memory.
- x86-64 with 20 GB of physical memory.
KERN-2. Virtual memory and kernel programming
These problems consider implementations of virtual memory features in a
WeensyOS-like operating system. Recall the signatures and specifications
of the virtual_memory_lookup
and virtual_memory_map
functions:
// virtual_memory_map(pagetable, va, pa, sz, perm, allocator)
` // Map virtual address range `[va, va+sz)` in `pagetable`. `
` // When `X >= 0 && X < sz`, the new pagetable will map virtual address `
` // `va+X` to physical address `pa+X` with permissions `perm`. `
//
` // Precondition: `va`, `pa`, and `sz` must be multiples of PAGESIZE `
// (4096).
//
` // Typically `perm` is a combination of `PTE_P` (the memory is Present), `
` // `PTE_W` (the memory is Writable), and `PTE_U` (the memory may be `
` // accessed by User applications). If `!(perm & PTE_P)`, `pa` is ignored. `
//
// Sometimes mapping memory will require allocating new page tables. The
` // `allocator` function should return a newly allocated page, or NULL `
// on allocation failure.
//
// Returns 0 if the map succeeds, -1 if it fails because a required
// page table could not be allocated.
int virtual_memory_map(x86_64_pagetable* pagetable, uintptr_t va,
uintptr_t pa, size_t sz, int perm,
x86_64_pagetable* (*allocator)(void));
// virtual_memory_lookup(pagetable, va)
` // Returns information about the mapping of the virtual address `va` in `
` // `pagetable`. The information is returned as a `vamapping` object, `
// which has the following components:
typedef struct vamapping {
int pn; // physical page number; -1 if unmapped
uintptr_t pa; // physical address; (uintptr_t) -1 if unmapped
int perm; // permissions; 0 if unmapped
} vamapping;
vamapping virtual_memory_lookup(x86_64_pagetable* pagetable, uintptr_t va);
Also recall that WeensyOS tracks physical memory using an array of
pageinfo
structures:
typedef struct physical_pageinfo {
int8_t owner;
int8_t refcount; // 0 means the page is free
} physical_pageinfo;
static physical_pageinfo pageinfo[PAGENUMBER(MEMSIZE_PHYSICAL)];
The WeensyOS kernel occupies virtual addresses 0 through 0xFFFFF; the
process address space starts at PROC_START_ADDR
== 0x100000 and goes
up to (but not including) MEMSIZE_VIRTUAL
== 0x300000.
QUESTION KERN-2A. True or false: On x86-64 Linux, like on WeensyOS, the kernel occupies low virtual addresses.
QUESTION KERN-2B. On WeensyOS, which region of a process’s address space is closest to the kernel’s address space? Choose from code, data, stack, and heap.
QUESTION KERN-2C. On Linux on an x86-64 machine, which region of a process’s address space is closest to the kernel’s address space? Choose from code, data, stack, and heap.
Recall that the WeensyOS sys_page_alloc(addr)
system call allocates a
new physical page at the given virtual address. Here’s an example kernel
implementation of sys_page_alloc
, taken from the WeensyOS interrupt
function:
case INT_SYS_PAGE_ALLOC: {
uintptr_t addr = current->p_registers.reg_rdi;
//
[A]
int free_pn = find_free_physical_page();
if (free_pn < 0) { // no free physical pages
console_printf(CPOS(24, 0), 0x0C00, "Out of physical memory!\n");
current->p_registers.reg_rax = -1; // return result in %rax
break; // will call run(current)
}
//
[B]
// otherwise, allocate the page
assert(pageinfo[free_pn].refcount == 0);
pageinfo[free_pn].refcount += 1;
pageinfo[free_pn].owner = current->p_pid;
//
[C]
// and map it into the user’s address space
virtual_memory_map(current->p_pagetable, addr, PAGEADDRESS(free_pn), PAGESIZE, PTE_P | PTE_U | PTE_W, NULL);
current->p_registers.reg_rax = 0;
//
[D]
break;
}
QUESTION KERN-2D. Thanks to insufficient checking, this implementation allows a WeensyOS process to crash the operating system or even take it over. This kernel is not isolated. What the kernel should do is return −1 when the calling process supplies bad arguments. Write code that, if executed at slot [A], would preserve kernel isolation and handle bad arguments correctly.
QUESTION KERN-2E. This implementation has another problem, which the following process would trigger:
void process_main(void) {
heap_top = ROUNDUP((uint8_t*) end, PAGESIZE); // first address in heap region
while (1) {
sys_page_alloc(heap_top);
sys_yield();
}
}
This process code repeatedly allocates a page at the same address. What should happen is that the kernel should repeatedly deallocate the old page and replace it with a newly-allocated page. But that’s not what will happen given the example implementation.
What will happen instead? And what is the name of this kind of problem?
QUESTION KERN-2F. Write code that would fix the problem, and name
the slot in the INT_SYS_PAGE_ALLOC
implementation where your code
should go.
KERN-3. Kernel programming
WeensyOS processes are quite isolated: the only way they can communicate is by using the console. Let’s design some system calls that will allow processes to explicitly share pages of memory. Then the processes can communicate by writing and reading the shared memory region. Here are two new WeensyOS system calls that allow minimal page sharing; they return 0 on success and –1 on error.
int share(pid_t p, void* addr)
Allow process p
to access the page at address addr
.
int attach(pid_t p, void* remote_addr, void* local_addr)
Access the page in process p
’s address space at address remote_addr
.
That physical page is added to the calling process’s address space at
address local_addr
, replacing any page that was previously mapped
there. It is an error if p
has not shared the page at remote_addr
with the calling process.
Here’s an initial implementation of these system calls, written as
clauses in the WeensyOS kernel’s exception
function.
case INT_SYS_SHARE: {
pid_t p = current->p_registers.reg_rdi;
uintptr_t addr = current->p_registers.reg_rsi;
//
[A]
int shindex = current->p_nshared;
if (shindex >= MAX_NSHARED) {
goto return_error;
}
//
[B]
++current->p_nshared;
current->p_shared[shindex].sh_addr = addr;
current->p_shared[shindex].sh_partner = p;
current->p_registers.reg_rax = 0;
break;
}
case INT_SYS_ATTACH: {
pid_t p = current->p_registers.reg_rdi;
uintptr_t remote_addr = current->p_registers.reg_rsi;
uintptr_t local_addr = current->p_registers.reg_rdx;
//
[C]
int shindex = -1;
for (int i = 0; i < processes[p].p_nshared; ++i) {
if (processes[p].p_shared[i].sh_addr == remote_addr
&& processes[p].p_shared[i].sh_partner == current->p_pid) {
shindex = i;
}
}
if (shindex == -1) {
goto return_error;
}
//
[D]
vamapping vam = virtual_memory_lookup(processes[p].p_pagetable, remote_addr);
//
[E]
virtual_memory_map(current->p_pagetable, local_addr,
vam.pa, PAGESIZE, PTE_P|PTE_W|PTE_U);
//
[F]
current->p_registers.reg_rax = 0;
break;
}
return_error:
current->p_registers.reg_rax = -1;
break;
Some notes:
- The implementation stores sharing records in an array. A process may
call
share
successfully at mostMAX_NSHARED
times. After that, its futureshare
calls will return an error. processes[p].p_nshared
is initialized to 0 for all processes.- Assume that WeensyOS has been implemented as in Problem Set 4 up through step 6 (shared read-only memory).
QUESTION KERN-3A. True or false: Given this implementation, a single
WeensyOS process can cause the kernel to crash simply by calling share
one or more times (with no process ever calling attach
). If true, give
an example of a call or calls that would likely crash the kernel.
QUESTION KERN-3B. True or false: Given this implementation, a single
WeensyOS process can cause the kernel to crash simply by calling
attach
one or more times (with no process ever calling share
). If
true, give an example of a call or calls that would likely crash the
kernel.
QUESTION KERN-3C. True or false: Given this implementation, WeensyOS
processes 2 and 3 could work together to obtain write access to the
kernel code located at address KERNEL_START_ADDR
. If true, give an
example of calls that would obtain this access.
QUESTION KERN-3D. True or false: Given this implementation, WeensyOS processes 2 and 3 could work together to obtain write access to any memory, without crashing or modifying kernel code or data. If true, give an example of calls that would obtain access to a page mapped at address 0x110000 in process 5.
QUESTION KERN-3E. True or false: Given this implementation, WeensyOS
child processes 2 and 3 could work together to modify the code run by a
their shared parent, process 1, without crashing or modifying kernel
code or data. If true, give an example of calls that would obtain write
access to process 1’s code, which is mapped at address
PROC_START_ADDR
.
QUESTION KERN-3F. Every “true” answer to the preceding questions is a bug in WeensyOS’s process isolation. Fix these bugs. Write code snippets that address these problems, and say where they go in the WeensyOS code (for instance, you could refer to bracketed letters to place your snippets); or for partial credit describe what your code should do.
KERN-4. Teensy OS VM System
The folks at Teensy Computers, Inc, need your help with their VM system. The hardware team that developed the VM system abruptly left and the folks remaining aren't quite sure how VM works. I volunteered you to help them.
The Teensy machine has a 16-bit virtual address space with 4 KB pages. The Teensy hardware specifies a single-level page table. Each entry in the page table is 16-bits. Eight of those bits are reserved for the physical page number and 8 of the bits are reserved for flag values. Sadly, the hardware designers did not document what the bits do!
QUESTION KERN-4A. How many pages are in the Teensy virtual address space?
QUESTION KERN-4B. How many bits comprise a physical address?
QUESTION KERN-4C. Is the physical address space larger or smaller than the virtual address space?
QUESTION KERN-4D. Write, in hex, a PAGE_OFFSET_MASK
(the value
that when anded with an address returns the offset of the address on a
page).
QUESTION KERN-4E. Write a C expression that takes a virtual address,
in the variable vaddr
, and returns the virtual page number.
You are now going to work with the Teensy engineers to figure out what those other bits in the page table entries mean! Fortunately, they have some engineering notes from the hardware team—they need your help in making sense of them. Each letter below has the contents of a note, state what you can conclude from that note about the lower 8 bits of the page table entries.
QUESTION KERN-4F. “Robin, I ran 8 tests using a kernel that did nothing other than loop infinitely -- for each test I set a different bit in all the PTEs of the page table. All of them ended up in the exception handler except for the one where I set bit 4. Any idea what this means?”
QUESTION KERN-4G. “Lynn, I'm writing a memory test that iterates over all of memory making sure that I can read back the same pattern I write into memory. If I don't set bit 7 of the page table entries to 1, I get permission faults. Do you know what might be happening?”
QUESTION KERN-4H. “Pat, I almost have user level processes running! It seems that the user processes take permission faults unless I have both bit 4 and bit 3 set. Do you know why?”
KERN-5. Teensy OS Page Tables
The Teensy engineers are well on their way now, but they do have a few bugs and they need your help debugging the VM system. They hand you the following page table, using the notation we used for Assignment 6 for permissions, and need your help specifying correct behavior for the operations that follow.
Index |
Physical |
Permissions |
---|---|---|
0 |
0x00 |
PTE_U |
1 |
0x01 |
PTE_P |
2 |
0x02 |
PTE_P PTE_W |
3 |
0x03 |
PTE_P PTE_U PTE_W |
4 |
0xFF |
PTE_U PTE_W |
5 |
0xFE |
PTE_U |
6 |
0x80 |
PTE_W |
7 |
0x92 |
PTE_P PTE_W PTE_U |
8 |
0xAB |
PTE_P PTE_W PTE_U |
9 |
0x09 |
PTE_P PTE_U |
10 |
0xFE |
PTE_P PTE_U |
11 |
0x00 |
PTE_W |
12 |
0x11 |
PTE_U |
Rest of PTEs follow and are all invalid |
For each problem below, write either the physical address of the given virtual address or identify what fault would be produced. The fault types should be one of:
- Invalid page access (there is no mapping for the requested page)
- Privilege violation (user level process trying to access a supervisor page)
- Permission violation (attempt to write a read-only page)
QUESTION KERN-5A. The kernel dereferences a NULL pointer
QUESTION KERN-5B. A user process dereferences a NULL pointer
QUESTION KERN-5C. The kernel writes to the address 0x8432
QUESTION KERN-5D. A user process writes to the address 0xB123
QUESTION KERN-5E. The kernel reads from the address 0x9876
QUESTION KERN-5F. A user process reads from the address 0x7654
QUESTION KERN-5G. A user process writes to the address 0xABCD
QUESTION KERN-5H. A user process writes to the address 0x2321
KERN-6. Virtual Memory
You may recall that Professor Seltzer loves inventing strange and wonderful virtual memory systems—she’s at it again! The Tom and Ginny (TAG) processor has 16-bit virtual addresses and 256-byte pages. Virtual memory translation is provided via two-level page tables as shown in the figure below.
QUESTION KERN-6A. How many entries are in an L1 page table?
QUESTION KERN-6B. How many entries are in an L2 page table?
QUESTION KERN-6C. If each page table entry occupies 2 bytes of memory, how large (in bytes) is a single page table?
QUESTION KERN-6D. What is the maximum number of L1 page tables that a process can have?
QUESTION KERN-6E. What is the maximum number of L2 page tables that a process can have?
The Figure below shows how the PTEs are organized.
QUESTION KERN-6F. Given the number of bits allocated to the physical page number in the PTE, how much physical memory can the TAG processor support?
Finally, you’ll actually perform virtual address translation in software. We will define a TAG page table entry as follows:
typedef unsigned short tag_pageentry;
QUESTION KERN-6G. Write a function
unsigned virtual_to_physical(tag_pageentry* pagetable, unsigned vaddr)
that takes as arguments:
pagetable
: a TAG page table (that is, a pointer to the first entry in the L1 page table)vaddr
: a TAG virtual address
and returns a physical address if a valid mapping exists and an invalid physical address if no valid mapping exists. Comment your code to explain each step that you want the function to take.
KERN-7. Cost expressions
In the following questions, you will reason about the abstract costs of various operations, using the following tables of constants.
Table of Basic Costs
S | System call overhead (i.e., entering and exiting the kernel) |
F | Page fault cost (i.e., entering and exiting the kernel) |
P | Cost of allocating a new physical page |
M | Cost of installing a new page mapping |
B | Cost of copying a byte |
Table of Sizes
nk | Number of memory pages allocated to the kernel | |
Per-process sizes (defined for each process p) | ||
np | Number of memory pages allocated to p | |
rp | Number of read-only memory pages allocated to p | |
wp | = np − rp | Number of writable memory pages allocated to p |
mp | Number of memory pages actually modified by p after the previous fork() |
One of our tiny operating systems from class (OS02) included a program that called a recursive function. When the recursive function’s argument grew large enough, the stack pointer moved beyond the memory actually allocated for the stack, and the program crashed.
QUESTION KERN-7A. In our first solution for this problem, the
process called the sys_page_alloc(void *addr)
system call, which
allocated and mapped a single new page at address addr
(the new stack
page). Write an expression for the cost of this sys_page_alloc()
system call in terms of the constants above.
QUESTION KERN-7B. Our second solution for this problem changed the operating system’s page fault handler. When a fault occurred in a process’s stack region, the operating system allocated a new page to cover the corresponding address and restarted the process. Write an expression for the cost of such a fault in terms of the constants above.
QUESTION KERN-7C. Design a revised version of sys_page_alloc
that
supports batching. Give its signature and describe its behavior.
QUESTION KERN-7D. Write an expression for the cost of a call to your batching allocation API.
In the remaining questions, a process p calls fork()
, which creates
a child process, c.
Assume that the base cost of performing a fork()
system call is Φ.
This cost includes the fork()
system call overhead (S), the overhead
of allocating a new process, the overhead of allocating a new page
directory with kernel mappings, and the overhead of copying registers.
But it does not include overhead from allocating, copying, or mapping
other memory.
QUESTION KERN-7E. Consider the following implementations of
fork()
:
A. | Naive fork: Copy all process memory (WeensyOS, Step 5). |
B. | Eager fork: Copy all writable process memory; share read-only process memory, such as code (WeensyOS, Step 6). |
C. | Copy-on-write fork: initially share all memory as read-only. Create writable copies later, on demand, in response to write faults (WeensyOS extra credit). |
Which expression best represents the total cost of the fork()
system
call in process p, for each of these fork implementations? Only
consider the system call itself, not later copy-on-write faults.
(Note: Per-process variables, such as n, are defined for each process. So, for example, np is the number of pages allocated to the parent process p, and nc is the number of pages allocated to the child process c.)
- Φ
- Φ + np × M
- Φ + (np + wp) × M
- Φ + np × 212 × (B + F)
- Φ + np × (212B + P + M)
- Φ + np × (P + M)
- Φ + wp × (212B + P + M)
- Φ + np × (212B + P + M) − rp × (212B + P)
- Φ + np × M + mc × (P + F)
- Φ + np × M + mc × (212B + F + P)
- Φ + np × M + (mp+mc) × (P + F)
- Φ + np × M + (mp+mc) × (212B + F + P)
QUESTION KERN-7F. When would copy-on-write fork be more efficient than eager fork (meaning that the sum of all fork-related overheads, including faults for pages that were copied on write, would be less for copy-on-write fork than eager fork)? Circle the best answer.
- When np < nk.
- When wp × F < wp × (M + P).
- When mc × (F + M + P) < wp × (M + P).
- When (mp+mc) × (F + M + P + 212B) < wp × (P + 212B).
- When (mp+mc) × (F + P + 212B) < wp × (P + M + 212B).
- When mp < mc.
- None of the above.
SH-1. Processes
This question builds versions of the existing system calls based on new abstractions. Here are three system calls that define a new abstraction called a rendezvous.
int newrendezvous(void) Returns a rendezvous ID that hasn’t been used yet.
int rendezvous(int rid, int data) Blocks the calling process P1 until some other process P2 calls rendezvous() with the same rid (rendezvous ID). Then, both of the system calls return, but P1’s system call returns P2’s data and vice versa. Thus, the two processes swap their data. Rendezvous acts pairwise; if three processes call rendezvous, then two of them will swap values and the third will block, waiting for a fourth.
void freezerendezvous(int rid, int freezedata) Freezes the rendezvous rid. All future calls to rendezvous(rid, data) will immediately return freezedata.
Here's an example. The two columns represent two processes. Assume they are the only processes using rendezvous ID 0.
int result = rendezvous(0, 5); |
printf("About to rendezvous\n"); |
int result = rendezvous(0, 600); |
|
/* The processes swap data; | both become runnable */ |
printf("Process A got %d\n", result); |
printf("Process B got %d\n", result); |
This code will print
About to rendezvous
Process B got 5
Process A got 600
(the last 2 lines might appear in either order).
QUESTION SH-1A. How might you implement pipes in terms of rendezvous? Try to figure out analogues for the pipe(), close(), read(), and write() system calls (perhaps with different signatures), but only worry about reading and writing 1 character at a time.
QUESTION SH-1B. Can a rendezvous-pipe support all pipe features?
SH-2. Process management
Here’s the skeleton of a shell function implementing a simple
two-command pipeline, such as “cmd1 | cmd2
”.
void simple_pipe(const char* cmd1, char* const* argv1, const char* cmd2, char* const* argv2) {
int pipefd[2], r, status;
[A]
pid_t child1 = fork();
if (child1 == 0) {
[B]
execvp(cmd1, argv1);
}
assert(child1 > 0);
[C]
pid_t child2 = fork();
if (child2 == 0) {
[D]
execvp(cmd2, argv2);
}
assert(child2 > 0);
[E]
}
And here is a grab bag of system calls.
[1] close(pipefd[0]);
[2] close(pipefd[1]);
[3] dup2(pipefd[0], STDIN_FILENO);
[4] dup2(pipefd[0], STDOUT_FILENO);
[5] dup2(pipefd[1], STDIN_FILENO);
[6] dup2(pipefd[1], STDOUT_FILENO);
[7] pipe(pipefd);
[8] r = waitpid(child1, &status, 0);
[9] r = waitpid(child2, &status, 0);
Your task is to assign system call IDs, such as “1
”, to slots, such as
“A
”, to achieve several behaviors, including a correct pipeline and
several incorrect pipelines. For each question:
- You may use each system call ID once, more than once, or not at all.
- You may use zero or more system call IDs per slot. Write them in the order they should appear in the code.
- You may assume that no signals are delivered to the shell process (so
no system call ever returns an
EINTR
error). - The function should wait for both commands in the pipeline to complete before returning.
- It may help to detach the last “Reference material” page of the exam.
QUESTION SH-2A. Implement a correct foreground pipeline.
A |
B (child1) |
C |
D (child2) |
E |
---|---|---|---|---|
QUESTION SH-2B. Implement a pipeline so that, given arguments
corresponding to “echo foo | wc -c
”, the wc
process reads “foo
”
from its standard input but does not exit thereafter. For partial
credit describe in words how this might happen.
A |
B (child1) |
C |
D (child2) |
E |
---|---|---|---|---|
QUESTION SH-2C. Implement a pipeline so that, given arguments
corresponding to “echo foo | wc -c
”, “foo
” is printed to the
shell’s standard output and the wc
process prints “0
”. (In a
correctly implemented pipeline, “wc
” would print 4
, which is the
number of characters in “foo\n
”.) For partial credit describe in
words how this might happen.
A |
B (child1) |
C |
D (child2) |
E |
---|---|---|---|---|
QUESTION SH-2D. Implement a pipeline that appears to work correctly
on “echo foo | wc -c
”, but always blocks forever if the left-hand
command outputs more than 65536 characters. For partial credit
describe in words how this might happen.
A |
B (child1) |
C |
D (child2) |
E |
---|---|---|---|---|
QUESTION SH-2E. Implement a pipeline so that, given arguments
corresponding to “echo foo | wc -c
”, both echo
and wc
report a
“Bad file descriptor” error. (This error, which corresponds to EBADF
,
is returned when a file descriptor is not valid or does not support the
requested operation.) For partial credit describe in words how this
might happen.
A |
B (child1) |
C |
D (child2) |
E |
---|---|---|---|---|
SH-3. Processes
Consider the two programs shown below.
// Program 1
#include <stdio.h>
#include <unistd.h>
int
main(void)
{
printf("PID %d running prog1\n", getpid());
}
// Program 2
#include <stdio.h>
#include <unistd.h>
int
main(void)
{
char *argv[2];
argv[0] = "prog1";
argv[1] = NULL;
printf("PID %d running prog2\n", getpid());
int r = execv("./prog1", argv);
printf("PID %d exiting from prog2\n", getpid());
}
QUESTION SH-3A. How many different PIDs will print out if you run Program 2?
QUESTION SH-3B. How many lines of output will you see?
Now, let's assume that we change Program 2 to the following:
// Program 2B
#include <stdio.h>
#include <unistd.h>
int
main(void)
{
char* argv[2];
argv[0] = "prog1";
argv[1] = NULL;
printf("PID %d running prog2\n", getpid());
pid_t p = fork();
if (p == 0) {
int r = execv("./prog1", argv);
} else {
printf("PID %d exiting from prog2\n", getpid());
}
}
QUESTION SH-3C. How many different PIDs will print out if you run Program 2B?
QUESTION SH-3D. How many lines of output will you see?
Finally, consider this version of Program 2.
// Program 2C
#include <stdio.h>
#include <unistd.h>
int
main(void)
{
char *argv[2];
argv[0] = "prog1";
argv[1] = NULL;
printf("PID %d running prog2\n", getpid());
pid_t p = fork();
pid_t q = fork();
if (p == 0 || q == 0) {
int r = execv("./prog1", argv);
} else {
printf("PID %d exiting from prog2\n", getpid());
}
}
QUESTION SH-3E. How many different PIDs will print out if you run Program 2C?
QUESTION SH-3F. How many lines of output will you see?
SH-4. Be a CS61 TF!
You are a CS61 teaching fellow. A student working on A4 is having difficulty getting pipes working. S/he comes to you for assistance. The function below is intended to traverse a linked list of commands, fork/exec the indicated processes, and hook up the pipes between commands correctly. The student has commented it reasonably, but is quite confused about how to finish writing the code. Can you help? Figure out what code to add at points A, B, and C.
#include "sh61.h"
typedef struct command command;
struct command {
command *next; // Next in sequence of commands
int argc; // number of arguments
int ispipe; // pipe symbol follows this command
char** argv; // arguments, terminated by NULL
pid_t pid; // pid running this command
};
void
do_pipes(command *c)
{
pid_t newpid;
int havepipe = 0; // We had a pipe on the previous command
int lastpipe[2] = {-1, -1};
int curpipe[2];
do {
if (c->ispipe) {
int r = pipe(curpipe);
assert(r == 0);
}
newpid = fork();
assert(newpid >= 0);
if (newpid == 0) {
if (havepipe) {
// There was a pipe on the last command; It's stored
// in lastpipe; I need to hook it up to this process???
// **** PART A ****
}
if (c->ispipe) {
// The current command is a pipe -- how do I hook it up???
// **** PART B ****
}
execvp(c->argv[0], c->argv);
fprintf(stderr, "Exec failed\n");
_exit(1);
}
// I bet there is some cleanup I have to do here!?
// **** PART C ****
// Set up for the next command
havepipe = c->ispipe;
if (c->ispipe) {
lastpipe[0] = curpipe[0];
lastpipe[1] = curpipe[1];
}
c->pid = newpid;
c = c->next;
} while (newpid != -1 && havepipe);
}
QUESTION SH-4A. What should go in the Part A space above, if anything?
QUESTION SH-4B. What should go in the Part B space above, if anything?
QUESTION SH-4C. What should go in the Part C space above, if anything?
SH-5. Spork
Patty Posix has an idea for a new system call, spork
. Her system
call combines fork
, file descriptor manipulations, and execvp
.
It’s pretty cool:
typedef struct {
int type; // equals SPORK_OPEN, SPORK_CLOSE, or SPORK_DUP2
int fd;
int old_fd; // SPORK_DUP2 only
const char* filename; // SPORK_OPEN only
int flags; // SPORK_OPEN only
mode_t mode; // SPORK_OPEN only
} spork_file_action_t;
pid_t `**`spork`**`(const char* file, const spork_file_action_t* file_actions, int n_file_actions, char* argv[]);
Here’s how spork
works.
- First,
spork
forks a child process. - The child process loops over the
file_actions
array (there aren_file_actions
elements) and performs each file action in turn. A file actionfa
means different things depending on its type. Specifically:fa-\>type == SPORK_OPEN
: The child process opens the file namedfa-\>filename
with flagsfa-\>flags
and optional modefa-\>mode
, as if byopen(fa-\>filename, fa-\>flags, fa-\>mode)
. The opened file descriptor is given numberfa-\>fd
. (Note that this requires multiple steps, since the file must be first opened and then moved tofa-\>fd
.)fa-\>type == SPORK_CLOSE
: The child process closes file descriptorfa-\>fd
.fa-\>type == SPORK_DUP2
: The child process makesfa-\>fd
a duplicate offa-\>old_fd
.
- Finally, the child process executes the given
file
with argument listargv
. - If all these steps succeed, then
spork
returns the child process ID. If any of the steps fails, then eitherspork
returns –1 and creates no child, or the child process exits with status 127. In particular, if a file action fails, then the child process exits with status 127 before callingexecvp
.
This function uses spork
to print the number of words in a file to
standard output.
void print_word_count(const char* file) {
spork_file_action_t file_actions[1];
file_actions[0].type = SPORK_OPEN;
file_actions[0].fd = STDIN_FILENO;
file_actions[0].filename = file;
file_actions[0].flags = O_RDONLY;
const char* argv[2] = {"wc", NULL};
pid_t p = spork("wc", file_actions, 1, argv);
assert(p >= 0);
waitpid(p, NULL, 0);
}
QUESTION SH-5A. Use spork
to implement the following function.
` // Create a pipeline like `argv1 | argv2`. `
// The pipeline consists of two child processes, one running the command with argument
` // list `argv1` and one running the command with argument list `argv2`. The standard `
` // output of `argv1` is piped to the standard input of `argv2`. `
` // Return the PID of the `argv2` process or -1 on failure. `
pid_t `**`make_pipeline`**`(char* argv1[], char* argv2[]);
QUESTION SH-5B. Now, implement spork
in terms of system calls
you already know. For full credit, make sure you catch all errors. Be
careful of SPORK_OPEN
.
QUESTION SH-5C. Can fork
be implemented in terms of spork
?
Why or why not?
QUESTION SH-5D. At least one of the file action types is
redundant, meaning a spork
caller could simulate its behavior
using the other action types and possibly some additional system calls.
Say which action types are redundant, and briefly describe how they
could be simulated.
SH-6. File descriptor facts
Here are twelve file descriptor-oriented system calls.
accept |
bind |
close |
connect |
dup2 |
listen |
open |
pipe |
read |
select |
socket |
write |
QUESTION SH-6A. Which of these system calls may cause the number of open file descriptors to increase? List all that apply.
QUESTION SH-6B. Which of these system calls may close a file descriptor? List all that apply. Note that some system calls might close a file descriptor even though the total number of open file descriptors remains the same.
QUESTION SH-6C. Which of these system calls can block? List all that apply.
QUESTION SH-6D. Which system calls can open at least one file descriptor where that file descriptor is suitable for both reading and writing? List all that apply.
QUESTION SH-6E. Which system calls must a network server make in order to receive a connection on a well-known port? List all that apply in order, first to last. Avoid unnecessary calls.
QUESTION SH-6F. Which system calls must a network client make in order to (1) connect to a server, (2) send a message, (3) receive a reply, and (4) close the connection? List all that apply in order, first to last. Avoid unnecessary calls.
NET-1. Networking
QUESTION NET-1A. Which of the following system calls should a programmer expect to sometimes block (i.e., to return after significant delay)? Circle all that apply.
1. socket |
5. connect |
|
2. read |
6. write |
|
3. accept |
7. usleep |
|
4. listen |
8. None of these |
QUESTION NET-1B. Below are seven message sequence diagrams demonstrating the operation of a client–server RPC protocol. A request such as “get(X)” means “fetch the value of the object named X”; the response contains that value. Match each network property or programming strategy below with the diagram with which it best corresponds. You will use every diagram once.
1. Loss | 4. Duplication | 7. Exponential backoff | ||
2. Delay | 5. Batching | |||
3. Reordering | 6. Prefetching |
A |
B |
C |
D |
E |
F |
G |
NET-2. Making Network Servers Robust
QUESTION NET-2A. You've built a network server, list the resources that you might run out of if someone launched a DoS attack on you.
QUESTION NET-2B. Sam suggests that you just create a separate thread to handle each incoming connection. Why isn't this necessarily going to work?
QUESTION NET-2C. A server sets up a socket to listen on a connection. When a client wants to establish a connection, how does the server manage the multiple clients? In your answer indicate what system call or calls are used and what they do.
QUESTION NET-2D. Which of the following system calls might block?
- accept
- bind
- connect
- listen
- setsockopt
- select
- socket
SYNCH-1. Threads
The following code performs a matrix multiplication, c = ab
, where
a
, b
, and c
are all square matrices of dimension sz
. It uses the
cache-friendly ikj index ordering.
#define MELT(matrix, sz, row, col) (matrix)[(row)*(sz) + (col)]
void matrix_multiply(double* c, const double* a, const double* b, size_t sz) {
for (size_t i = 0; i < sz; ++i)
for (size_t j = 0; j < sz; ++j)
MELT(c, sz, i, j) = 0;
for (size_t i = 0; i < sz; ++i)
for (size_t k = 0; k < sz; ++k)
for (size_t j = 0; j < sz; ++j)
MELT(c, sz, i, j) += MELT(a, sz, i, k) * MELT(b, sz, k, j);
}
But matrix multiplication is a naturally parallelizable problem. Here’s
some code that uses threads to multiply even faster on a multicore
machine. We use sz
parallel threads, one per row of c
.
typedef struct matrix_args {
double* c;
const double* a;
const double* b;
size_t sz;
size_t i;
} matrix_args;
void* matrix_multiply_ikj_thread(void* arg) {
(α) matrix_args* m = (matrix_args*) arg;
(β) for (size_t j = 0; j < m->sz; ++j)
(γ) MELT(m->c, m->sz, m->i, j) = 0;
(δ) for (size_t k = 0; k < m->sz; ++k)
(ε) for (size_t j = 0; j < m->sz; ++j)
(ζ) MELT(m->c, m->sz, m->i, j) += MELT(m->a, m->sz, m->i, k) * MELT(m->b, m->sz, k, j);
(η) return NULL;
}
void matrix_multiply_ikj(double* c, const double* a, const double* b, size_t sz) {
(1) pthread_t* threads = (pthread_t*) malloc(sizeof(pthread_t) * sz);
(2) for (size_t i = 0; i < sz; ++i) {
(3) matrix_args m = { c, a, b, sz, i };
(4) int r = pthread_create(&threads[i], NULL, &matrix_multiply_ikj_thread, &m);
(5) assert(r == 0);
(6) }
(7) for (size_t i = 0; i < sz; ++i)
(8) pthread_join(threads[i], NULL);
(9) free(threads);
}
But when run, this code gives wildly incorrect results.
QUESTION SYNCH-1A. What is wrong? Describe why the problem is a synchronization issue.
QUESTION SYNCH-1B. Write C code showing how the problem could be
fixed with changes only to matrix_multiply_ikj
. Refer to the numbered
lines to indicate replacements and/or insertions. Use one or more
additional heap allocations and no additional calls to pthread
functions. Free any memory you allocate once it is safe to do so.
On single-core machines, the kij order performs almost as fast as the ikj order. Here’s a version of the parallel matrix multiplication code that uses kij.
typedef struct matrix_args_kij {
double* c;
const double* a;
const double* b;
size_t sz;
size_t k;
} matrix_args_kij;
void* matrix_multiply_kij_thread(void* arg) {
(α) matrix_args_kij* m = (matrix_args_kij*) arg;
(β) for (size_t i = 0; i < m->sz; ++i)
(γ) for (size_t j = 0; j < m->sz; ++j)
(δ) MELT(m->c, m->sz, i, j) += MELT(m->a, m->sz, i, m->k) * MELT(m->b, m->sz, m->k, j);
(ε) return NULL;
}
void matrix_multiply_kij(double* c, const double* a, const double* b, size_t sz) {
(1) pthread_t* threads = (pthread_t*) malloc(sizeof(pthread_t) * sz);
(2) for (size_t i = 0; i < sz; ++i)
(3) for (size_t j = 0; j < sz; ++j)
(4) MELT(c, sz, i, j) = 0;
(5) for (size_t k = 0; k < sz; ++k) {
(6) matrix_args_kij m = { c, a, b, sz, k };
(7) int r = pthread_create(&threads[k], NULL, &matrix_multiply_kij_thread, &m);
(8) assert(r == 0);
(9) }
(10) for (size_t k = 0; k < sz; ++k)
(11) pthread_join(threads[k], NULL);
(12) free(threads);
}
This problem has the same problem as the previous version, plus another problem. Even after your fix from 8A–8B is applied, this version produces incorrect results.
QUESTION SYNCH-1C. What is the new problem? Describe why it is a synchronization issue.
QUESTION SYNCH-1D. Write pseudocode or C code that fixes this problem. You should refer to pthread functions. For full credit your solution should have low contention.
SYNCH-2. Synchronization and concurrency
Most synchronization objects have at least two operations. Mutual-exclusion locks support lock and unlock; condition variables support wait and signal; and from section notes you may remember the semaphore synchronization object, one of the earliest synchronization objects ever invented, which supports P and V.
In this problem, you’ll work with a synchronization object with only one operation, which we call a hemiphore. Hemiphores behave like the following; it is very important that you understand this pseudocode.
typedef struct hemiphore {
int value;
} hemiphore;
// Initialize the hemiphore to value 0.
void hemiphore_init(hemiphore* h) {
h->value = 0;
}
` // Block until the hemiphore has value >= `bound`, then ``**`atomically`**`` increment its value by `delta`. `
void H(hemiphore* h, int bound, int delta) {
// This is pseudocode; a real hemiphore implementation would block, not spin, and would
// ensure that the test and the increment happen in one atomic step.
while (h->value < bound) {
sched_yield();
}
h->value += delta;
}
Once a hemiphore is initialized with hemiphore_init
, application code
should access the hemiphore only through the H
operation.
QUESTION SYNCH-2A. Use hemiphores to implement mutual-exclusion
locks. Fill out the code below. (You may not need to fill in every empty
slot. You may use standard C constants; for example, INT_MIN
is the
smallest possible value for a variable of type int
, which on an x86-64
machine is −2147483648.)
typedef struct mutex { // Initialize the mutex to the unlocked state.
hemiphore h; void mutex_init(mutex* m) {
hemiphore_init(&m->h);
} mutex;
}
// Lock the mutex. // Unlock the mutex.
void mutex_lock(mutex* m) { void mutex_unlock(mutex* m) {
} }
QUESTION SYNCH-2B. Use hemiphores to implement condition variables.
Fill out the code below. You may assume that the implementation of
mutex
is your hemiphore-based implementation from above (so, for
instance, cond_wait
may access the hemiphore m->h
). See the Hints at
the end of the question.
typedef struct condvar { // Initialize the condition variable.
mutex m; void cond_init(condvar* c) {
hemiphore h; mutex_init(&c->m);
hemiphore_init(&c->h);
} condvar; }
// Signal the condition variable.
void cond_signal(condvar* c) {
}
` // Block until the condition variable is signaled. The mutex `m` must be locked by the current `
// thread. It is unlocked before the wait begins and re-locked after the wait ends.
` // There are no sleep-wakeup race conditions: if thread 1 has `m` locked and executes `
` // `cond_wait(c, m)`, no other thread is waiting on `c`, and thread 2 executes `
` // `mutex_lock(m); cond_signal(c); mutex_unlock(m)`, then thread 1 will always receive the `
// signal (i.e., wake up).
void cond_wait(condvar* c, mutex* m) {
}
Hints. For full credit:
- If no thread is waiting on condition variable
c
, thencond_signal(c)
should do nothing. - Assume N threads are waiting on condition variable
c
. Then N calls tocond_signal(c)
are both necessary and sufficient to wake them all up. - Your solution must not add new sleep-wakeup race conditions to the user’s code. (That is, no sleep-wakeup race conditions unless the user uses mutexes incorrectly.)
QUESTION SYNCH-2C. Use pthread mutexes and condition variables to implement hemiphores. Fill out the code below. See the hints after the question.
typedef struct hemiphore {
pthread_mutex_t m;
int value;
pthread_cond_t c;
} hemiphore;
void hemiphore_init(hemiphore* h) {
pthread_mutex_init(&h->m);
h->value = 0;
pthread_cond_init(&h->c);
}
void H(hemiphore* h, int bound, int delta) {
}
Hints. The pthread mutex and condition variable operations have the
following signatures. You should pass NULL
for any attributes
arguments. Don’t worry about the pthread_mutex_destroy
and
pthread_cond_destroy
operations, and feel free to abbreviate (e.g.
“lock
” instead of “pthread_mutex_lock
”).
pthread_mutex_init(pthread_mutex_t* m, const pthread_mutexattr_t* attributes)
pthread_mutex_lock(pthread_mutex_t* m)
pthread_mutex_unlock(pthread_mutex_t* m)
pthread_cond_init(pthread_cond_t* c, const pthread_condattr_t* attributes)
pthread_cond_signal(pthread_cond_t* c)
(wakes up at most one thread waiting onc
)pthread_cond_broadcast(pthread_cond_t* c)
(wakes up all threads waiting onc
)pthread_cond_wait(pthread_cond_t* c, pthread_mutex_t* m)
QUESTION SYNCH-2D. Consider the following two threads, which use a
shared hemiphore h
with initial value 0.
Thread 1
Thread 2
H(&h, 1000, 1); while (1) {
printf("Thread 1 done\n"); H(&h, 0, 1);
H(&h, 0, -1);
}
Thread 2 will never block, and the hemiphore’s value will alternate
between 1 and 0. Thread 1 will never reach the printf
, because the
hemiphore’s value never reaches 1000. However, in most people’s first
implementation of hemiphores using pthread mutexes and condition
variables, Thread 1 will not block. Every call to H
in Thread 2 will
effectively wake up Thread 1. Though Thread 1 will then check the
hemiphore’s value and immediately go back to sleep, doing so wastes CPU
time.
Design an implementation of hemiphores using pthread mutexes and condition variables that solves this problem. In your revised implementation, Thread 1 above should block forever. For full credit, write C code. For partial credit, write pseudocode or English describing your design.
Hint. One working implementation constructs a linked list of “waiter” objects, where each waiter object is on a different thread’s stack, as initially sketched below. You can use such objects or not as you please.
typedef struct hemiphore_waiter { typedef struct hemiphore {
struct hemiphore_waiter* next; pthread_mutex_t m;
int value;
hemiphore_waiter* waiters;
} hemiphore_waiter; } hemiphore;
void hemiphore_init(hemiphore* h) {
pthread_mutex_init(&h->m);
h->value = 0;
h->waiters = NULL;
}
void H(hemiphore* h, int bound, int delta) {
hemiphore_waiter w;
}
SYNCH-3. Pipes and synchronization
In the following questions, you will implement a mutex using a pipe, and a limited type of pipe using a mutex.
The definitions of the pthread mutex and condition variable operations are as follows.
int pthread_mutex_init(pthread_mutex_t* m, const pthread_mutexattr_t* attr)
Create a new mutex with attributes defined by attr
. (For this
question, attr
is ignored.)
int pthread_mutex_lock(pthread_mutex_t* m)
Locks m
. If the mutex is already locked, the calling thread will block
until the mutex becomes available.
int pthread_mutex_unlock(pthread_mutex_t* m)
Unlocks m
. Calling pthread_mutex_unlock
with a mutex that the
calling thread does not hold will result in undefined behavior.
int pthread_cond_init(pthread_cond_t* c, const pthread_condattr_t* attr)
Create a new condition variable with attributes defined by attr
. (For
this question, attr
is ignored.)
int pthread_cond_signal(pthread_cond_t* c)
Unblocks one thread waiting for c
.
int pthread_cond_wait(pthread_cond_t* c, pthread_mutex_t* m)
Atomically unlocks m
and blocks the calling thread on the condition
c
. When the condition is signaled, the thread locks m
and returns.
Calling pthread_cond_wait
with an unlocked mutex will result in
undefined behavior.
The operations return 0 on success. Although errors are possible (for
instance, ENOMEM
if there’s not enough memory to allocate a new mutex)
you may assume that they don’t occur.
QUESTION SYNCH-3A. In this question, you are to implement mutex
functionality using a pipe. Fill in the definitions of
pipe_mutex_init
, pipe_mutex_lock
, and pipe_mutex_unlock
. You
should be able to implement the same functionality as the pthread
versions (assuming no other code accesses the pipe).
typedef struct pipe_mutex {
int fd[2];
} pipe_mutex;
int pipe_mutex_init(pipe_mutex* m) {
if (pipe(&m->fd) < 0) {
return -1;
}
return 0;
}
int pipe_mutex_lock(pipe_mutex* m) {
}
int pipe_mutex_unlock(pipe_mutex* m) {
}
In the next questions, you will help implement pipe functionality using an in-memory buffer and a mutex. This “mutex pipe” will only work between threads of the same process (in contrast to a regular pipe, which also works between processes). An initial implementation of mutex pipes is as follows; you will note that it contains no mutexes.
typedef struct mutex_pipe {
1.
char buf[BUFSIZ];
2.
size_t head;
3.
size_t sz;
} mutex_pipe;
int mutex_pipe_init(mutex_pipe* p) {
6.
p->head = p->sz = 0;
7.
memset(&p->buf[0], 0, sizeof(p->buf));
8.
return 0;
}
` // Read up to `sz` bytes from the mutex_pipe into `buf` and return the number of bytes `
// read. If no bytes are available, wait until at least one byte can be read.
ssize_t mutex_pipe_read(mutex_pipe* p, char* buf, size_t sz) {
10.
size_t n = 0;
11.
while (n < sz && (p->sz != 0 || n == 0)) {
12.
size_t ncopy = p->sz;
13.
if (ncopy > sizeof(p->buf) - p->head)
14.
ncopy = sizeof(p->buf) - p->head;
15.
if (ncopy > sz - n)
16.
ncopy = sz - n;
17.
memcpy(&buf[n], &p->buf[p->head], ncopy);
18.
n += ncopy;
19.
p->head += ncopy;
20.
p->head = p->head % sizeof(p->buf);
21.
p->sz -= ncopy;
22.
}
23.
return n;
}
` // Write up to `sz` bytes from `buf` into the mutex_pipe and return the number of bytes `
// written. If no space is available, wait until at least one byte can be written.
ssize_t mutex_pipe_write(mutex_pipe* p, const char* buf, size_t sz) {
30.
size_t n = 0;
31.
while (n < sz && (p->sz != sizeof(p->buf) || n == 0)) {
32.
size_t tail = p->head + p->sz;
33.
tail = tail % sizeof(p->buf);
34.
size_t ncopy = sizeof(p->buf) - p->sz;
35.
if (ncopy > sizeof(p->buf) - tail)
36.
ncopy = sizeof(p->buf) - tail;
37.
if (ncopy > sz - n)
38.
ncopy = sz - n;
39.
memcpy(&p->buf[tail], &buf[n], ncopy);
40.
n += ncopy;
41.
p->sz += ncopy;
42.
}
43.
return n;
}
The last page of this exam has a copy of that code that you can remove and keep.
NOT A QUESTION.
It would be wise to work through an example. For example, assume
BUFSIZ == 4
, and figure out how the following calls would behave.
mutex_pipe_write(p, "Hi", 2);
mutex_pipe_read(p, buf, 4);
mutex_pipe_write(p, "Test", 4);
mutex_pipe_read(p, buf, 3);
First let’s reason about this code in the absence of threads.
QUESTION SYNCH-3B. Which of the following changes could, if made in isolation, result in undefined behavior when a mutex pipe was used? Circle all that apply.
- Eliminating line 6
- Eliminating line 7
- Eliminating lines 13–14
- Eliminating lines 15–16
- Eliminating line 18
- Eliminating line 19
QUESTION SYNCH-3C. Which of the following changes could, if made in
isolation, cause a mutex_pipe_read
to return incorrect data (that is,
the byte sequence produced by read
will not equal the byte sequence
passed to write
)? Circle all that apply.
- Eliminating line 33
- Eliminating lines 35–36
- Eliminating lines 37–38
- Eliminating line 39
- Eliminating line 40
- Eliminating line 41
QUESTION SYNCH-3D. Which of the following changes could, if made in
isolation, cause a call to mutex_pipe_write
to never return (when a
correct implementation would return)? Circle all that apply.
- Eliminating line 33
- Eliminating lines 35–36
- Eliminating lines 37–38
- Eliminating line 39
- Eliminating line 40
- Eliminating line 41
QUESTION SYNCH-3E. Write an invariant for p->sz
. An invariant is a
statement about the value of p->sz
that is always true. Write your
invariant in the form of an assertion; for full credit give the most
specific true invariant you can. (“p->sz
is an integer” is unspecific,
but true; “p->sz == 4
” is specific, but false.)
QUESTION SYNCH-3F. Write an invariant for p->head
. For full credit
give the most specific true invariant you can.
In the remaining questions, you will add synchronization objects and operations to make your mutex pipe work in a multithreaded program. Here is your starting point:
typedef struct mutex_pipe {
1.
char buf[BUFSIZ];
2.
size_t head;
3.
size_t sz;
4.
pthread_mutex_t m;
} mutex_pipe;
int mutex_pipe_init(mutex_pipe* p) {
5.
pthread_mutex_init(&p->m, NULL);
6.
p->head = p->sz = 0;
7.
memset(&p->buf[0], 0, sizeof(p->buf));
8.
return 0;
}
(the rest of the code as in the prior questions)
QUESTION SYNCH-3G. Add calls to “lock
” (pthread_mutex_lock
) and
“unlock
” (pthread_mutex_unlock
) that protect the mutex pipe from
race condition bugs. Write one or more snippets of C code and give line
numbers after which the snippets should appear. For full credit, your
solution must not deadlock—if one thread is reading from a pipe and
another thread is writing to the pipe, then both threads must eventually
make progress.
- Add
pthread_mutex_lock(&p->m);
after lines: 10, 30 - Add
pthread_mutex_unlock(&p->m);
after lines: 22, 42 - Other changes (if any):
QUESTION SYNCH-3H. Your solution to the last question has poor
utilization. For instance, a thread that calls mutex_pipe_read
on an
empty mutex pipe will spin forever, rather than block. Introduce one or
more condition variables so that mutex_pipe_read
will block until data
is available. Write one or more snippets of C code and give line numbers
after which the snippets should appear.
- Add to
struct mutex_pipe
:pthread_cond_t c;
- Add to
mutex_pipe_init
after line 7:pthread_cond_init(&c, NULL);
- Other changes:
SYNCH-4. Race conditions
Most operating systems support process priority levels, where the kernel runs higher-priority processes more frequently than lower-priority processes. A hypothetical Unix-like operating system called “Boonix” has two priority levels, normal and batch. A Boonix parent process changes the priority level of one of its children with this system call:
int setbatch(pid_t p)
Sets process p
to have batch priority. All future children of p
will
also have batch priority. Returns 0 on success, –1 on error. Errors
include ESRCH
, if p
is not a child of the calling process.
Note that a process cannot change its own batch status.
You’re writing a Boonix shell that can run commands with batch priority.
If c->isbatch
is nonzero, then c
should run with batch priority, as
should its children. Your start_command
function looks like this:
pid_t start_command(command* c) {
1.
c->pid = fork();
2.
if (c->pid == 0) {
3.
handle_pipes(c);
4.
handle_redirections(c);
5.
(void) execvp(c->argv[0], c->argv);
6.
// if we get here, execvp failed
7.
perror("execvp");
8.
exit(1);
9.
}
10.
assert(c->pid > 0);
11.
if (c->isbatch)
12.
setbatch(c->pid);
13.
return c->pid;
}
This shell has two race conditions, one more serious.
QUESTION SYNCH-4A. In some cases, c
will change to batch priority
after it starts running. Draw a dependency diagram demonstrating this
race condition, or briefly describe it.
QUESTION SYNCH-4B. In some cases, c
or one of its children could
run forever with normal priority. Draw a dependency diagram
demonstrating this race condition, or briefly describe it.
In the remaining questions, you will fix these race conditions in three different ways. The first uses a new system call:
int isbatch()
Returns 1 if the calling process has batch priority, 0 if it has normal
priority.
QUESTION SYNCH-4C. Use isbatch
to prevent both race conditions.
Write a snippet of C code and give the line number after which it should
appear. You should need one code snippet.
QUESTION SYNCH-4D. Use the pipe
system call and friends to prevent
both race conditions. Write snippets of C code and give the line numbers
after which they should appear. You should need several snippets. Make
sure you clean up any extraneous file descriptors before running the
command or returning from start_command
.
QUESTION SYNCH-4E. Why should the pipe
solution be preferred to
the isbatch
solution? A sentence, or the right single word, will
suffice.
QUESTION SYNCH-4F. Suggest a change to the setbatch
system call’s
behavior that could fix both race conditions, and say how to use this
new setbatch
in start_command
. Write one or more snippets of C code
and give the line numbers after which they should appear.
SYNCH-5. Minimal minimal minimal synchronization synchronization synchronization
Minimalist composer Philip Glass, who prefers everything minimal,
proposes the following implementation of condition variables based on
mutexes. He’s only implementing wait
and signal
at first.
typedef struct {
pthread_mutex_t cv_mutex;
} philip_cond_t;
int philip_cond_init(philip_cond_t* cond) {
pthread_mutex_init(&cond->cv_mutex);
pthread_mutex_lock(&cond->cv_mutex); // start in LOCKED state
}
int philip_cond_wait(philip_cond_t* cond, pthread_mutex_t* mutex) {
pthread_mutex_unlock(mutex);
pthread_mutex_lock(&cond->cv_mutex); // block until another thread calls philip_cond_signal
pthread_mutex_lock(mutex);
}
int philip_cond_signal(philip_cond_t* cond) {
pthread_mutex_unlock(&cond->cv_mutex);
}
Philip wants to use his condition variables to build a bank. Banco Glasso accounts support these operations:
void deposit(gacct* a, unsigned amt)
Adds amt
to a-\>balance
.
void withdraw(gacct* a, unsigned amt)
Blocks until a-\>balance \>= amt
; then deducts amt
from
a-\>balance
and returns.
Here’s Philip’s code.
typedef struct {
unsigned long balance;
pthread_mutex_t mutex;
philip_cond_t cv;
} gacct;
void deposit(gacct* a, unsigned amt) { void withdraw(gacct* a, unsigned amt) {
D1
pthread_mutex_lock(&a->mutex);
W1
pthread_mutex_lock(&a->mutex);
D2
a->balance += amt;
W2
while (a->balance < amt)
D3
philip_cond_signal(&a->cv);
W3
philip_cond_wait(&a->cv, &a->mutex);
D4
pthread_mutex_unlock(&a->mutex);
W4
a->balance -= amt;
} `**`W5`**` pthread_mutex_unlock(&a->mutex);
}
Philip’s friend Pauline Oliveros just shakes her head. “You got serious
problems,” she says, pointing at this section of the Mac man page for
pthread_mutex_lock
:
Calling
pthread_mutex_unlock()
with a mutex that the calling thread does not hold will result in undefined behavior.
QUESTION SYNCH-5A. Briefly explain how Philip’s code can trigger that undefined behavior.
Philip switches to Linux’s “fast” mutexes, which do not have this undefined behavior. An unlocked “fast” mutex can be unlocked again without error, and it is OK to unlock a “fast” mutex on a different thread than the thread that locked it. A “fast” mutex can have two values, 0 (unlocked) and 1 (locked).
Below, we've begun to write out an execution where Philip’s code is
called by two threads. We write the line numbers each thread executes
and the values in a
after each line. We’ve left lines blank for you
to fill in; you do not need to turn in the full table.
|
|
|
||
---|---|---|---|---|
Initial values: |
5 |
0 |
1 |
|
T1: |
T2: |
|||
W1 |
5 |
1 |
1 |
|
D1 (blocks) |
5 |
1 |
1 |
|
________ |
________ |
________ |
________ |
________ |
________ |
________ |
________ |
________ |
________ |
________ |
________ |
________ |
________ |
________ |
________ |
________ |
________ |
________ |
________ |
________ |
________ |
________ |
________ |
________ |
________ |
________ |
________ |
________ |
________ |
________ |
________ |
________ |
________ |
________ |
________ |
________ |
________ |
________ |
________ |
________ |
________ |
________ |
________ |
________ |
________ |
________ |
________ |
________ |
________ |
Complete that execution and then answer the following questions.
QUESTION SYNCH-5B. In the above execution, what are the final values
for a-\>balance
, a-\>mutex
, and a-\>cv.cv_mutex
?
QUESTION SYNCH-5C. In the above execution, which line of code (W1–5) unblocks Thread T1?
QUESTION SYNCH-5D. In the above execution, which, if any, line(s) of
code (D1–4 and/or W1–5) set a-\>cv.cv_mutex
to zero?
For the remaining two questions, consider all possible concurrent
executions of threads running withdraw
and/or deposit
.
QUESTION SYNCH-5E. Philip’s code always gives a correct balance. Why? List all that apply.
- Access to
a-\>balance
is protected by a condition variable. - Access to
a-\>balance
is protected by a mutex. - Arithmetic instructions like
a-\>balance += amt;
have atomic effect.
QUESTION SYNCH-5F. Philip’s code can sometimes block incorrectly: a
thread running withdraw(5)
might block indefinitely even though the
current balance is 10. Describe briefly how this can happen.
SYNCH-6. Weensy threads
Betsy Ross is changing her WeensyOS to support threads. There are many
ways to implement threads, but Betsy wants to implement threads using
the processes
array. “After all,” she says, “a thread is just like a
process, except it shares memory with some other process!”
Betsy has defined a new system call, sys_create_thread
, that starts
a new thread running a given thread function, with a given argument, and
a given stack pointer:
typedef void* (*thread_function)(void*);
pid_t `**`sys_create_thread`**`(thread_function f, void* arg, void* stack_top);
The system call’s return value is the ID of the new thread.
Betsy’s kernel contains the following code for her sys_fork
implementation.
// in exception()
case INT_SYS_FORK:
current->p_registers.reg_rax = handle_fork(current);
break;
uint64_t `**`handle_fork`**`(proc* p) {
proc* new_p = find_unused_process();
if (!new_p) {
return -1;
}
new_p->p_pagetable = copy_pagetable(p->p_pagetable);
if (!new_p->p_pagetable) {
return -1;
}
new_p->p_registers = p->p_registers;
new_p->p_registers.reg_rax = 0;
new_p->p_state = P_RUNNABLE;
return 0;
}
And here’s the start of her sys_create_thread
implementation.
// in exception()
case INT_SYS_CREATE_THREAD:
current->p_registers.reg_rax = handle_create_thread(current);
break;
uint64_t `**`handle_create_thread`**`(proc* p) {
// Whoops! Got a revolution to run, back later
return -1;
}
QUESTION SYNCH-6A. Complete her handle_create_thread
implementation. Assume for now that the thread function never exits. You
may use these helper functions if you need them (you may not):
proc\* find_unused_process(void)
Return a proc\*
that has state P_FREE
. Returns NULL
if no
unused process exists.
x86_64_pagetable\* copy_pagetable(x86_64_pagetable\* pgtbl)
Return a copy of pagetable pgtbl
, with all unprivileged writable
pages copied. Returns NULL
if any allocation fails.
x86_64_pagetable\* allocate_page(void)
Allocates a new physical page, zeros it, and returns its physical
address. (Recall that the WeensyOS kernel always installs
kernel_pagetable
, so the page data may be accessed using the same
virtual address.) May be passed as the allocator
to
virtual_memory_map
.
…or any function from the WeensyOS handout code See reference material at end of quiz.
Recall that system call arguments are passed according to the x86-64
calling convention: first argument in %rdi
, second in %rsi
,
third in %rdx
, etc.
QUESTION SYNCH-6B. Betsy’s friend Prince Dimitri Galitzin thinks
Betsy should give processes even more flexibility. He suggests that
sys_create_thread
take a full set of registers, rather than just a
new instruction pointer and a new stack pointer. That way, the creating
thread can supply all registers to the new thread, rather than just a
single argument.
pid_t `**`sys_create_thread`**`(x86_64_registers* new_registers);
The kernel will simply copy \*new_registers
into the proc
structure for the new thread. Easy!
Which of the following properties of x86_64_registers
would allow
Dimitri’s plan to violate kernel isolation? List all that apply.
reg_rax
contains the thread’s%rax
register.reg_rip
contains the thread’s instruction pointer.reg_cs
contains the thread’s privilege level, which is 3 for unprivileged.reg_intno
contains the number of the last interrupt the thread caused.reg_rflags
contains theEFLAGS_IF
flag, which indicates that the thread runs with interrupts enabled.reg_rsp
contains the thread’s stack pointer.
Now Betsy wants to handle thread exit. She introduces two new system
calls, sys_exit_thread
and sys_join_thread
:
void `**`sys_exit_thread`**`(void* exit_value);
void* `**`sys_join_thread`**`(pid_t thread);
sys_exit_thread
causes the thread to exit with the given exit value;
it does not return. sys_join_thread
behaves like pthread_join
or
waitpid
. If thread
corresponds is a thread of the same process,
and thread
has exited, sys_join_thread
cleans up the thread and
returns its exit value; otherwise, sys_join_thread
returns
(void\*) -1
.
QUESTION SYNCH-6C. Is the sys_join_thread
specification blocking
or polling?
Betsy makes the following changes to WeensyOS internal structures to support thread exit.
- She adds a
void\* p_exit_value
member tostruct proc
. - She adds a new process state,
P_EXITED
, that corresponds to exited threads.
QUESTION SYNCH-6D. Complete the case for INT_SYS_EXIT_THREAD
in
exception()
. Don’t worry about the case where the last thread in a
process calls sys_exit_thread
instead of sys_exit
.
case INT_SYS_EXIT_THREAD:
QUESTION SYNCH-6E. Complete the following helper function.
` // Test whether `test_pid` is the PID of a thread in the same process as `p`. `
` // Return 1 if it is; return 0 if `test_pid` is an illegal PID, it corresponds to `
// a freed process, or it corresponds to a thread in a different process.
int is_thread_in(pid_t test_pid, proc* p) {
QUESTION SYNCH-6F. Complete the case for INT_SYS_JOIN_THREAD
in
exception()
. Remember that a thread may be successfully joined at
most once: after it is joined, its PID is made available for
reallocation.
case INT_SYS_JOIN_THREAD:
QUESTION SYNCH-6G. In pthreads, a thread can exit by returning from its thread function; the return value is used as an exit value. So far, that’s not true in Weensy threads: a thread returning from its thread function will execute random code, depending on what random garbage was stored in its initial stack in the return address position. But Betsy thinks she can implement pthread-style behavior entirely at user level, with two changes:
- She’ll write a two-instruction function called
thread_exit_vector
. - Her
create_thread
library function will write a single 8-byte value to the thread’s new stack before callingsys_create_thread
.
Explain how this will work. What instructions will
thread_exit_vector` contain? What 8-byte value will `create_thread
write to the thread’s new stack? And where will that value be written
relative to sys_create_thread
’s stack_top
argument?
MISC-4. Debugging
In the following short-answer questions, you have access to five
debugging tools: top
, strace
, gdb
, valgrind
, and man
. You
can’t change program source code or use other tools. Answer the
questions briefly (a couple sentences at most).
QUESTION MISC-4A. You are given a program that appears to “get stuck” when run. How would you distinguish whether the program blocked forever (e.g., made a system call that never returned) or entered an infinite loop?
QUESTION MISC-4B. You are given a program that uses a lot of memory. How would you tell whether the program leaks memory?
QUESTION MISC-4C. You are given a program that produces weird answers. How would you check if it invoked undefined behavior?
QUESTION MISC-4D. You are given a program that blocks forever. How would you tell where the program blocked (which function called the blocking system call)?
QUESTION MISC-4E. You are given a program that takes a long time to produce a result. How would you tell whether the program was using system calls unintelligently?
QUESTION MISC-4F. You are given a program that exits with a system call error, but doesn’t explain what happened in detail. How would you find what error condition occurred and understand the conditions that could cause that error?
MISC-5. Miscellany
QUESTION MISC-5A. True or false in conventional Unix systems?
- File descriptors are often used to communicate among processes on the same machine.
- File descriptors are often used to communicate among processes on different machines.
- File descriptors are often used to communicate with persistent storage.
- File descriptors are often used to access primary memory.
- File descriptors are often used to create child processes.
QUESTION MISC-5B. Match the process isolation feature on the left with the hardware feature that helps enforce it on the right. Use each hardware feature once (make the best match you can).
|
|
The remaining questions refer to the following lines of code.
1.
close(fd);
2.
connect(fd, sockaddr, socklen);
3.
listen(fd);
4.
mmap(NULL, 4096, PROT_READ, MAP_SHARED, fd, 0);
5.
read(fd, buf, 4096);
6.
write(fd, buf, 4096);
QUESTION MISC-5C. If a program executes the following line without error, which lines could be executed next without error? List all numbers that apply.
fd = open("/home/cs61user/cs61-psets/pset6/pong61.c", O_RDWR);
QUESTION MISC-5D. If a program executes the following line without error, which lines could be executed next without error? List all numbers that apply.
fd = socket(AF_INET, SOCK_STREAM, 0);
QUESTION MISC-5E. If a program executes the following lines without error, which lines could be executed next without error? List all numbers that apply.
pipe(pipefd); fd = pipefd[0];
MISC-6. More Miscellany
QUESTION MISC-6A. True or false: Any C arithmetic operation has a well-defined result.
QUESTION MISC-6B. True or false: Any x86 processor instruction has a well-defined result.
QUESTION MISC-6C. True or false: By executing a trap instruction, a process can force an operating system kernel to execute arbitrary code.
QUESTION MISC-6D. True or false: By manipulating process memory and registers, an operating system kernel can force a process to execute arbitrary instructions.
QUESTION MISC-6E. True or false: All signals are sent explicitly via
the kill()
system call.
QUESTION MISC-6F. True or false: An operating system’s buffer cache is generally fully associative.
QUESTION MISC-6G. True or false: The least-recently-used eviction policy is more useful for very large files that are read sequentially than it is for stacks.
QUESTION MISC-6H. True or false: Making a cache bigger can lower its hit rate for a given workload.
QUESTION MISC-6I. True or false: x86 processor caches are coherent (i.e., always appear to contain the most up-to-date values).
QUESTION MISC-6J. True or false: A socket file descriptor supports either reading or writing, but not both.
MISC-7. Pot Pourri
Parts A-D pertain to the data structures and hexdump output shown here.
struct x {
unsigned long ul;
unsigned short us;
unsigned char uc;
} *sp;
// Hexdump output of some program running on the appliance
08c1b008 e9 11 cf d0 0d d0 3f f3 63 61 74 00 0d f0 fe ca |......?.cat.....|
08c1b018 5e ea 15 0d de c0 ad de |^.......|
You are told that sp
= 0x08c1b008.
QUESTION MISC-7A. What is the value (in hex) of sp->ul?
QUESTION MISC-7B. What is the value (in hex) of sp->uc?
QUESTION MISC-7C. At what address will you find the string "cat"?
QUESTION MISC-7D. You think that the bytes after the string "cat" comprise an array of 3 integers; what is the value (in hex) of the middle one of those?
QUESTION MISC-7E. What is the following binary value expressed in hexadecimal: 01011010?
QUESTION MISC-7F. What is the value of the hex number 0x7FF in decimal?
QUESTION MISC-7G. Is 0x98765432 a valid return from malloc?
QUESTION MISC-7H. What is the minimum number of x86 instruction bytes you need to write an infinite loop?
QUESTION MISC-7I. True or False: Every declaration in C code allocates space for an object.
QUESTION MISC-7J. True or False: Processes cannot share memory.
For parts K–O, assume we are running on the appliance and we initialize ival, p, and q as shown below. Write the value of the expression -- you may express the values in hex if that's simpler, just be sure to prefix them with 0x to make it clear that you are doing so. For True/False questions, there is no need to correct or provide a counterexample for any statements that are false.
int ival[4] = {0x12345678, 0x9ABCDEF0, 0x13579BDF, 0x2468ACE0};
int* p = &ival[0];
int* q = &ival[3];
int* x = p + 1;
char* cp = (char*) (q - 2);
QUESTION MISC-7K. q - p
QUESTION MISC-7L. ((char \*)q - (char \*)p)
QUESTION MISC-7M. x - p
QUESTION MISC-7N. \*((short \*)((char \*)x+2))
QUESTION MISC-7O. \*cp
QUESTION MISC-7P. What system call allows you to block on a collection of file descriptors?
QUESTION MISC-7Q. What system call creates a communication channel that can only be used among related processes?
QUESTION MISC-7R. What system call can change the attributes of a file descriptor so you can poll on it rather than block?
QUESTION MISC-7S. What system call produces a file descriptor on which a server can exchange messages with a client?
QUESTION MISC-7T. True or False: A program and a process are the same thing.
MISC-8. CS61 in Real Life
QUESTION MISC-8A. The CS61 Staff have built a jet (the NightmareLiner) modeled on the Boeing Dreamliner. Unfortunately, they modeled it just a bit too closely on the Dreamliner, which needs to be rebooted periodically to avoid failure. In the case of the NightmareLiner, it needs to be rebooted approximately every 16 days. Your job is to use what you've learned in CS61 about data representation to hypothesize why.
Hint: There are 86,400,000 ms in a day. 86,400,000 is between 226 and 227.
Google recently discovered (and reported) a bug in the GNU libc
implementation of getaddrinfo
. This function can perform RPC calls,
which involve sending and receiving messages. In some cases,
getaddrinfo
failed to check that a received message could fit inside a
buffer variable located on the stack (2048 bytes).
QUESTION MISC-8B. True or false: This flaw means getaddrinfo
will
always execute undefined behavior.
QUESTION MISC-8C. Give an example of a message that will cause
getaddrinfo
to exhibit undefined behavior.
QUESTION MISC-8D. Briefly describe the contents of a message that
would cause the getaddrinfo
function to return to address
0x400012988 rather than to its caller.
This code used to appear in the Linux kernel:
1. struct tun_struct *tun = ...; // This is a valid assignment;
2. struct sock *sk = tun->sk;
3. if (!tun)
4. return POLLERR; // This is an error return
QUESTION MISC-8E. The compiler removed lines 3 and 4. Why was that a valid thing for the compiler to do?
MISC-9. Miscellany
QUESTION MISC-9A. Name the property that implies a process cannot cause the kernel to execute code at an arbitrary address.
QUESTION MISC-9B. True or false: It’s safe to call any C library function from a signal handler.
QUESTION MISC-9C. Assume that 10 processes in a synchronous distributed system are attempting to agree on whether red or blue is the majority of the processes’ favorite color. What is the maximum number of processes that can fail (i.e., go insane and start performing arbitrary actions) for agreement to be possible?