CS552 Course Wiki: Spring 2016 | Main »
Homework 6 |
Tasks |
Homework 6 Due 05/09 DRAFT: DO NOT BELIEVE Problem 1 must be done with your project partner. Names must be included in the partner.txt file included in the supplied tar file. Submitted:
Not Submitted:
The remaining problems will not be graded but are recommended for better understanding of the course material. Problem 2Consider the example single-error-correcting code with a minimum Hamming distance of three example in the ECC1 handout. The handout can be found at the bottom of this page
Problem 3 (review of chapter-3)
This is a good reference on IEEE 754 standard; link Problem 4Virtual Memory: Consider a byte addressable system with 1GB physical memory and 4GB of virtual address space and no caching. 1) What should be the width of the address bus within the processor (prior to address translation)? What should be the width of the address bus after address translation? 2) Assume that the system uses 4KB pages. How many page table entries are required per process, (assuming that all processes uses all 4GB of the virtual address space)? 3) Now we are adding a TLB to speed up address translation. The TLB is fully associative with only four entries. The snapshot of the TLB and the contents of the relevant portion of the page table are shown below. Explain why the tags are 20 bits wide and why the physical page numbers are 18 bits wide. TLB snapshot: (:notabledit:)
Page table snapshot: (:notabledit:)
Problem 5Consider the same system of Problem 6 and the snapshot of TLB and page tables for a given process. 1)Given that this process generates the following stream of virtual addresses: 32'h0000_7000 32'h0000_B000 32'h0000_3000 32'h0000_BA5A 32'h0000_BFFF 32'h0000_AA5A 32'h0000_5FFF 32'h0000_1CCC Classify each access as TLB hit/Page table hit/Page fault. 2)For each memory access which is not a page fault, what is the translated physical address? Problem 6What are types of programs or problems that are not suitable for GPUs? Give one example. You may describe an algorithm, psuedo-code or c-code. Justify why you think that the performance of this algorithm will be bad on GPUs. Problem 7Something about cache-coherence and multiprocessors here. TBD Problem 8Something about cache-coherence and multiprocessors here. TBD |
Page last modified on April 29, 2016, visited times |