


下載本文檔
版權(quán)說明:本文檔由用戶提供并上傳,收益歸屬內(nèi)容提供方,若內(nèi)容存在侵權(quán),請進(jìn)行舉報或認(rèn)領(lǐng)
文檔簡介
1、Beijing Jiaotong University Final ExaminationCourse:Computer Architecture Lecturer: Ai Lihua,Wang Bing(NOTE:4 parts in 2 hours: total 100 points)Part No.Part1Part2Part3Part4Part5TotalScoreExaminerPlease show your work CLEARLY for all problems. I hope you enjoy the test!Part1Mark only one answer for
2、each question 10 points1. A superscalar processor has ( )(a) multiple functional units (b) a high clock speed (c) a large amount of RAM (d) many I/O ports2. On-chip cache has ( )(a) lower access time than RAM(b) larger capacity than off chip cache (c) its own data bus (d) become obsolete3. ( ) data
3、hazards are not possible in the DLX in-order instruction issue and in-order execution multicycle pipeline?(a) WAR(b) WAW (c)RAW (d) RAR4.Pipelining improves CPU performance due to ( )(a) reduced memory access time (b) increased clock speed (c) the introduction of parallellism (d) additional function
4、al units5. Cache memory enhances ( )(a) memory capacity (b) memory access time (c) secondary storage capacity (d) secondary storage access time6. RISC machines typically ( )(a) have high capacity on-chip cache memories (b) have fewer registers than CISC machines (c) are less reliable than CISC machi
5、nes (d) execute 1 instruction per clock cycle.7. Which of the following is NOT a computer performance metric: ( )(a) MIPS(b) FLOPS(c) SPECbenchmark, (d) RISC8.Given a 5 stage pipeline with stages taking 1, 2, 3, 1, 1 units of time, the clock period of the pipeline is: ( )(a) 8 (b) 1/8 (c) 1/3 (d) 39
6、. The average memory access time for a machine with a cache hit rate of 90% where the cache access time is 10ns and the memory access time is 100ns is ( )(a) 55ns (b) 45ns(c) 90ns(d) 19ns10. Delayed branching is used ( )(a) to introduce delays in program execution (b) in pipelining(c) in cache memor
7、y (d) decoding instructionsPart2Fundamentals of Computer Design 10 points1. 10 points In many practical applications that demand a real-time response, the computational workload W is often fixed. As the number of processors increases in a parallel computer, the fixed workload is distributed to more
8、processors for parallel execution. Assume 20 percent of W must be executed sequentially, and 80 percent can be executed by 4 nodes simultaneously. What is a fixed-load speedup?Part 3Instruction Set Architecture 20 points2. 8 points Suppose the variable x of type int and at address 0x100 has a hexade
9、cimal value 0x. The ordering of the bytes within the address range 0x100 through 0x103 depends on the type of machine. What will be arranged in memory 0x1000x103 according to Little Endian and Big Endian? 3. 12 points A model machine has 7 instructions, which frequencies are 43%, 21%, 12%, 8%, 6%, 6
10、%, and 4% respectively.3.1 Encoding operator with the minimum average code length.3.2According to 5.1, give the value of the minimum average code length.Part 4Pipelining 35 points4. 6 points Why would a designer sometimes allow structural hazards?5. 21 points Using the following code fragment:LOOP:L
11、W R1 , 0(R2);load R1 from address 0+R2ADDI R1, R1, #1; R1= R1+1 SW 0(R2), R1;store R1 at address 0+ R2ADDI R2, R2, #4; R2= R2+4SUB R4, R3, R2; R4= R3-R2BNEZ R4, LOOP;branch to LOOP if R4!=0Assume the initial value of R3 is R2+200. They run on a pipelined machine like DLX.5.1.According to the followi
12、ng format, show the timing of this instruction sequence with normal forwarding and bypassing hardware. Also assuming a register read and a write in the same clock cycle “forwards” through the register file. instructionClock123456789101112131415LW R1,0(R2)IFIDEXEMEMWBADDI R1, R1, #1SW 0(R2), R1ADDI R
13、2, R2, #4SUBR4, R3, R2BNEZ R4, LOOP 5.2.Assume that the branch is handled by predicting it as not taken. If all memory references take 1 cycle, how many cycles does this loop take to execute?5.3.In order to reduce the total cycles of 8.2, what measures could be taken? Give an explanation.6. 8 points
14、 For two-level branch prediction strategy with (2,2) predictor, how many bits does the branch prediction buffer need for 2K branch instructions?Part 5Memory Hierarchy 25points7. 7 points Cache design: Give short answers to the following questions.7.1Cache miss rates decrease with larger cache block
15、sizes due to what kind of locality?7.2How many sets in fully associative cache with 64 cache blocks?7.3 How many sets in a direct-mapped cache with 128 cache blocks?7.4How to keep the main memory and the cache consistent?8.18 points Assume a four blocks direct-mapped cache, two words per block, LRU replacement. The ca
溫馨提示
- 1. 本站所有資源如無特殊說明,都需要本地電腦安裝OFFICE2007和PDF閱讀器。圖紙軟件為CAD,CAXA,PROE,UG,SolidWorks等.壓縮文件請下載最新的WinRAR軟件解壓。
- 2. 本站的文檔不包含任何第三方提供的附件圖紙等,如果需要附件,請聯(lián)系上傳者。文件的所有權(quán)益歸上傳用戶所有。
- 3. 本站RAR壓縮包中若帶圖紙,網(wǎng)頁內(nèi)容里面會有圖紙預(yù)覽,若沒有圖紙預(yù)覽就沒有圖紙。
- 4. 未經(jīng)權(quán)益所有人同意不得將文件中的內(nèi)容挪作商業(yè)或盈利用途。
- 5. 人人文庫網(wǎng)僅提供信息存儲空間,僅對用戶上傳內(nèi)容的表現(xiàn)方式做保護(hù)處理,對用戶上傳分享的文檔內(nèi)容本身不做任何修改或編輯,并不能對任何下載內(nèi)容負(fù)責(zé)。
- 6. 下載文件中如有侵權(quán)或不適當(dāng)內(nèi)容,請與我們聯(lián)系,我們立即糾正。
- 7. 本站不保證下載資源的準(zhǔn)確性、安全性和完整性, 同時也不承擔(dān)用戶因使用這些下載資源對自己和他人造成任何形式的傷害或損失。
最新文檔
- GB 15269-2025雪茄煙
- 2025鋼材買賣合同樣本
- 代理記賬公司培訓(xùn)
- 2025全面商用建筑設(shè)計(jì)合同
- 2025居民購房合同協(xié)議書標(biāo)準(zhǔn)
- 2025二手折彎機(jī)買賣合同
- 2025臨時工合同模板
- 2025中外合作合同范本2
- 汽車美容服務(wù)流程優(yōu)化策略探討試題及答案
- 教育智能體的構(gòu)建與實(shí)踐路徑探討
- 《中央八項(xiàng)規(guī)定精神學(xué)習(xí)教育》專項(xiàng)講座
- 經(jīng)典物理浮力計(jì)算題(含答案)
- 2021年中國海洋大學(xué)輔導(dǎo)員招聘試題及答案解析
- 生物地理中考總復(fù)習(xí)知識點(diǎn)(地生中考,要這樣復(fù)習(xí))
- 云南省大理白族自治州各縣區(qū)鄉(xiāng)鎮(zhèn)行政村村莊村名居民村民委員會明細(xì)
- 勞動者就業(yè)登記表(通用模板)
- 壓力容器(氣瓶)風(fēng)險點(diǎn)告知卡
- 斜屋面專項(xiàng)施工方案-掛瓦坡屋面(附圖)
- 自保溫砌塊施工方案
- 合成氣生產(chǎn)甲醇工藝流程圖
- T分布臨界值表
評論
0/150
提交評論