CS136 Winter 2015
Transcription
CS136 Winter 2015
CS136 Winter 2015 - Tutorial 3 CS136 ISAs: cs136@student.cs.uwaterloo.ca 27th, 28th of January, 2015 0-0 Goals of this Tutorial The goal of this tutorial is to reinforce the following material: • different systems of numeration (binary, decimal, hexadecimal and octal) • memory capacity • bits, bytes, and chars in memory • overflow • modules in C (testing and interface & implementation files) • working with floats CS 136 Winter 2015 Tutorial 3 1 Systems of Numeration Recall three systems of numeration: binary (base 2), decimal (base 10), and hexadecimal (base 16). We will also introduce one new system of numeration, octal (base 8). When we write the number 97 in decimal, we interpret it as 97 = 9 × 101 + 7 × 100 . We can represent the number 97 in other numerical systems; consider the following alternate represenations of 97: 11000012 , and 6116 . Each base is denoted by a subscript. Conventionally, if a number is in base 10 then it will not have a subscript. CS 136 Winter 2015 Tutorial 3 2 Binary notation In a binary representation (also known as base 2) there are two distinct digits (01). When we write the number 1011010 in binary, we interpret it as 1 × 26 + 0 × 25 + 1 × 24 + 1 × 23 + 0 × 22 + 1 × 21 + 0 × 20 = 64 + 16 + 8 + 2 = 90 (in base 10). 4 binary digits, can represent 24 (16) different possible values (0 . . . 1111) or (0 . . . 15) in base 10. In CS 251 / CS 230 you will learn more about binary notation. CS 136 Winter 2015 Tutorial 3 3 Hexadecimal notation In a hexadecimal (hex) representation (also known as base 16) there are sixteen distinct digits (0123456789ABCDEF). When we write the number 2A9F in hex, we interpret it as 2 × 163 + 10 × 162 + 9 × 161 + 15 × 160 = 8192 + 2560 + 144 + 15 = 10911 (in base 10). The reason hex is so popular is because it is easy to switch between binary and hex representation. A single hex digit corresponds to exactly 4 bits. CS 136 Winter 2015 Tutorial 3 4 Conversion table Dec Bin Hex Dec Bin Hex 0 0000 0 8 1000 8 1 0001 1 9 1001 9 2 0010 2 10 1010 A 3 0011 3 11 1011 B 4 0100 4 12 1100 C 5 0101 5 13 1101 D 6 0110 6 14 1110 E 7 0111 7 15 1111 F CS 136 Winter 2015 Tutorial 3 5 Octal In an Octal representation (also known as base 8) there are eight distinct digits (01234567). We write the number 0452 in octal, we interpret it as 4 × 82 + 5 × 81 + 2 × 80 = 256 + 40 + 2 = 298 (in base 10). Consider the following equivalent representations of 97. const int a = 0141; // base 8. const int b = 97; // base 10. const int c = 0x61; // base 16. Be careful, 0141, 0x141 and 141 are not the same! CS 136 Winter 2015 Tutorial 3 6 CQ 1: You are Bart from the Simpsons (a tv show that had its debut in 1989). In the Simpsons each character has four fingers on each hand (total of 8 fingers). If Bart created his own numerical system, it would likely be in base 8. Represent 85 in base 8. 0105 0414 0232 D 0125 E 0101 A B C CS 136 Winter 2015 Tutorial 3 6 Memory Capacity To have a better understanding of the C memory model, we provide a brief introduction to working with bits and bytes. You are probably aware that internally, computers work with bits. A bit of storage (in the memory of a computer) is in one of two states: either 0 or 1. A traditional light switch can be thought of as a bit of storage. CS 136 Winter 2015 Tutorial 3 7 Early in computing it became obvious that working with individual bits was tedious and inefficient. It was decided to work with 8 bits of storage at a time, and a group of 8 bits became known as a byte. Each byte in memory is in one of 256 possible states. With today’s computers, we can have large memory capacities: • 1 KB = 1 kilobyte = 1024 (210 ) bytes∗ • 1 MB = 1 megabyte = 1024 KB = 1,048,576 (220 ) bytes∗ • 1 GB = 1 gigabyte = 1024 MB = 1,073,741,824 (230 ) bytes∗ CS 136 Winter 2015 Tutorial 3 8 ∗ The size of a kilobyte can be 1000 bytes or 1024 bytes, depending on the context. Similarly, a megabyte can be 106 or 220 bytes, etc.. Manufacturers often use the measurement that makes their product appear better. For example, A terabyte (TB) drive is almost always 1012 bytes instead of 240 . To avoid confusion in scientific use, a standard was established to use KB for 1000 bytes and KiB for 1024 bytes, etc.. In general use, KB is still commonly used to represent both. CS 136 Winter 2015 Tutorial 3 9 Primary Memory Modern computers have primary memory in addition to secondary storage (hard drives, solid state drives, flash drives, DVDs, etc.). The characteristics of primary memory and secondary storage devices vary, but in general: Primary Memory: Secondary Storage: • very fast (nanoseconds) • (≈20x-1000x) slower • medium capacity (≈GB) • large capacity (≈TB) • high cost ($) per byte • low cost ($) per byte • harder to remove • removable or portable • erased on power down • persistent after power down CS 136 Winter 2015 Tutorial 3 10 In practice, programs can only “run” in primary memory. When you “launch” a program, it is copied from secondary storage to primary memory before it is “run”. In this course, we are always referring to primary memory, which is also known as Random Access Memory (RAM). With RAM you can access any individual byte directly and you can access the memory in any order you desire (randomly). CS 136 Winter 2015 Tutorial 3 11 Traditional secondary storage devices (hard drives) are faster if you access data sequentially (not randomly). Primary memory became known as RAM to distinguish it from sequential access devices. The term “RAM” is becoming outdated, as solid state drives and flash drives use random access. Also, modern RAM can be faster when accessed sequentially (in “bursts”). Regardless, when you encounter the term “RAM”, you should interpret it as “primary memory” . CS 136 Winter 2015 Tutorial 3 12 Bits, Bytes, and C Characters in Memory Recall that in C, single quotes (') are used to indicate an ASCII character. Moreover, in C there is no difference between the following two variables: const char letter_a = 'a'; const char ninety_seven = 97; In memory, letter_a is represented in 0s and 1s. Recall that there are 8 bits in a byte and a char reserves 1 byte in our RunC environment. CS 136 Winter 2015 Tutorial 3 13 CQ 2: Consider the following definition for i. const int i = 'b'; // equivalent to 98 in decimal. How many bytes does i reserve on the stack and how many 1s (bits) are in the binary representation of i? A B C D E 4 bytes and 1 bit 4 bytes and 2 bits 4 bytes and 3 bits 4 bytes and 4 bits 1 byte and 3 bits CS 136 Winter 2015 Tutorial 3 13 Overflow and Underflow Recall that integer overflow (or underflow) occurs when we try to represent values outside of the integer limits. By carefully specifying the order of operations you can sometimes avoid overflow. If you #include <limits.h>, the constants INT_MIN and INT_MAX are defined with those limit values. CS 136 Winter 2015 Tutorial 3 14 example: overflow k! . Consider the equation m! (1)(2)(3)...(k−2)(k−1)(k) We can express the former equation as (1)(2)(3)...(m−2)(m−1)(m) . If we let k = 13 and m = 10 then (1)(2)...(9)(10) 1 13! = (11)(12)(13) = (1716) = 1716 10! (1)(2)...(9)(10) 1 k! may overflow for sufficiently large values of k , however if we simplify the expression before we evaluate the factorial then we can mitigate overflow. CS 136 Winter 2015 Tutorial 3 15 Modules in C Recall that a C module consists of an interface (.h file) and an implementation (.c file) file. The interface consists of: function declarations, variable declarations, and structure definitions; whereas the implementation consists of variable and function definitions. If a function is declared in the interface then the design recipe goes in the interface with the function declaration, otherwise it goes with the function definition in the implementation file. From assignment 03 onward, you are expected to assert your required restrictions. CS 136 Winter 2015 Tutorial 3 16 example: safe overflow module The CS136 C master has promoted you to project lead for project Safe Overflow. This responsibility entails writing a C module which provides a function that safely determines if the sum of two integers will overflow. Fortunately, the previous project lead wrote some of the interface and implementation; it is your task to finish writing the interface and implementation. CS 136 Winter 2015 Tutorial 3 17 example: test module for the safe overflow module Great work, you have completed the C module; the CS136 C master is pleased. Oops! The CS136 C master noticed that you did not submit your test module. The CS136 C master is asking you to write a test module that verifies the correctness of your module. Write the test module and send it to the CS136 C master. CS 136 Winter 2015 Tutorial 3 18 Working with Floats Consider a bank account with 1 billion dollars and 0 cents; observe what happens when we withdraw $100. const float balance = 1000000000.0; const float withdraw = 100.0; const float new_balance = balance - withdraw; printf("balance = %f\n", balance); printf("withdraw = %f\n", withdraw); printf("new balance = %f\n", new_balance); balance = 1000000000.0 withdraw = 100.0 new balance = 999999872.000000 CS 136 Winter 2015 Tutorial 3 19 CQ 3: From the previous example, the new balance is not accurate because: A floats are stored in memory B one billion is outside the range of a float C floats have a mantissa and an exponent D floats are not precise CS 136 Winter 2015 Tutorial 3 19 It is dangerous to compare floats using the equality operator (==). Typically, we only want to know if a float is within some small distance (i.e. an epsilon) of another float. Bad Good const float a = 10.0/3.0; const float b = 3.333333; const float a = 10.0/3.0; const float b = 3.333333; const float epsilon = 0.000001; printf("a = %f\n", a); printf("b = %f\n", b); printf("a = %f\n", a); printf("b = %f\n", b); assert(a == b); // failed. a = 3.333333 b = 3.333333 Program finished with errors CS 136 Winter 2015 assert((a - b) <= epsilon); assert((a - b) >= -epsilon); a = 3.333333 b = 3.333333 Tutorial 3 20