Do Now 18 – Balance Point Directions: Use the data table to answer
Transcription
Do Now 18 – Balance Point Directions: Use the data table to answer
Math 2: Algebra 2, Geometry and Statistics Ms. Sheppard-Brick 617.596.4133 http://lps.lexingtonma.org/Page/2434 Do Now 18 Name: Date: Do Now 18 – Balance Point Directions: Use the data table to answer the questions. 1. Calculate the balance point. 2. Explain whether it is reasonable to fit a line to the data. 3. The data is plotted as a scatterplot below. Draw a line that contains your balance point and seems to be a good approximation of the data. 4. Find the equation of the line you plotted in 3 using point-‐ slope form of a line. 12 10 8 6 4 2 5 10 Math 2 Week 7 Packet Page 8 Math 2: Algebra 2, Geometry and Statistics Ms. Sheppard-Brick 617.596.4133 http://lps.lexingtonma.org/Page/2434 Name: Date: Linear Regression on the Calculator – Classwork and Homework We have been finding lines of best fit for quantitative data. Now, we will use our calculators to make the process easier. Math 2 Week 7 Packet Page 9 Math 2: Algebra 2, Geometry and Statistics Ms. Sheppard-Brick 617.596.4133 http://lps.lexingtonma.org/Page/2434 1. This table shows the average number of gallons of milk a family drinks per week. a. Follow directions on first page to find the equation of the best-fit line. Write down the equation. family size 1 2 3 4 5 6 Name: Date: number of gallons of milk 1.0 1.5 2.2 3.8 4.7 5.0 b. What is the y-intercept of this line? Explain what it represents about the situation. Does it make sense? c. What is the slope of this line? Explain what it represents about the situation. Does it make sense? d. Sally has eight people in her family. How many gallons of milk do you think they drink in a week? Does your answer make sense? If Sally says her family drank 8 gallons of milk last week, do you believe her? e. You run into your friend John at the store and he is buying 3 gallons of milk for his family this week. How many people do you think are in his family? Does your answer make sense? If John says his family has 4 people in it, do you believe him? Math 2 Week 7 Packet Page 10 Math 2: Algebra 2, Geometry and Statistics Ms. Sheppard-Brick 617.596.4133 http://lps.lexingtonma.org/Page/2434 Name: Date: 2. Use your calculator to find the best-fit line equation for the data in the table. a. Write down the equation. x –3 –1 0 1 3 y 4 3 2 1 0 b. Graph the data points and the best-fit line together on your calculator screen. (If you don’t see five points and a line, try pressing [ZOOM][9] to fix.) Draw your calculator screen below. 3. The table shows ages and heights for several maple trees. Let x = tree age (in years), y = tree height (in feet). a. Enter the data table into your calculator, and find the best-fit line equation: b. Use the equation to predict the height of a 60year-old tree. Does this make sense? tree age tree height 3 6 72 60 25 21 24 18 5 9 42 35 17 14 50 40 5 11 87 70 c. Use the equation to estimate the age of a tree that is 100 feet tall. Does this make sense? Math 2 Week 7 Packet Page 11 Math 2: Algebra 2, Geometry and Statistics Ms. Sheppard-Brick 617.596.4133 http://lps.lexingtonma.org/Page/2434 Name: Date: 4. This table shows the amount spent on TV advertising in the USA in various years. Year 1995 1997 1999 2001 TV advertising expenditures 32,720 34,593 36,830 40,140 (in millions of dollars) a. Enter the data table into your calculator, and find the best-fit line equation: b. Use the equation to predict the amount spent on advertising in 1996. Does this make sense? c. Use the equation to predict in what year the money spent on advertising will reach 60,000 million dollars. Does this make sense? 5. The data table shows the number of farmers and the number of cows at each of ten different farms. This problem is about how these two numbers are related. Let x = the number of farmers, y = the number of cows. a. Find the best-fit line equation. You may round off the numbers to three decimal places. b. Predict how many cows there would be at a farm with 6 farmers. Show work. number of farmers number of cows 1 8 10 4 2 1 4 3 2 12 3 48 73 27 14 8 19 17 11 80 c. Predict how many farmers there would be at a farm with 100 cows. Show work. Math 2 Week 7 Packet Page 12 Math 2: Algebra 2, Geometry and Statistics Ms. Sheppard-Brick 617.596.4133 http://lps.lexingtonma.org/Page/2434 Name: Date: Correlation: Examining Residuals and Correlation Coefficient Today, we will be more closely examining best-fit lines. Begin by creating a scatterplot for the data given in the Excel file on the web site. Remember to label your axes appropriately. Lately, we have been using our calculators to find equations of lines of best fit. Excel can calculate best-fit lines (trendlines) as well. Now, we are going to examine part of the procedure that the calculator and the computer use to find them. Excel can make this process easier for us. Residuals are the: Car Crashes # Deaths (per every 100,000 people) y = 0.9515x -‐ 1814.1 R² = 0.76471 Year Math 2 Week 7 Packet Page 13 Math 2: Algebra 2, Geometry and Statistics Ms. Sheppard-Brick 617.596.4133 http://lps.lexingtonma.org/Page/2434 Name: Date: The goal of finding the best-fit line is to make the numbers in the residual column as small as possible. Since some sets of data can contain hundreds of points, you need a single number that represents the error for the entire data set. Adding the individual residuals does not really help. One prediction that is 10,0000 too high offsets a prediction that is 10,000 too low. A line of best fit is the graph of the linear equation that shows the relationship between two sets of data most accurately (with the least errors). When working with a line of best fit we want to assess how well the line is fitting the data. The sum of square residuals (and thus positive numbers) is one way to measure, but the minimum value that this can reach changes depending on the data. Instead, we use what is called the linear correlation coefficient (r), which is a number that is always between –1 and 1. Note: When calculating r, the linear correlation coefficient, the calculator uses the sum of square residuals. They are related, but we are not going to go into how. The number r, called the correlation coefficient, is a measurement of how closely the bestfit line fits the data. • A value of r = 1 or –1 would say the line fits the data perfectly (perfect correlation). • A value of r that is greater than 0.8 is considered strong correlation • A value of r that is less than 0.5 is often considered weak correlation • A value of r = 0 would stand for the worst possible fit (no correlation) o A positive value of r would mean: o A negative value of r would mean: r=1 r = 0.99 r = 0.95 r = 0. 90 r = 0.70 r = 0.23 Our calculator will tell you r for any best-fit line. To turn this feature on: Math 2 Week 7 Packet Page 14 Math 2: Algebra 2, Geometry and Statistics Ms. Sheppard-Brick 617.596.4133 http://lps.lexingtonma.org/Page/2434 Name: Date: Enter your original Excel data into your calculator and find the line of best fit. You will now notice that the calculator displays r along with the equation. Equation: What value of r does your calculator display? What does this number tell you? The calculator also display r2, the coefficient of determination. This allows you to determine how certain one can be in making predictions from the best-fit line. This is the number that Excel gives you. You can find r by taking the square root of r2. Now you try it with some data about the weather and the number of visitors to a water park: What is the best-fit line that Excel gives you? Visitors to a Water Park What is the R2 value that Excel gives you? What is the value of r? What would you say about the correlation of the best-fit line to the data? Temperature (F) # of Visitors (thousands) 62 36.8 67 40.9 69 36.9 70 37.1 72 44.3 77 42.3 79 45.8 80 45.4 83 48.4 85 51.5 88 57.7 94 57.3 96 56.1 98 58.9 Predicted Math 2 Week 7 Packet Page 15 Residuals Math 2: Algebra 2, Geometry and Statistics Ms. Sheppard-Brick 617.596.4133 http://lps.lexingtonma.org/Page/2434 Homework 1. The table of data comes from an experiment by Tor Carlson in 1913 on Saccharomyces cerevisiae (a type of yeast). The data show the number of hours elapsed and the number of yeast cells per square unit of area in a Petri dish. a. Find the balance point of the data and add it to the table. b. Make a scatterplot of this data below. Name: Date: Hours Yeast Density 0 9.6 1 18.3 2 29.0 3 47.2 4 71.1 5 119.1 6 174.6 7 257.3 c. Using your calculator, find the line of best fit. Write the equation and r-value below, and plot the best-fit line on your scatterplot. d. With a different color pen or highlighter, mark the residuals on your scatterplot. e. Either in Excel, with your calculator or by hand calculate all of the residuals and label them in your scatterplot. You can either fill out the table below, or print out your Excel work and paste it below. Predicted Residuals Hours Yeast Density Density 0 9.6 1 18.3 2 29.0 3 47.2 4 71.1 5 119.1 6 174.6 7 257.3 Math 2 Week 7 Packet Page 16 Math 2: Algebra 2, Geometry and Statistics Ms. Sheppard-Brick 617.596.4133 http://lps.lexingtonma.org/Page/2434 Name: Date: f. Look at your r-value. What does this tell you about your best-fit line? (Make sure you address the sign of your r-value and the magnitude of your r-value.) 2. This table gives the mean height in centimeters of boys ages 5 to 13 in the United States. (Source: National Center for Health Statistics) a. Using your calculator find the best-fit line. a. What does the slope represent? Does this make sense? b. What does the y-intercept represent? Does this make sense? Age Height (cm) 5 109.2 6 115.7 7 122.0 8 128.1 9 133.7 10 138.8 11 12 13 143.7 149.3 156.4 c. What does the data point (5, 109.2) represent? Does this make sense? d. Predict the height of a 14-year old. Does this make sense? e. Predict the height of a 30-year old. Does this make sense? f. Predict how old a 130cm tall boy is. g. What is the correlation coefficient, r, for your best-fit line? h. What does r tell you about your best-fit line? Math 2 Week 7 Packet Page 17 Math 2: Algebra 2, Geometry and Statistics Ms. Sheppard-Brick 617.596.4133 http://lps.lexingtonma.org/Page/2434 Name: Date: 3. Explain how your calculator finds the line of best-fit using residuals. 4. For the scatterplots below estimate the r-value for the lines of best fit. d. h. e. g. i. Math 2 Week 7 Packet Page 18 Math 2: Algebra 2, Geometry and Statistics Ms. Sheppard-Brick 617.596.4133 http://lps.lexingtonma.org/Page/2434 Name: Date: Two-variable Data Test Review You should be able to: • Distinguish between categorical and quantitative data • Construct two-way tables for categorical data • Calculate joint frequencies, marginal distributions, and conditional distributions • Construct segmented bar graphs • Determine whether two categorical variables are associated • Construct scatterplots for quantitative data • Find lines of best fit for data that is approximately linear • Determine the equation of a line in point-slope and slope-intercept forms o Point-slope: y = m(x − x1 ) + y1 o Slope-intercept: y = mx + b • Determine whether two quantitative variables are correlated • Determine and interpret the correlation coefficient €difference between correlation and causation • Explain the € predictions based on best-fit lines • Make appropriate • Answer practical questions about data and justify your reasoning 1. The following is data collected on time spent on homework (min) and time spent watching TV (min) each night. Homework 1 4 5 4 4 2 5 2 6 5 5 3 3 3 4 2 4 11 (min) 0 0 5 6 6 3 7 8 5 8 2 8 8 9 5 7 Television (min) 7 8 3 0 15 7 2 2 5 3 0 9 0 4 0 3 5 5 6 12 5 9 5 2 7 3 8 5 0 1 0 41 4 4 6 0 3 4 4 2 a. Construct a scatterplot for the data. USE GRAPH PAPER! b. Explain your choices for which variable is on which axis and the scales of each axis. c. Draw in an appropriate line of best fit on your scatterplot. d. Write the equation of the best-fit line in point-slope form. e. Change into slope-intercept form. Math 2 Week 7 Packet Page 19 Math 2: Algebra 2, Geometry and Statistics Ms. Sheppard-Brick 617.596.4133 http://lps.lexingtonma.org/Page/2434 f. Name: Date: Find the line of best fit given by your calculator. How close is your line from part e to the one given by the calculator? g. What is the slope of the line from your calculator? Write a sentence explaining what it represents about homework and watching television. Does this make sense? h. What is the y-intercept of the line from your calculator? Write a sentence explaining what it represents about homework and watching television. Does this make sense? i. Use your calculator to find the correlation coefficient for this data. Explain what it tells you about the data. Math 2 Week 7 Packet Page 20 Math 2: Algebra 2, Geometry and Statistics Ms. Sheppard-Brick 617.596.4133 http://lps.lexingtonma.org/Page/2434 Name: Date: 2. The following chart shows data from a telephone survey of 16 American adults. Respondent Gender Party Affiliation Person #1 Female Democrat Person #2 Female Democrat Person #3 Female Democrat Person #4 Female Republican Person #5 Male Republican Person #6 Male Democrat Person #7 Female Democrat Person #8 Female Republican Person #9 Female Republican Person #10 Male Democrat Person #11 Male Democrat Person #12 Male Republican Person #13 Female Republican Person #14 Male Democrat Person #15 Male Republican Person #16 Male Republican a. Construct a two-way table for this data. b. Choose one of the joint frequencies in your table. Write a complete sentence explaining what it represents about this survey. c. Choose one of the rows or columns and calculate the conditional distribution. Write a complete sentence explaining what each value calculated represents about this survey. d. Determine whether gender and party affiliation are associated or independent. Show your work by constructing segmented bar graphs. Explain your reasoning. Math 2 Week 7 Packet Page 21 Math 2: Algebra 2, Geometry and Statistics Ms. Sheppard-Brick 617.596.4133 http://lps.lexingtonma.org/Page/2434 Name: Date: 3. The following chart shows nutrition information for items at fast food restaurants. a. Using your calculator, generate a scatterplot comparing total calories and carbohydrates. Estimate the correlation coefficient for theses variables and explain your reasoning. b. Determine the equation for the line of best fit and the correlation coefficient from your calculator. Write them below. How closely did your estimate match the actual correlation coefficient? c. A “Bacon Cheeseburger” from Five Guys has 920 calories. Use your line of best fit to predict the grams of carbohydrates in this burger. Is this an appropriate prediction? Math 2 Week 7 Packet Page 22 Math 2: Algebra 2, Geometry and Statistics Ms. Sheppard-Brick 617.596.4133 http://lps.lexingtonma.org/Page/2434 Name: Date: d. An “Original Slider” from White Castle has 13 grams of carbohydrates. Use your line of best fit to predict the total calories in this burger. Is this an appropriate prediction? 4. A survey was conducted about whether a person was an organ donor and whether they were married or single. The two-way table below summarize the data collected. a. Fill in the missing boxes in the table. b. What percentage of single people are not organ donors? Show or explain how you got your answer. c. What percentage of organ donors are married? Show or explain how you got your answer. d. What percentage of people surveyed are organ donors? Show or explain how you got your answer. e. Determine whether martial status and organ donor status are associated or independent. Show your work by constructing segmented bar graphs. Explain your reasoning. 5. Play the calculator game from class again. Did your performance improve? J Math 2 Week 7 Packet Page 23