Stat 201 – University of TN Knoxville



Stat 201 – Project 2 – Spring 2020
Due Monday, March 9th, 2020 (by 11:59pm, submitted to Canvas)
Assignments submitted by 11:59pm on Friday, March 6th will receive +7 bonus points.
Late projects will be counted off 15 points per day late.

New Project File: For project 2 you will be using “STAT 201 – Spring 2020 – Project 2.jmp”. This file can be found on the STAT 201 webpage under the “Projects” tab.

Getting Started: In this project, you will explore a survey taken by students. See page 7 for a complete list of the questions asked in this survey (but don’t answer these questions!). Please be aware that some responses to the survey have been deleted, mostly to ensure anonymity of the results. You will be including a substantial amount of output within your write-up. INCLUDE ONLY THE OUTPUT NECESSARY TO ANSWER THE PROJECT QUESTIONS.

The data are found in the file “STAT 201 – Spring 2020 – Project 2.jmp”, which is located on the STAT 201 webpage under the “Projects” tab. This file contains 1308 responses. In real life situations, researchers would use all of the data they have available after conducting a survey. For this project, however, you will get JMP to help you take a random sample from the entire data set so that each student will have different results, and therefore will be turning in a UNIQUE project. The size of the random sample must be 500 plus the last two digits of your UT student ID number. For example, if your UT student ID number is 000314791, you will take a random sample of size 500 + 91 = 591.

When you create your random sample from the original JMP file, JMP creates a new file that will be named “Subset of STAT 201 – Spring 2020 – Project 2”. You should immediately save a copy of this file by clicking the “File” menu and choosing “Save As…”.
JMP will prompt you to keep the same name, which is acceptable, or you can rename it to something like “Stat Project2 – My Data”.

Taking Screenshots: Although there are many ways to get JMP graphics into a written presentation, we want you to use the “screen shot” method in all cases. Please see the video for instructions on how to take selective screen shots on a PC or a Mac. See page 6 for an example project format.

Tutorials and Write-up: Video tutorials are supplied for each question to help with the JMP output. You should put this output immediately after your comments regarding that specific part of the assignment (i.e., not just a series of printouts from JMP at the back of your write-up). You can get help in the Stat 201 Lab with specific questions about the project. You can NOT ask a Stat 201 Lab worker to read your entire project for suggestions on what to change. Your finished work must be submitted within Canvas (see “Assignments”), and must be a Microsoft Word document (.doc or .docx).

JMP and Hodges Library Computers: Using JMP installed on your own computer is much simpler than using JMP on a library computer! If you choose to use a computer in the library to do your project, be sure to first read the document “Using JMP in the Library”, found in MyLab under the Project Files tab. Also, you will need to save your project and your random sample subset file to a location you can access later, such as a memory stick. You could also e-mail these files to yourself for later use.

Writing a Good STAT 201 Project Report: Please take note that on page 10 of this document there is a page titled “Writing a Good Stat 201 Project Report”. This page contains a series of guidelines for the written part of your report. A portion of your grade (10%) is related to following these guidelines.

Project Questions

Prompt

The University of Tennessee Knoxville wants to understand its students better.
You’ve been commissioned to write a report on data collected by the university. The goal of the report is open-ended. It is important to write your executive summary after completing your report. In your executive summary, make sure to clearly state the questions you examined throughout the report, your key findings, and finally your suggestions. At the end of your report, include a short outro that examines future possibilities for research that go beyond the current report. This section of the report is worth 10 points and is mentioned in the guidelines (Video help) at the end of the report.

General Guidelines

Do not include numbers for questions. The report should look like a professional report given to the executives of the University of TN Knoxville.
Include screenshots that communicate information well. Make sure they are sized properly and are easy to read. Crop images to remove excess parts of the image.
Links next to questions are video help showing how to use JMP.

Question One – Taking a Random Sample (Click Here for Help Video)

(8 points) Using the JMP data file specified earlier, get JMP to select a random sample of size 500 plus the last 2 digits of your student ID number. Report the sample size in your executive summary. All remaining questions will use this random sample. Do not include a screenshot of the random sample.

Question Two – Displaying Categorical Data (Help Video 1 / Help Video 2 / Help Video 3)

The first thing the University of Tennessee wants to know is whether Q16 - Family Economic Level is associated with another categorical variable in the data set.

(5 points) Create a graphical display of your choosing that displays Q16 - Family Economic Level.
Your display should have proper titles and include the numeric values on the graphic.
(3 points) Include a short write-up of the graphical display mentioning the values in the display.
(3 points) Use the tabulate option to add a table that shows the frequency of Q16 - Family Economic Level by another variable of your choosing. Include Q16 - Family Economic Level as the row variable and another variable of your choosing as the column variable. Include in the table the counts and row percentages.
(3 points) Include a write-up of the relationship between the two variables. Be specific about what the relationship, or lack of relationship, between the two variables is.

Question Three – Displaying Quantitative Data (Help Video 1 / Help Video 2)

The second question of interest to UT is how the number of hours students work is connected to their family economic level. Using your random sample, analyze the Q20 – Weekly Hours Worked variable.

(4 points) Create a histogram in horizontal layout. The histogram should include a count axis and summary statistics. In the summary statistics, have JMP report the IQR along with the default summary statistics it reports.
(3 points) Interpret the histogram for the executives, making sure to go over the shape, center and spread. Make sure to use the proper measure of center and spread based on the shape of the distribution.
(4 points) Use tabulate to create a table displaying Q20 – Weekly Hours Worked compared to Q16 - Family Economic Level. Include in the table the median and IQR.
(3 points) Interpret the key differences you see between the distributions based on these values.

Question Four – Decision Tree (Click here for Help Video)

UT has tasked you with choosing a two-level categorical variable that is of interest to you and relevant to its goal of helping students at UT.
Pick a variable of interest to you and answer the following questions.

(3 points) Report the variable you selected and the reason you selected it.
(3 points) Before making the decision tree, remove variables that are identifiers or act like identifiers. Mention the removal of these variables in your report.
(6 points) Place the variable you selected in the “Y, Response” box. Place all other variables in the “X, Factor” box. Be sure to remove your Y variable from the “X, Factor” box. Produce three (3) “splits”. Create a screenshot of the decision tree that includes the R-squared, the tree itself and a leaf report. This can be done with three separate screenshots. Make sure the screenshots are large enough to read but not too large. An example format is given on page 6.
NOTE – The example decision tree does not contain a leaf report in the output. Make sure to include a screenshot of your leaf report in your report.
(3 points) Examine the first split in the tree and explain the nature of the association between your Y-variable and the variable in the first split.
(3 points) Look through the leaf report and find the leaf with the most people in it. Describe the X variables that characterize the people in this leaf.
(3 points) Do you believe UT can use this information to improve the university? If yes, how can they utilize this information? If no, what variables could they collect in another study that might be related to this variable?

Question Five – Linear Regression (Click Here for Help Video)

You will now do a regression analysis using two quantitative variables of your choosing.
The two variables you pick should be something you want to explain (Y) and a variable you think might explain it (X).

(3 points) Explain the importance of the variables you selected.
(3 points) Produce a scatterplot of your Y and X variables using JMP’s Analyze - Fit Y by X platform.
(5 points) Fit a least-squares regression line to your scatterplot, and include the scatterplot with the line, and all resulting output in your report.
(5 points) Examine your scatterplot for potential outliers and report whether or not your data has outliers. Even if your data does not have outliers, identify the most extreme residual in your data set. Include a short write-up regarding this individual and some of their values.
(5 points) Report the value of R squared. Interpret this value: don’t comment on the magnitude of this number; tell the reader what this number means.
(6 points) Is the linear relationship “statistically significant”? How do you know?

Additional point values:

Project organization and flow (6 points)
Projects should look neat and organized. Use the crop tool in Word if you need to improve screenshots. Your project should read like a report without the prompt of each question. Large blank spaces and images that have areas that should be cropped will lose 1 point per instance.

Use of the guidelines on page 10 and the help video – (10 points) (Click Here for Help Video)
The opening paragraph of the project should give an executive summary (4-6 sentences) of the analysis the reader is about to read. The video above goes into more detail on how to structure your executive summary. It is important that your executive summary is impactful. The closing paragraph should summarize interesting findings and discuss any ideas regarding further data collection and analysis. Think of the outro as a “To be continued” for the project where you talk about future work you might complete based on the work in this project.
Make sure to use an opening and closing that are relevant to this project and your data.

EXAMPLE FORMAT – Showing Decision Tree Output

Note: We suggest you use the majority of one full page of your report to clearly show the decision tree. Not all decision trees will be this big, but make sure yours is large enough to be read. The following decision tree was made using Spring 2016 data and is not possible to replicate with Spring 2020 data. Your decision tree should follow the guidelines in the project regarding the number of splits. If you have extra space on the page, use this space to answer questions in the report.

STAT 201 SURVEY
FOR REFERENCE ONLY - FULL TEXT OF QUESTIONS ASKED

Q0 Which section of Stat 201 are you in?
Q1 What is your gender?
Q2 How old are you (In years)?
Q3 Were you born in Tennessee?
Q4 What is your relationship status?
Q5 How far do you live from campus?
Q6 What was your high school GPA?
Q7 Are you a member of a fraternity or sorority?
Q8 Are you an only child, oldest child, middle child or youngest child? Pick one answer that best describes your birth order.
Q9 Have you ever broken a bone?
Q10 Do you have a Roth IRA?
Q11 How many pets do you own?
Q12 Estimate how much you spend on your pets per year; include veterinary expenses, food, toys, treats, grooming, etc. Enter zero if you have no pets.
Q13 How many hours per week do you spend reading assignments from textbooks that your instructors assign? Include all classes, not just Stat 201.
Q14 What is your major?
Q15 How many credit hours are you taking this semester?
Q16 How would you identify the economic level of your immediate family?
Q17 Are you in the honors program at UT?
Q18 What do you expect your starting annual salary (in US dollars) to be when you obtain a college degree?
Q19 Do you (or your parents) plan on, or have you (or your parents) already, taken out student loans to pay for your college expenses?
Q20 How many hours a week do you currently work at a job?
If you are not employed, please put 0.
Q21 How many languages can you speak fluently? This includes your native language. This is the language you first learned.
Q22 How would you classify your views on economic political issues?
Q23 How would you classify your views on social political issues?
Q24 What should happen to Confederate statues?
Q25 Should the United States stop making pennies? (Eliminate the penny)
Q26 Will humans step foot on Mars?
Q27 Select the option below that completes the following sentence in a way that best describes your opinion. "Climate change on Earth:"
Q28 Which of these do you believe to be closest to the truth regarding life on Earth?
Q29 Have you ever smoked marijuana?
Q30 Do you think marijuana should be legalized at the federal level (For the whole US)?
Q31 Should states have the ability to regulate what couples can marry? (i.e. defining marriage as only one man and one woman)
Q32 Age when you had your first alcoholic beverage. IMPORTANT - Don't count sips or communion. This should be an actual drink of alcohol.
Q33 When you eat out at a restaurant that involves a waitress or waiter, what percent do you usually tip? Enter response as a whole number with no decimals for a percentage from 0 to 100.
Q34 Which statement best describes your behavior when you drink water on campus?
Q35 What is the most you've paid for a single coffee-based drink? This includes any size, additions and tips given for the drink.
Q36 Do you use any tobacco products?
Q37 Is vaping a safer alternative to smoking?
Q38 How many times have you cheated in college? This includes looking at another test during an exam, taking another student's work and presenting it as your own and other forms of academic dishonesty.
Q39 How many times did you cheat in high school?
This includes looking at another test during an exam, taking another student's work and presenting it as your own and other forms of academic dishonesty.
Q40 On an average night, how many hours of sleep during the school year do you usually get?
Q41 What is the longest number of consecutive hours you've stayed awake?
Q42 Have you ever been arrested?
Q43 Approximately how many text messages do you send a week?
Q44 Approximately how many text messages do you send on the weekend?
Q45 On a typical school day last semester, approximately how many text messages would you send during class (while you were attending class)?
Q46 What is your favorite app on your phone?
Q47 What percentage of your income do you believe you should save in your 20s? Enter response as a whole number with no decimals for a percentage from 0 to 100.
Q48 Have you ever purchased perishable food items online?
Q49 In the past 6 months, have you purchased a product based on a TV commercial?
Q50 How many on-line purchases (not counting music downloads) have you made in the last week?
Q51 How often do you use coupons when you shop (not including on-line shopping)?
Q52 Roughly, how many selfies have you posted on social media in the past month?
Q53 How much are you willing to pay to see your favorite musician in concert?
Q54 What do you think of Kanye West as an individual?
Q55 What do you think of Stephen Colbert as an individual?
Q56 Have you ever watched most or all of a live sporting event on a smart phone or tablet?
Q57 Have you ever gone to a physical store to check out the features of a product, with the intention of purchasing the item online later?
Q58 How do you usually listen to music?
Q59 Have you ever reviewed a business or product on social media (i.e. Twitter, Facebook, etc.)?
Q60 Are you "friends" on Facebook with one or more of your parents (include step-parents in your answer)?
Q61 Approximately how many friends do you have on Facebook?
If you don't have a Facebook account, answer 0 for this question.
Q62 Approximately how many people have you defriended on Facebook? If you don't have a Facebook account, answer 0 for this question.
Q63 In a typical week last semester, how much time (in hours) did you spend on "social media"? Include both time reading social media and time communicating with social media.

Writing a Good STAT 201 Project Report

Writing a report to your boss about a statistical analysis he has asked you to do is very different from writing a novel, or writing to your Statistics instructor. What does it take to write a good project report? Of course, it’s important to know your audience when you write anything.

Let’s assume you are writing your project report for some busy executives in the company, and they have asked you to answer the questions in the project. They are very intelligent people, but they are not “Statisticians”. Assume that these executives have had some basic statistical education, but perhaps a long time ago. Keep this in mind as you complete your project.

Below are some guidelines for writing an effective project report:

1. The first paragraph of the report should be an executive summary. The executive summary should be a 4-6 sentence summary that orients the reader and highlights the key points of the report. The goal of the executive summary in this report is to orient the reader to the problem, explain key findings and then give a solution or suggestion.
2. Answer each question on the project instructions using correct sentence structure, spelling and grammar. Sentences should be succinct and clear. You can assume the executives have a copy of the questions they asked.
3. Avoid using "statistical jargon".
Explain the results of the analysis in a way that the executives can understand.
4. As explained in the project instructions, graphics from JMP and/or Excel that address the project question must be embedded within the document, at the point where the executives need to see them. Don’t make them hunt for the output at the back of your report.
5. The closing paragraph should mention key findings, but this is not the primary focus. The primary focus is on future research opportunities. The goal is to bring innovation and imagination to the executives. The closing paragraph should be 4-6 sentences in length.