University of Tennessee



Stat 201 – Project 1 – Spring 2019Due Friday, February 8th, 2019*(11:59 PM in the evening, submitted to Canvas)*Assignments submitted by *11:59pm on Wednesday February 6th will receive +7 bonus pointsLate projects should be counted off 15 points per day late. In this project, you will explore a subset (i.e., a sample) of the data collected from the survey that most Stat 201 students completed this semester. See pages 6-8 for a complete list of the survey questions. You will be including a substantial amount of output within your write-up. INCLUDE ONLY THE OUTPUT NECESSARY TO ANSWER THE PROJECT QUESTIONS. ANY JMP WORK YOU DO TO ANSWER A QUESTION MUST BE INCLUDED AS A SCREENSHOT IN YOUR REPORT.Getting Started: The data are found in the file “Stat 201 Spring 2019 Survey Results.jmp”, which is located on the Stat 201 Webpage under the “Projects” tab. This file contains 1308 responses. In real life situations, researchers would use all the data they have available after conducting a survey. For this project, however, you will get JMP to help you take a random sample from the entire data set so that each student will have different results, and therefore will be turning in a UNIQUE project. The size of the random sample must be 900 plus the last two digits of your UT student ID number. For example, if your UT student ID number is 000314791, you will take a random sample of size 900 + 91 = 991. When you create your random sample from the original JMP file, JMP creates a new file that will be named “Subset of Stat 201 Spring 2019 Survey Results”. You should immediately save a copy of this file by clicking the “File” menu and choosing “Save As…”. JMP will prompt you to keep the same name, which is acceptable, or you can rename it to something like “Stat Project1 – My Data”Taking Screenshots: Although there are many ways to get JMP graphics into a written presentation, we want you to use the “screen shot” method in all cases. Please see the video at for instructions on how to take selective screen shots on a PC or a Mac. Clearly label what question and part you are answering so your project is graded correctly! See page 5 for an example project format.Tutorials and Write-up: See the JMP tutorials at and the Stat 201 YouTube Channel Playlist for instructions on how to get JMP to perform most tasks. Use page 4 of this project for guidance in which tutorial to look at for each question in the project, as well as important “notes” to further help you with certain JMP tasks. In every question that asks you to produce output from JMP, we expect the output you produce to answer the question to be within the write-up. You should put this output immediately after your comments regarding that specific part of the assignment (i.e., not just a series of printouts from JMP at the back of your write-up). You can get help in the Stat 201 Lab with specific questions about the project. You can NOT ask a Stat 201 Lab worker to read your entire project for suggestions on what to change. Your finished work must be submitted within Canvas (see “Assignments”), and must be a Microsoft Word document (.doc or .docx).Important note for Windows (PC) users: Open JMP, click on the File menu and select Preferences. Then, on the left, select Windows Specific. About three-fourths of the way down this window, where it says Auto Hide Menu and Toolbars, click the drop down arrow and select Never. Click OK, then close this window. You should only have to do this once, and Mac users do not need to do this.JMP and Hodges Library computers: Using JMP installed on your own computer is much simpler than using JMP on a library computer! If you choose to use a computer in the library to do your project, be sure to first read the document “Using JMP in the Library”, found on the Stat 201 webpage under the “Projects” tab. Also, you will need to save your project and your random sample subset file to a location you can access later, such as a memory stick. Or, you could e-mail these files to yourself for later use.Writing a Good STAT 201 Project Report: Please take note that on page 9 of this document there is a page titled “Writing a Good Stat 201 Project Report”. This page contains a series of guidelines for the written part of your report. A portion of your grade (9%) is related to following these guidelines.Project QuestionsQuestion One1.Using the full JMP data file, get JMP to select a random sample of size 900 plus the last 2 digits of your student ID number (as described earlier in this document). Report the size of your sample. Although we wouldn’t normally do this, make a histogram of the variable “Q0 - Original Row Number (Identifier)”. Make sure your histogram is in horizontal layout and has a count axis (See the example on page 5 of this project.). Include the summary statistics. Save your JMP file, and use it for all remaining questions. (10 points)Question Two2.a)Choose two categorical variables for this problem. Do not use any variable with more than five levels. For example, the variable “Q15 – Major” has over 200 levels, so this variable would not be an option. Name the two variables you will be using and describe the relationship you expect them to have. NOTE: Do not use the variables in the example project on page 5. (3 points)b)Create a mosaic plot and contingency table for your two variables mentioned in 2a. The contingency table in your output should display “Count” and “Row %” values only. Include this screenshot in your report. (9 point)c)Using the mosaic plot, explain the relationship between these two variables. In your explanation, you must mention some of the “Row %” values. (Notice that the “Row %” values are what the heights of the bars in the Mosaic plot are based on.) (9 points)d)Did the Mosaic plot verify or refute your prediction in part (a)? Do you believe the association you see between these two variables is “strong”? Briefly explain your answer. NOTE: the word “strong” is open to interpretation, so there is no right answer for this, it’s just your opinion. (6 points)e)Go through additional variables in the survey to find two variables that do not have a relationship. Try to find the weakest relationship possible. It is possible to view multiple mosaic plots at one time by including more than one variable in X variable field. Include a screenshot of the weakest relationship you could find along with the contingency table. NOTE – You do not have to find the weakest relationship possible. There are many possible graphics for this question. (12 points)Question Three3.In this question, you will explore the variable “Q61 – Facebook Friends”. Read the full survey question description for this variable (page 7).a)Generate a horizontal boxplot of this variable using graph builder. Within your document, use the “tabulate” option under “analyze” to include the min, max, median, Q1 and Q3. Interpret in context the median. Also, calculate and interpret the IQR from your results. (12 points, make sure to interpret what you report here)b)Use tabulate to create a table for the mean and standard deviation. Include this table in your report. Find the z-score for how many Facebook Friends you have. Show the math for how you found your z-score and explain what the z-score means. (3 point)c)Next you will explore possible differences for answers to this question depending on if they are in Greek life “Q07 - Greek Life”. Prior to producing any output, which group do you think will have more Facebook friends: those involved in Greek life or those not involved? Provide a guess of the difference between medians (in Facebook friends) you expect to find between the two groups. There is no “right” answer to this question, but you must provide a logical explanation for your conclusion. (3 point)d)Generate side-by-side boxplots for these two groups and include the quantiles below. Report and interpret in context the median and IQR for each group. Comment on any differences you see between the summary statistics. (12 points, make sure to interpret what you report here)e)Include another table that shows the mean and median for “Q61 – Facebook Friends”. Explain briefly why the differences between the mean and median is so large for this variable. (6 Points)f)The data is heavily skewed so the normal model does not apply. Use your distribution to show this by calculating according to the normal curve the lowest 2.5% of Facebook friends we would expect people to have. Show your work on how you came to this answer. NOTE – Use the empirical rule to find your answer. (6 Points)Remaining points distribution:Project organization and flow (3 points)Projects should look neat and organized. Use the crop tool in Word if you need to improve screenshots. Your project should read like a report without the prompt of each question.Use of the guidelines on page 9 – (6 points)The opening paragraph on the project should give a short summary (3-5 sentences) of the analysis they’re about to read. The closing paragraph should summarize interesting finds and discuss any ideas regarding further data collection and analysis. Make sure to use an opening and closing that is relevant to this project and your data.JMP Tutorials Needed for Each QuestionQuestionHeadingTutorialNotes1Miscellaneous TopicsTaking a Simple Random Sample1Graphical Display of Quantitative DataHistogram and Box PlotAfter you place “Q0 - Original Row Number (Identifier)” in the Y,Columns box. The red arrow at the top allows you to stack the histogram and put it in horizontal format. The second red arrow has histogram options for a count axis.2b and 2eGraphical Display of Categorical DataMosaic PlotTo get the contingency table to display “Count” and “Row%” only, uncheck everything except “Count” and “Row%” in the drop-down menu next to the “Contingency Table” output. Also, since your data are “unaggregated”, you should place nothing in the “Freq” box.3aGraphical Display of Quantitative DataBox PlotUnder the graph tab click on “graph builder”. Here you can add “Q61 – Facebook Friends” to the x-axis so it is in horizontal layout. Click “Done” and then take a screenshot.3dGraphical Display of Quantitative DataSide-by-Side Box PlotsAfter you place “Q61 – Facebook Friends” in the Y,Columns box, click on “Q07 - Greek Life” and then click the “X” button to produce separate output for the two groups.3a, 3b, 3eGraphical Display of Quantitative DataTabulateUse the “Tabulate” option under “Analyze”. From here hold control (PC) or command (Mac) and select the summary statistics you want then drag these summary statistics to the column field. Next choose the variable you want the summary statistics for and put this variable in to the rows field. Decimals can be changed with the “Change format” option and changing the default of 12 to 6.EXAMPLE FORMAT – Showing Question 1, 2a and 2bSTAT 201 Project #1 – Spring 2019 – Learning about Spring 2019 Stat 201 StudentsSubmitted by Jane Q. StudentNote: Your first three to five sentences should address item #1 on page 9 of this document. Make sure that your opening is unique to you and is not copy/pasted between projects. Your introduction should feel like the introduction to a book which informs the reader.1. Since the last two digits of my student ID number are 22, I took a random sample of size 922.2a.I have picked the variables “Q7 – Greek Life” and “Q3 – Born in TN”. I expect the relationship between them to… (Include reasoning)2b.STAT 201 SURVEY – Spring 2019FOR REFERENCE ONLY - FULL TEXT OF QUESTIONS ASKEDQ1 Which section of Stat 201 are you in?Q2 What is your gender?Q3 How old are you (In years)?Q4 Were you born in Tennessee?Q5 What is your relationship status?Q6 How far do you live from campus?Q7 What was your high school GPA?Q8 Are you a member of a fraternity or sorority??Q9 Are you an only child, oldest child, middle child or youngest child? Pick one answer that best describes your birth order.Q10 Have you ever broken a bone?Q11 Do you have a Roth IRA?Q12 How many pets do you own?Q13 Estimate how much you spend on your pets per year; include veterinary expenses, food, toys, treats, grooming, etc. Enter zero if you have no pets.Q14 How many hours per week do you spend reading assignments from textbooks that your instructors assign Include all classes, not just Stat 201.Q15 What is your major?Q16 How would you identify the economic level of your immediate family?Q17 Are you in the honors program at UT?Q18 What do you expect your starting annual salary (in US dollars) to be when you obtain a college degree?Q19 Do you (or your parents) plan on, or have you (or your parents) already, taken out student loans to pay for your college expenses? ?Q20 How many hours a week do you currently work at a job? If you are not employed, please put 0.Q21 How many languages can you speak fluently? This includes your native language. ?This is the language you first learned.Q22 How would you classify your views on economic political issues?Q23 How would you classify your views on social political issues?Q24 What should happen to Confederate statues?Q25 Should the United States stop making pennies? (Eliminate the penny)Q26 Will humans step foot on Mars?Q27 Select the option below that completes the following sentence in a way that best describes your opinion. ?"Climate change on Earth:Q28 Which of these do you believe to be closest to the truth regarding life on Earth?Q29 Have you ever smoked marijuana?Q30 Do you think marijuana should be legalized at the federal level (For the whole US)?Q31 Should states have the ability to regulate what couples can marry? ?(i.e. defining marriage as only one man and one woman)Q32 Age when you had your first alcoholic beverage. IMPORTANT- Don't count sips or communion. ?This should be an actual drink of alcohol.STAT 201 SURVEY – Spring 2019FOR REFERENCE ONLY - FULL TEXT OF QUESTIONS ASKEDQ33 When you eat out at a restaurant that involves a waitress or waiter, what percent do you usually tip? Enter response as a whole number with no decimals for a percentage from 0 to 100.Q34 Which statement best describes you behavior when you drink water on campus?Q35 What is the most you've paid for a single coffee based drink? ?This includes any size, additions and tips given for the drink.Q36 Do you use any tobacco products??Q37 Is vaping a safer alternative to smoking?Q38 How many times have you cheated in college? This includes looking at another test during an exam, taking another student's work and presenting it as your own and other forms of academic dishonesty.Q39 How many times did you cheat in high school? This includes looking at another test during an exam, taking another student's work and presenting it as your own and other forms of academic dishonesty.Q40 On an average night, how many hours of sleep during the school year do you usually get?Q41 What is the longest number of consecutive hours you've stayed awake?Q42 Have you ever been arrested?Q43 Approximately how many text messages do you send a week?Q44 Approximately how many text messages do you send on the weekend?Q45 On a typical school day last semester, approximately how many text messages would you send during class (while you were attending class)?Q46 What is your favorite app on your phone?Q47 What percentage of your income do you believe you should save in your 20s? Enter response as a whole number with no decimals for a percentage from 0 to 100.Q48 Have you ever purchased perishable food items online?Q49 In the past 6 months, have you purchased a product based on a TV commercial??Q50 How many on-line purchases (not counting music downloads) have you made in the last week?Q51 How often do you use coupons when you shop (not including on-line shopping)??Q52 Roughly, how many selfies have you posted on social media in the past month?Q53 How much are you will to pay to see your favorite musician in concert?Q54 What do you think of?Kanye West?as an individual?Q55 What do you think of?Steven Colbert?as an individual?Q56 Have you ever watched most or all of a live sporting event on a smart phone or tablet?Q57 Have you ever gone to a physical store to check out the features of a product, with the intention of purchasing the item online later??Q58 How do you usually to listen to music?Q59 Have you ever reviewed a business or product on social media (i.e. Twitter, Facebook, ect.)?Q60 Are you "friends" on Facebook with one or more of your parents (include step-parents in your answer)?STAT 201 SURVEY – Spring 2019FOR REFERENCE ONLY - FULL TEXT OF QUESTIONS ASKEDQ61 Approximately how many friends do you have on Facebook? If you don't have a Facebook account, answer 0 for this question.Q62 Approximately how many people have you defriended on Facebook. If you don't have a Facebook account, answer 0 for this question.Q63 In a typical week last semester, how much time (in hours) did you spend on "social media"? Include both time reading social media and time communicating with social media.Q64 The Tennessee Vols are scheduled to play 12 football games in the 2019 season (I.e., not including the SEC championship game, and not including a bowl game). Number of games won reportedQ65 Did you lie at any point on this survey? Writing a Good STAT 201 Project Report Writing a report to your boss about a statistical analysis he has asked you to do is very different than writing a novel, or writing to your Statistics instructor. What does it take to write a good project report? Of course, it’s important to know your audience when you write anything.Let’s assume you are writing your project report for some busy executives in the company, and they have asked you to answer the questions in the project. They are very intelligent people, but they are not “Statisticians”. Assume that these executives have had some basic statistical education, but perhaps a long time ago. Keep this in mind as you complete your project.Below are some guidelines for writing an effective project report:1.The first sentence or two of your report should “orient” the reader. What is this document about? Who is it from? On what date did you complete the analysis?2.Answer each question on the project instructions using correct sentence structure, spelling and grammar. Sentences should be succinct and clear. You can assume the executives have a copy of the questions they asked.3.Avoid using "statistical jargon". Explain the results of the analysis in a way that the executives can understand it.4. As explained in the project instructions, graphics from JMP and/or Excel that address the project question must be imbedded within the document, at the point where the executives need to see them. Don’t make them hunt for the output at the back of your report.5. Avoid including discussion and/or graphics within the report that have no relevance to the question being addressed. 6.Wrap-up and sign off: some sort of closing sentence regarding how happy you were to answer the executives’ questions, and demonstrate your willingness to do similar work in the near future. The word “Sincerely” or “Regards”, followed by your name is a nice touch at the very end!rev. 2015-01-22 ................
................

In order to avoid copyright disputes, this page is only a partial summary.

Google Online Preview   Download