{"id":51321,"date":"2021-09-15T00:47:37","date_gmt":"2021-09-15T00:47:37","guid":{"rendered":"https:\/\/papersspot.com\/blog\/2021\/09\/15\/need-help-with-simple-r-script-to-execute-the-following-data-preprocessing-and-statistical-analysis\/"},"modified":"2021-09-15T00:47:37","modified_gmt":"2021-09-15T00:47:37","slug":"need-help-with-simple-r-script-to-execute-the-following-data-preprocessing-and-statistical-analysis","status":"publish","type":"post","link":"https:\/\/papersspot.com\/blog\/2021\/09\/15\/need-help-with-simple-r-script-to-execute-the-following-data-preprocessing-and-statistical-analysis\/","title":{"rendered":"Need help with simple R script to execute the following data preprocessing and statistical analysis."},"content":{"rendered":"<p>Develop a simple R script to execute the following data preprocessing and statistical analysis. Where required show analytical output and interpretations. <br \/>Preprocessing <br \/>Load the file \u201cData set .xlsx\u201d into R (attached here). This file contains information on the times required for each of 337,145 transactions at tax collector facilities in four disguised cities of a county in Florida. This is your master data set. <br \/>Split the data by facility. You can use any of several approaches to this, including the \u201csubset\u201d command which will pull out only those data rows matching a logical condition. If needed help can be found online with this common R command. Bear in mind all spelling, spacing, caps, etc. must be exact for the logical test to work properly. The command is of the form: <br \/>new.data.1 = subset(master.data, facility == &#8220;Hooterville&#8221;) <br \/>3. the numerical portion of your U number as a random number seed and the method demonstrated in class, take a random sample of 70 cases from each facility. Store the sampled observations from each facility in separate data frames. <br \/>Analysis <br \/>Using your 70-case sample, construct a 90% confidence interval on the population mean transaction time for Hooterville. <br \/>Assuming the data in the primary Hooterville data frame represents the population, does your 90% confidence interval include the true population mean on the transaction time variable? <br \/>Use R and your reduced 70-case data set for Oblong. Can you say (\u03b1 = .05) that the population mean transaction time is greater than 8 minutes? How about greater than 9 minutes and 15 seconds? <br \/>Referencing Part 3 above, what \u201ctest against\u201d (mu) value in a two-tailed hypothesis test would yield p = .05 in a two-tailed hypothesis test on the Oblong transaction time? <br \/>Using R and your sample 70-case data sets, show comparative notched boxplots of the four facilities\u2019 transaction time variable. Your boxplots should be displayed side by side in a single graphic with an appropriate title and x-axis labels. Do these plots indicate a possible difference between the transaction times for the two facilities? Do these plots indicate a difference in skewness or number of potential outliers between Hooterville and Pixley? <br \/>Using R and your sample 70-case data sets, does there appear to be a statistically significant difference (\u03b1 = .05) between the mean transaction times for Hooterville and Bunnyville <br \/>Delivery: <br \/>Single MS-Word file showing 1) the R script which executes the above instructions and 2) the results of those instructions. The first line of your script file should be a \u201c#\u201d comment line showing your name (Use &#8216;Sinora&#8217;) as it appears in Canvas. Results should be presented in the order in which they are listed here. Paper requirements: <br \/>Please submit it as an attached MS Word document <br \/>0 % Plagiarism <\/p>\n","protected":false},"excerpt":{"rendered":"<p>Develop a simple R script to execute the following data preprocessing and statistical analysis. Where required show analytical output and interpretations. Preprocessing Load the file \u201cData set .xlsx\u201d into R (attached here). This file contains information on the times required for each of 337,145 transactions at tax collector facilities in four disguised cities of a [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[10],"class_list":["post-51321","post","type-post","status-publish","format-standard","hentry","category-research-paper-writing","tag-writing"],"_links":{"self":[{"href":"https:\/\/papersspot.com\/blog\/wp-json\/wp\/v2\/posts\/51321","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/papersspot.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/papersspot.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/papersspot.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/papersspot.com\/blog\/wp-json\/wp\/v2\/comments?post=51321"}],"version-history":[{"count":0,"href":"https:\/\/papersspot.com\/blog\/wp-json\/wp\/v2\/posts\/51321\/revisions"}],"wp:attachment":[{"href":"https:\/\/papersspot.com\/blog\/wp-json\/wp\/v2\/media?parent=51321"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/papersspot.com\/blog\/wp-json\/wp\/v2\/categories?post=51321"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/papersspot.com\/blog\/wp-json\/wp\/v2\/tags?post=51321"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}