{"id":71544,"date":"2021-11-06T13:31:43","date_gmt":"2021-11-06T13:31:43","guid":{"rendered":"https:\/\/papersspot.com\/blog\/2021\/11\/06\/answer-the-intro-to-data-mining-questions\/"},"modified":"2021-11-06T13:31:43","modified_gmt":"2021-11-06T13:31:43","slug":"answer-the-intro-to-data-mining-questions","status":"publish","type":"post","link":"https:\/\/papersspot.com\/blog\/2021\/11\/06\/answer-the-intro-to-data-mining-questions\/","title":{"rendered":"Answer the Intro to data Mining Questions"},"content":{"rendered":"<p>At the End of the Lecture files attached will have the questions.Questions from Lecture 4 <br \/>1) Define what is the data preprocessing and explain four steps. <br \/>2) In real-world data, tuples with missing values for some attributes are a common occurrence. Describe various methods for handling this problem. <br \/>3) Briefly outline the normalization methods and show an example. <br \/>Questions from Lecture 5 <br \/>4) Consider the data set shown in Table (Table 6.15, 515 page), with an item taxonomy given in Figure 6.25. Example of market basket transactions \u2013 https:\/\/www.geeksforgeeks.org\/apriori-algorithm\/ \u2013 https:\/\/www.softwaretestinghelp.com\/apriori-algori&#8230; \u2013 https:\/\/www.geeksforgeeks.org\/association-rule\/ <\/p>\n<p>5)(a) Consider the approach where each transaction t is replaced by an extended transaction t1 that contains all the items in t as well as their respective ancestors. <br \/> For example, the transaction t = { Chips, Cookies} will be replaced by t1 = {Chips, Cookies, Snack Food, Food}. Use this approach to derive all frequent itemsets (up to size 4) with support \u2265 70%. <br \/> (b) Consider an alternative approach where the frequent itemsets are generated one level at a time. Initially, all the frequent itemsets involving items at the highest level of the hierarchy are generated. Next, we use the frequent itemsets discovered at the higher level of the hierarchy to generate candidate itemsets involving items at the lower levels of the hierarchy. <br \/> For example, we generate the candidate itemset {Chips, Diet Soda} only if {Snack Food, Soda} is frequent. Use this approach to derive all frequent itemsets (up to size 4) with support \u2265 70%. 6). Consider the data set shown in Table. Example of market basket transactions <\/p>\n<p>Answer for the following questions. <br \/>(a) Compute the support for itemsets {e}, {b, d}, and {b, d, e} by treating each transaction ID as a market basket. <br \/>(b) Use the results in part (a) to compute the confidence for the association rules {b, d} \u2192 {e} and {e} \u2192 {b, d}. Is confidence a symmetric measure? <br \/>(c) Repeat part (a) by treating each customer ID as a market basket. Each item should be treated as a binary variable (1 if an item appears in at least one transaction bought by the customer, and 0 otherwise). Use this result to compute the confidence for the association rules {b, d} \u2192 {e} and {e} \u2192 {b, d}. <br \/>(d) Use the result in part (c) to compute the confidence for the association rules {b, d} \u2192 {e} and {e} \u2192 {b, d} <\/p>\n","protected":false},"excerpt":{"rendered":"<p>At the End of the Lecture files attached will have the questions.Questions from Lecture 4 1) Define what is the data preprocessing and explain four steps. 2) In real-world data, tuples with missing values for some attributes are a common occurrence. Describe various methods for handling this problem. 3) Briefly outline the normalization methods and [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[28],"class_list":["post-71544","post","type-post","status-publish","format-standard","hentry","category-research-paper-writing","tag-computer-science"],"_links":{"self":[{"href":"https:\/\/papersspot.com\/blog\/wp-json\/wp\/v2\/posts\/71544","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/papersspot.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/papersspot.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/papersspot.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/papersspot.com\/blog\/wp-json\/wp\/v2\/comments?post=71544"}],"version-history":[{"count":0,"href":"https:\/\/papersspot.com\/blog\/wp-json\/wp\/v2\/posts\/71544\/revisions"}],"wp:attachment":[{"href":"https:\/\/papersspot.com\/blog\/wp-json\/wp\/v2\/media?parent=71544"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/papersspot.com\/blog\/wp-json\/wp\/v2\/categories?post=71544"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/papersspot.com\/blog\/wp-json\/wp\/v2\/tags?post=71544"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}