{"id":51548,"date":"2021-09-18T01:47:33","date_gmt":"2021-09-18T01:47:33","guid":{"rendered":"https:\/\/papersspot.com\/blog\/2021\/09\/18\/data-mining-question-described-briefly-below-in-description\/"},"modified":"2021-09-18T01:47:33","modified_gmt":"2021-09-18T01:47:33","slug":"data-mining-question-described-briefly-below-in-description","status":"publish","type":"post","link":"https:\/\/papersspot.com\/blog\/2021\/09\/18\/data-mining-question-described-briefly-below-in-description\/","title":{"rendered":"Data mining, question described briefly below in description."},"content":{"rendered":"<p>1 Maryland Traffic Violations <br \/>Perform exploration analysis on the Kaggle Maryland Traffic Violations dataset.<br \/> Answer the following questions: <br \/>1. Which colors of the vehicles are more likely to get involved in a traffic<br \/> violation? <\/p>\n<p> 2. Which models of the car are more likely to get involved in a traffic violation? <\/p>\n<p> This is an open-ended question. I encourage you to try as many data preprocessing and exploratory analysis tasks as you can possibly do. I am ready to be<br \/> impressed. <\/p>\n<p> 2 Comments: <br \/>1. You can download the data here: Traffic Violations in Maryland County | Kaggle It\u2019s about 500 MB uncompressed. Kaggle<br \/> Notebook has a limit of 100 GB per dataset, and Google Colab has a limit<br \/> of 70 GB storage. <br \/>2. You may use pluto as it is a powerful server with few restrictions. To work<br \/> a data science project on pluto, the easiest way is to install an anaconda<br \/> under your own directory. Then use ssh tunnel to access your Notebook<br \/> from a browser at any place, such as your home. You may Google \u2019SSH<br \/> Tunnel Jupyter Notebook\u2019 for instructions. <br \/>3. You can also use your own computer. <br \/>4. R is also allowed for this homework. <br \/>5. The most relevant skill-set you may need for this assignment is Pandas.<br \/> You may find a quick tutorial here: Learn Pandas Tutorials | Kaggle <\/p>\n","protected":false},"excerpt":{"rendered":"<p>1 Maryland Traffic Violations Perform exploration analysis on the Kaggle Maryland Traffic Violations dataset. Answer the following questions: 1. Which colors of the vehicles are more likely to get involved in a traffic violation? 2. Which models of the car are more likely to get involved in a traffic violation? This is an open-ended question. [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[28],"class_list":["post-51548","post","type-post","status-publish","format-standard","hentry","category-research-paper-writing","tag-computer-science"],"_links":{"self":[{"href":"https:\/\/papersspot.com\/blog\/wp-json\/wp\/v2\/posts\/51548","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/papersspot.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/papersspot.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/papersspot.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/papersspot.com\/blog\/wp-json\/wp\/v2\/comments?post=51548"}],"version-history":[{"count":0,"href":"https:\/\/papersspot.com\/blog\/wp-json\/wp\/v2\/posts\/51548\/revisions"}],"wp:attachment":[{"href":"https:\/\/papersspot.com\/blog\/wp-json\/wp\/v2\/media?parent=51548"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/papersspot.com\/blog\/wp-json\/wp\/v2\/categories?post=51548"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/papersspot.com\/blog\/wp-json\/wp\/v2\/tags?post=51548"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}