Blog
Data analytics
Module 1 : Normal Distribution (Percentile, distribution of means, and chance of occurrence if we assume normal distribution)
Module 2 : Confidence Interval Estimation (Including Sample Size determination)
Module 3 : Inferences from data (Hypothesis testing, i.e., confirming or checking if a claim made about the data. In this module, we dealt with only one sample)
Module 4 : More Inferences from data (Multiple samples)
Module 5 : Regression analysis (Both simple and multiple, apart from basic ANOVA)
Objective
The purpose of the project is for you to apply what you learnt from at least 4 modules on your dataset and make some inferences or estimations. Here I am asking you to do only 4 tests or analysis. But the key is – you bring the data and you come up with the question, and each question/set of analysis represents something you learnt from the Modules (1-5). There should be four different ones. If you wish, you can use two data sources (datasets) to achieve it. It is not necessary all of them have to be done using one dataset.
Data source
There are 3 options, you can choose one of them (there are no restrictions on that)
Bring your own data from work (you can remove any private or confidential information, for example: if you are bringing any sales or cost data of an item/product or service – the name can be masked)
Use data from your previous work or company you have access to (again you can remove any private/confidential information)