Part 1 – 60% of 75%
Using the Superstore data set provided, determine the decline in sales/profits over the years, and evaluate the use
of Excel for pre-processing the data, analysing the data and visualising the data.
You will also need to demonstrate how you can do this practically with the use of any one of Excel functions
1500 words.
Part 2 – 40% of 75%
You are an intern at Nutritionist centre ‘Smile clinic’, who has been tasked to analyse customers data to
see whether they eat rice in their daily mail for healthy diet point of view. The clininc has a copy of
Microsoft Excel and has just downloaded a free copy of the open source SPSS data mining software. The
company has used Microsoft Excel before but not SPSS.
You need to produce a report giving an evaluation how many customers of the Smile Clinic do eat rice.
How many customers are Male and Female? Also, what is Mean, Median of the ages in the data. Further
what is the Mean & Median of participants do eat rice. Show findings on Pie, Bar or Histogram as well.
The dataset is provided in xls and csv format: smile_clinic.xls, smile_clinic.csv.
2.1 Using the smile_clinic.csv provided in conjunction with SPSS give a specific example of clustering.
Show your workings with screenshots and explain you results.
2.2 Explain the most common data mining methods that can be used in business with real world
examples.
2.3 You will need to discuss the advantages/disadvantages of SPSS over Excel. This should be a mix of
theoretical argument as well as practical argument using the csv and text files for the dataset.
1000 words
Word count guideline & marking criteria:
1. Processing the data, analysing the data and visualising the data (60 mark) 1500 – 1650 words
2. Data mining & analysis (40 marks) 1000 – 1100 words
SPSS – Description
Part 1 – 60% of 75%
Using the Superstore data set provided, determine the decline in sales/profits over the years, and evaluate the use
of Excel for pre-processing the data, analysing the data and visualising the data.
You will also need to demonstrate how you can do this practically with the use of any one of Excel functions
such as: IF, LOOKUP, PIVOT TABLES, charts and graphs.
1500 words.
Part 2 – 40% of 75%
You are an intern at Nutritionist centre ‘Smile clinic’, who has been tasked to analyse customers data to
see whether they eat rice in their daily mail for healthy diet point of view. The clininc has a copy of
Microsoft Excel and has just downloaded a free copy of the open source SPSS data mining software. The
company has used Microsoft Excel before but not SPSS.
You need to produce a report giving an evaluation how many customers of the Smile Clinic do eat rice.
How many customers are Male and Female? Also, what is Mean, Median of the ages in the data. Further
what is the Mean & Median of participants do eat rice. Show findings on Pie, Bar or Histogram as well.
The dataset is provided in xls and csv format: smile_clinic.xls, smile_clinic.csv.
2.1 Using the smile_clinic.csv provided in conjunction with SPSS give a specific example of clustering.
Show your workings with screenshots and explain you results.
2.2 Explain the most common data mining methods that can be used in business with real world
examples.
2.3 You will need to discuss the advantages/disadvantages of SPSS over Excel. This should be a mix of
theoretical argument as well as practical argument using the csv and text files for the dataset.
1000 words
Word count guideline & marking criteria:
1. Processing the data, analysing the data and visualising the data (60 mark) 1500 – 1650 words
2. Data mining & analysis (40 marks) 1000 – 1100 words