Explain the most common data mining methods that can be used in business with real world.

Part 1 – 60% of 75%

Using the Superstore data set provided, determine the decline in sales/profits over the years, and evaluate the use

of Excel for pre-processing the data, analysing the data and visualising the data.

You will also need to demonstrate how you can do this practically with the use of any one of Excel functions

1500 words.

Part 2 – 40% of 75%

You are an intern at Nutritionist centre ‘Smile clinic’, who has been tasked to analyse customers data to

see whether they eat rice in their daily mail for healthy diet point of view. The clininc has a copy of

Microsoft Excel and has just downloaded a free copy of the open source SPSS data mining software. The

company has used Microsoft Excel before but not SPSS.

You need to produce a report giving an evaluation how many customers of the Smile Clinic do eat rice.

How many customers are Male and Female? Also, what is Mean, Median of the ages in the data. Further

what is the Mean & Median of participants do eat rice. Show findings on Pie, Bar or Histogram as well.

The dataset is provided in xls and csv format: smile_clinic.xls, smile_clinic.csv.

2.1 Using the smile_clinic.csv provided in conjunction with SPSS give a specific example of clustering.

Show your workings with screenshots and explain you results.

2.2 Explain the most common data mining methods that can be used in business with real world

examples.

2.3 You will need to discuss the advantages/disadvantages of SPSS over Excel. This should be a mix of

theoretical argument as well as practical argument using the csv and text files for the dataset.

1000 words

Word count guideline & marking criteria:

1. Processing the data, analysing the data and visualising the data (60 mark) 1500 – 1650 words

2. Data mining & analysis (40 marks) 1000 – 1100 words

SPSS – Description

Part 1 – 60% of 75%

Using the Superstore data set provided, determine the decline in sales/profits over the years, and evaluate the use

of Excel for pre-processing the data, analysing the data and visualising the data.

You will also need to demonstrate how you can do this practically with the use of any one of Excel functions

such as: IF, LOOKUP, PIVOT TABLES, charts and graphs.

1500 words.

Part 2 – 40% of 75%

You are an intern at Nutritionist centre ‘Smile clinic’, who has been tasked to analyse customers data to

see whether they eat rice in their daily mail for healthy diet point of view. The clininc has a copy of

Microsoft Excel and has just downloaded a free copy of the open source SPSS data mining software. The

company has used Microsoft Excel before but not SPSS.

You need to produce a report giving an evaluation how many customers of the Smile Clinic do eat rice.

How many customers are Male and Female? Also, what is Mean, Median of the ages in the data. Further

what is the Mean & Median of participants do eat rice. Show findings on Pie, Bar or Histogram as well.

The dataset is provided in xls and csv format: smile_clinic.xls, smile_clinic.csv.

2.1 Using the smile_clinic.csv provided in conjunction with SPSS give a specific example of clustering.

Show your workings with screenshots and explain you results.

2.2 Explain the most common data mining methods that can be used in business with real world

examples.

2.3 You will need to discuss the advantages/disadvantages of SPSS over Excel. This should be a mix of

theoretical argument as well as practical argument using the csv and text files for the dataset.

1000 words

Word count guideline & marking criteria:

1. Processing the data, analysing the data and visualising the data (60 mark) 1500 – 1650 words

2. Data mining & analysis (40 marks) 1000 – 1100 words

 

Explain the most common data mining methods that can be used in business with real world.
Scroll to top