question archive System requirement analysis, design and Development Description of the assessment This coursework is completed in 2 parts (assessment 1 and 2)
Subject:WritingPrice:38.99 Bought3
System requirement analysis, design and Development
Description of the assessment
This coursework is completed in 2 parts (assessment 1 and 2). The first part is a report of evaluations of different current available big data platforms and implementation of a data warehouse using one of these platforms.
The second part is to do a big data analytics task applying on the big data process technologies and machine learning algorithms.
Assessment Content
Processing Big Data has many challenges with 4Vs.
The assessment 1 (CW1) requires you to investigate no less than 3 big data supported cloud platforms with a demo example of data warehouse implementation. The Python-SQL-based data warehouse implementation and data used for the demo will be explained in the practical sessions. You should Report your evaluation results of the platforms with at least 8 criteria following the evaluation guide (https://www.firebolt.io/resources/cloud-data-warehouse-evaluation-guide). After evaluating the cloud infrastructure, you should be able to do data analysis tasks by processing a big dataset using Python (ideally should be PySpark). The dataset will be provided.
The suggested platforms are:
BigQuery (Google)
Azure (Microsoft)
Keboola
Red Hat OpenShift
Deepnote
Deliverables:
Report on evaluation of big data cloud platform and data warehouse demo implementation process. The report structure should follow the structure below
1. Introduction 10%
The purpose and scope of the report
• how many platforms you would like to evaluate?
• the criteria for evaluation
• investigation methodology
2. Platform investigation 40%
• Detailed report of evaluation on each platform according to the defined criteria.
• The comparation result
3. Big Data processing and analysis implementation 40%
• Working on a big dataset to enable applying NoSQL, PySpark or similar techniques to do data analysis on one of the cloud platforms or simulate on your own PC. The analysis should include data EDA, classification and price prediction.
• The explanations and screenshots to support this section. Critical discussion on the reason special algorithms are selected to do the work.
• The dataset will be provided. The dataset is relatively big for assessment purpose, and you can download the data from module blackboard.
4. Conclusion 10%
• Summarisation
• Experience (what you have learnt from the assessment) discussion
• Future work (what can be improved)
Submission must be in Word (.doc, .docx) or PDF file format.
Assessment Rubric
The assessment rubric on the next page shows the complete criteria of the CW and how you will be assessed. We will explore the content of the rubric together in a synchronous session on first week session.
When the assessment is returned you will receive a digital version of the rubric showing how you performed against each criterion. You will also receive a short individual audio file that highlights both the strengths of the work and your key areas for development. There will be an opportunity to discuss these with your personal academic tutor.
Assessment weighting: (CW1 100%)
Word Count Up to 2500
Learning Outcomes: 2
Submission Method: Turnitin
Date Set: Uploaded in the BB module assessment space
Purchased 3 times