question archive The Centres for Disease Control and Prevention (CDC) is the national public health institute of the United States
Subject:Computer SciencePrice: Bought3
The Centres for Disease Control and Prevention (CDC) is the national public health institute of the United States. Its main aim is to protect people health and safety through the control and prevention of diseases. CDC had to rely on doctor reports of influenza outbreaks. CDC was weeks behind in providing vaccines to the affected patients. Using historical data from the CDC, Google compared search term queries against geographical areas that were known to have had flu outbreaks. Google then found forty five terms correlated with the outbreak of flu. With this data, CDC can act immediately.
1. To effectively identify the terms related to influenza outbreaks, Google may need to apply clustering techniques. Out of the types of clusters introduced , which type or types of clusters would best fit this application? Please justify your answer. (6 marks)
2. To better the clustering performance, what configuration of the clustering would you suggest Google to do according to the characteristics of the input data? Please justify your answer. (6 marks)
3. Another option for Google is to use the classification approach. Please compare the classification and clustering approaches in the context of this scenario, by listing at least three differences between them and the impacts to the sorting process. (3 marks)