You are required to write a report with the following content:
• Provide a high-level survey of advances in data science over the past two years.
• Explain how Spark fits into the field of data science. Compare Spark with its competitors.
• Explain your design and implementation of the machine learning parts in your code,
including the following topics:
o Background of your selected data set
o For each task, which learning algorithm is used, what its key parameters are, and
how you set them
o For each task, comments on and an evaluation of the model learnt

