Project: Apply Big Data Analytics to a Real Data Set

Goals

Goal #1: motivate you to dive deep into one particular domain area and understand the analytics requirements in that domain

Goal #2: extract useful insights by applying data mining and analysis techniques to a realistic application scenario with real data

Goal #3: leverage a parallel/distributed data processing system to process the real data

Your Tasks

You will need to complete the following tasks.

1. Write a project proposal

  1. Project Title
  2. Objectives of the project
  3. Why the project is interesting/important
  4. What data set will be used
  5. What analytics will be used and what insights would they potentially yield
  6. A timeline with milestones

2. Work on the project

3. Present the project in class

4. Write the project report

Use the ACM LaTeX style files.

Deliverables