Mining Big Data Sets – Mining Big Data Sets – Mining Big Data Sets



Mining Big Data Sets – Mining Big Data Sets – Mining Big Data Sets

1 1


miri-od-dm-seminar

Seminar "Mining Big Data Sets". Master in Innovation and Research in Informatics, FIB-UPC

On Github chdoig / miri-od-dm-seminar

Mining Big Data Sets

Open Data Seminar- MIRI - FIB-UPC

by Christine Doig and Maryam Pashmi

Index

Introduction
  • Open Data context
  • Data Mining
  • Big Data
Applications and algorithms Big Data story and evolution Map reduce framework Data Mining Tools
  • Mahout
  • R on Hadoop
  • Cascading
  • Python
Summary Resources and Recommendations

Mining Big Data Sets

Introduction Applications and algorithms Big Data story and evolution Map reduce framework Data Mining Tools Summary Resources and Recommendations

1. Introduction

Open Data

Source: Oscar Romero, OpenData, MIRI-FIB-UPC

1. Introduction

Open Data Syllabus

The focus of this course is on enriching the available own data (i.e., data owned by the organization) with external repositories (special attention will be paid on Open Data), in order to gain insights into the organization business domain

  • MD analysis
  • Data Mining

1. Introduction

Data Mining

Discovering patterns

1. Introduction

Data Mining

  • Regression
  • Classification
  • Clustering

1. Introduction

Big Data

  • Volume
  • Velocity
  • Variety

Mining Big Data Sets

Introduction Applications and algorithms Big Data story and evolution Map reduce framework Data Mining Tools Summary Resources and Recommendations

2. DM: Applications and algorithms

Data Mining algorithms

  • Recommender systems

Spotify, Amazon, Netflix

2. DM: Applications and algorithms

Applications and algorithms

  • Recommender systems

Spotify, Amazon, Netflix

2. DM: Applications and algorithms

Applications and algorithms

  • Market Basket Analysis

Target

2. DM: Applications and algorithms

Applications and algorithms

2. DM: Applications and

Mining Big Data Sets

Context and Introduction Applications and algorithms Big Data story and evolution Map reduce framework Data Mining Tools Summary Resources and Recommendations

3. Big data story and evolution

Mining Big Data Sets

Context and Introduction Applications and algorithms Big Data story and evolution Map reduce framework Data Mining Tools Summary Resources and Recommendations

4. Map reduce framework

4. Map reduce framework

Mining Big Data Sets

Context and Introduction Applications and algorithms Big Data story and evolution Map reduce framework Data Mining Tools Summary Resources and Recommendations

5. DM Tools

5. DM Tools

Mining Big Data Sets

Context and Introduction Applications and algorithms Big Data story and evolution Map reduce framework Data Mining Tools Summary Resources and Recommendations

6. Summary

Mining Big Data Sets

Context and Introduction Applications and algorithms Big Data story and evolution Map reduce framework Data Mining Tools Summary Resources and Recommendations

7. Resources and Recommendations

Resources

Mining Massive Datasets: Stanford course Book and Slides Intro to Hadoop and Mapreduce: Udacity MOOC course