XSEDE HPC Workshop: BIG DATA Part 2

Thursday, September 6, 2018
7:00am to 1:00pm

This workshop already happened.

About This Workshop

Have you ever wondered how to handle data sets so large or complex that traditional data processing applications and techniques are inadequate? In this workshop you will learn about two popular frameworks for distributing and retrieving data: Hadoop and Spark. Hadoop facilitates solving problems involving massive amounts of data and computation via the MapReduce programming model, and Spark builds on this model by keeping working data in memory across parallel operations while still providing fault tolerance. On the second day of this workshop, we will cover two more important Big Data topics: Machine Learning with Spark and Deep Learning with TensorFlow.

This two-day remote workshop is being offered by XSEDE and the Pittsburgh Supercomputing Center. It will focus on topics such as Hadoop and Spark and will be presented using the Wide Area Classroom (WAC) training platform.

Agenda

Wednesday, Sep 5

11:00  Welcome
11:25  Intro to Big Data
12:00  Hadoop
12:30  Intro to Spark
1:00  Lunch break
2:00  Spark
3:30  Spark Exercises
4:30  Spark
5:00  Adjourn

Thursday, Sep 6

11:00  Machine Learning: Recommender System with Spark
1:00  Lunch break
2:00  Deep Learning with TensorFlow
4:30  Bridges: A Big Data Platform
5:00  Adjourn

Event host: Daniel Lucio

Contact information: Karen Ciccone, kacollin@ncsu.edu

Admission information: Registration is required.

Other information: Registered attendees need to bring their own laptops.

Accessibility

If assistive technology, live captioning, or other accommodations would improve your experience at this event, please contact us. We encourage you to contact us early about this to allow sufficient time to meet your access needs.