日本综合久久_特级丰满少妇一级aaaa爱毛片_91在线视频观看_久久999免费视频_99精品热播_黄色片地址

課程目錄: 用Scala和Spark進行大數據分析培訓

4401 人關注
(78637/99817)
課程大綱:

用Scala和Spark進行大數據分析培訓

 

 

 

WEEK 1

Getting Started + Spark Basics

Get up and running with Scala on your computer.

Complete an example assignment to familiarize yourself with our unique way of submitting assignments.

In this week, we'll bridge the gap between data parallelism

in the shared memory scenario (learned in the Parallel Programming course, prerequisite)

and the distributed scenario. We'll look at important concerns that arise in distributed systems,

like latency and failure. We'll go on to cover the basics of Spark,

a functionally-oriented framework for big data processing in Scala.

We'll end the first week by exercising what we learned about Spark

by immediately getting our hands dirty analyzing a real-world data set.

WEEK 2

Reduction Operations & Distributed Key-Value Pairs

This week, we'll look at a special kind of RDD called pair RDDs.

With this specialized kind of RDD in hand, we'll cover essential operations on large data sets,

such as reductions and joins.WEEK 3

Partitioning and Shuffling

This week we'll look at some of the performance implications of using operations like joins.

Is it possible to get the same result without having to pay for the overhead of moving data over the network?

We'll answer this question by delving into how we can partition our data to achieve better data locality,

in turn optimizing some of our Spark jobs.WEEK 4

Structured data: SQL, Dataframes, and Datasets

With our newfound understanding of the cost of data movement

in a Spark job, and some experience optimizing jobs for data locality last week,

this week we'll focus on how we can more easily achieve similar optimizations.

Can structured data help us? We'll look at Spark SQL and its powerful optimizer which uses structure

to apply impressive optimizations. We'll move on to cover DataFrames and Datasets,

which give us a way to mix RDDs with the powerful automatic optimizations behind Spark SQL.


 

主站蜘蛛池模板: 亚洲精品视频在线播放 | 成人免费观看网站 | 亚洲欧美一区二区三区在线 | 久久久123 | 亚洲视频一区二区三区四区 | 高清不卡毛片 | 国产免国产免费 | 亚洲欧洲在线观看视频 | 在线看91 | 日韩网站在线 | 欧美一二三区 | 久久99精品久久久 | 日韩av大片免费看 | 国产精品一区在线 | 久久成人免费视频 | 久久合久久 | 精品日韩 | 日韩国产高清在线观看 | 中文字幕在线播放不卡 | 亚洲欧美一区二区在线观看 | 免费观看羞羞视频网站 | www.久草.com | 亚洲国产精品一区 | 久久一区二区三区四区 | 四虎影院免费在线播放 | 色一情一乱一伦一区二区三区 | 瑞克和莫蒂第五季在线观看 | 欧美日韩亚洲一区 | 午夜欧美一区二区三区在线播放 | 久久福利电影 | 伊人免费视频二 | 美国av片在线观看 | 中文精品一区二区 | 国产精品久久久久久久三级 | 先锋av资源在线 | 老外黄色一级片 | 影音av | 狠狠久久久 | 国产日韩精品久久 | 欧美一二区 | 中文字幕第一页在线 |