SMACK Stack for Data Science培訓

SMACK 是数据平台软件的集合,即 Apache Spark, Apache Mesos, Apache Akka, Apache Cassandra, 和 Apache Kafka。使用 SMACK 插槽,用户可以创建和规模数据处理平台。

这项由导师领导的直播培训(在线或在线)是针对数据科学家,他们希望使用SMACK stack来构建大数据解决方案的数据处理平台。

在本研讨会结束后,参与者将能够:

实施大数据处理的数据管道架构。
建立一个集群基础设施与(6)和(1)。
用 Spark 和 Scala 分析数据。
使用 Apache 管理未结构化数据(0)。

课程格式

互动讲座和讨论。
很多练习和练习。
在现场实验室环境中进行手动实施。

课程定制选项

要申请此课程的定制培训,请联系我们安排。

Thank you for sending your enquiry! One of our team members will contact you shortly.

Thank you for sending your booking! One of our team members will contact you shortly.

課程簡介

介紹

SMACK Stack 概述

什麼是 Apache Spark？Apache Spark 功能
什麼是 Apache Mesos？Apache Mesos 特性
什麼是 Apache Akka？Apache Akka 功能
什麼是 Apache Cassandra？Apache Cassandra 特性
什麼是 Apache Kafka？Apache Kafka 特性

Scala 語言

Scala 語法和結構
Scala 控制流程

準備開發環境

安裝和配置 SMACK 堆疊
安裝和設定 Docker

阿帕奇 Akka

使用執行元件

阿帕奇 Cassandra

創建用於讀取操作的資料庫
使用備份和恢復

連接

創建流
構建 Akka 應用程式
使用 Cassandra 儲存數據
查看連接器

Apache Kafka

使用集群
創建、發佈和使用消息

Apache Mesos

分配資源
運行集群
使用 Apache Aurora 和 Docker
運行服務和作業
在 Mesos 上部署 Spark、Cassandra 和 Kafka

Apache Spark

管理數據流
使用 RDD 和數據幀
執行數據分析

故障排除

處理服務故障和錯誤

總結和結論

最低要求

瞭解數據處理系統

觀眾

數據科學家

14 時間：

需要幫助選擇合適的課程嗎？

客戶評論 (1)

very interactive...

SMACK Stack for Data Science培訓

課程簡介

最低要求

客戶評論 (1)

Richard Langford

Course - SMACK Stack for Data Science

課程分類

This site in other countries/regions

Europe

Asia Pacific

North America

South America

Africa / Middle East

Other sites

SMACK Stack for Data Science培訓

課程簡介

最低要求

客戶評論 (1)

Richard Langford

Course - SMACK Stack for Data Science

相關課程

Introduction to Data Science and AI using Python

Apache Airflow for Data Science: Automating Machine Learning Pipelines

Artificial Intelligence - the most applied stuff - Data Analysis + Distributed AI + NLP

Anaconda Ecosystem for Data Scientists

AWS Cloud9 for Data Science

Big Data Business Intelligence for Telecom and Communication Service Providers

概述

课程目标

目标观众

Introduction to Google Colab for Data Science

A Practical Introduction to Data Science

Data Science Programme

观众：

交货：

Introduction to Graph Computing

Kaggle

Accelerating Python Pandas Workflows with Modin

GPU Data Science with NVIDIA RAPIDS

Python and Spark for Big Data (PySpark)

Apache Spark MLlib

課程分類

Apache Spark

Apache Kafka

Data Science

This site in other countries/regions

Europe

Asia Pacific

North America

South America

Africa / Middle East

Other sites