Hands-On Big Data Analytics with PySpark (repost)

Posted By: hill0
Hands-On Big Data Analytics with PySpark by Rudy Lai, Bartłomiej Potaczek
English | 2019 | ISBN: 183864413X | 182 pages | EPUB | 5.36 MB

Use PySpark to easily crush messy data at scale and discover proven techniques for creating testable, immutable, and easily parallelizable Spark jobs

Key Features
• Work with large amounts of agile data using distributed datasets and in-memory caching
• Source data from all popular data hosting platforms, such as HDFS, Hive, JSON, and S3
• Employ the easy-to-use PySpark API to deploy big data analytics for production