Abstract
Apache Spark is written in the Scala programming language. To support Python, the Apache Spark community released PySpark, a tool that lets you work with RDDs (Resilient Distributed Datasets) from Python as well. This is made possible by a library called Py4J.
This is an introductory tutorial that covers the basics of PySpark and explains how to work with its various components and sub-components.
Audience
This tutorial is prepared for professionals who aspire to build a career in programming and real-time processing frameworks. It is intended to make readers comfortable getting started with PySpark and its various modules and submodules.



