Аннотация
Apache Spark is written in Scala programming language. To support Python with Spark, Apache Spark community released a tool, PySpark. Using PySpark, you can work with RDDs in Python programming language also. It is because of a library called Py4j that they are able to achieve this.
This is an introductory tutorial, which covers the basics of Data-Driven Documents and explains how to deal with its various components and sub-components.
Audience This tutorial is prepared for those professionals who are aspiring to make a career in programming language and real-time processing framework. This tutorial is intended to make the readers comfortable in getting started with PySpark along with its various modules and submodules.
![The bible of all fundamental algorithms and the work that taught many of today The Art of Computer Programming, Volume 2: Seminumerical Algorithms [3rd Edition]](https://www.rulit.me/data/programs/images/the-art-of-computer-programming-volume-2-seminumerical-algor_492685.jpg)
![Эта книга идеально подходит как для начинающих программистов, так и для тех, кто только собирается осваивать Python, но уже имеет опыт программирования на других... Простой Python [Современный стиль программирования]](https://www.rulit.me/data/programs/images/prostoj-python-sovremennyj-stil-programmirovaniya_568582.jpg)




Комментарии к книге "PySpark"