Аннотация
Spark for Python Developers aims to combine the elegance and flexibility of Python with the power and versatility of Apache Spark. Spark is written in Scala and runs on the Java virtual machine. It is nevertheless polyglot and offers bindings and APIs for Java, Scala, Python, and R. Python is a well-designed language with an extensive set of specialized libraries. This book looks at PySpark within the PyData ecosystem. Some of the prominent PyData libraries include Pandas, Blaze, Scikit-Learn, Matplotlib, Seaborn, and Bokeh. These libraries are open source. They are developed, used, and maintained by the data scientist and Python developers community. PySpark integrates well with the PyData ecosystem, as endorsed by the Anaconda Python distribution. The book puts forward a journey to build data-intensive apps along with an architectural blueprint that covers the following steps: first, set up the base infrastructure with Spark. Second, acquire, collect, process, and store the data. Third, gain insights from the collected data. Fourth, stream live data and process it in real time. Finally, visualize the information.
The objective of the book is to learn about PySpark and PyData libraries by building apps that analyze the Spark community’s interactions on social networks. The focus is on Twitter data.
![This book is aimed at the data scientist with some familiarity with the R and/or Python programming languages, and with some prior (perhaps spotty or ephemeral) exposure to statistics. Two of the authors came to the world of data science from the world of statistics, and have some appreciation of... Practical Statistics for Data Scientists [50+ Essential Concepts Using R and Python]](https://www.rulit.me/data/programs/images/practical-statistics-for-data-scientists-50-essential-concep_607160.jpg)

![«Как пасти котов» – это книга о лидерстве и руководстве, о том, как первое совмещать со вторым. Это, если хотите, словарь трудных случаев управления IT-проектами.... Как пасти котов [Наставление для программистов, руководящих другими программистами]](https://www.rulit.me/data/programs/images/kak-pasti-kotov-nastavlenie-dlya-programmistov-rukovodyashch_561737.jpg)



Комментарии к книге "Spark for Python Developers"