Pandas for everyone [Python Data Analysis]

Other computer literature, Programs
Added by: Admin, 1 Aug 20
Verified by: Admin, 1 Aug 20
Format: PDF (5768 Kb)

Annotation

Pandas is an open-source Python library for data analysis. It gives Python the ability to work with spreadsheet-like data, with fast loading, manipulation, alignment, and merging, among other operations. To provide these features, Pandas introduces two new data types to Python: Series and DataFrame. The DataFrame represents your entire spreadsheet or rectangular data, whereas the Series is a single column of the DataFrame. A Pandas DataFrame can also be thought of as a dictionary or collection of Series.
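As a minimal sketch of the relationship between the two types, the snippet below builds a DataFrame from a dictionary of Series and then pulls one column back out as a Series. The sample names and ages are hypothetical values invented for illustration; they do not come from the book.

```python
import pandas as pd

# Hypothetical sample data, invented for illustration only.
names = pd.Series(["Ada", "Grace", "Linus"], name="name")
ages = pd.Series([36, 45, 28], name="age")

# A DataFrame built from a dictionary of Series: each key becomes a column.
people = pd.DataFrame({"name": names, "age": ages})

# Selecting a single column returns a Series.
age_column = people["age"]

print(type(people).__name__)      # DataFrame
print(type(age_column).__name__)  # Series
print(list(people.columns))       # ['name', 'age']
```

Building the DataFrame from a dictionary mirrors the "dictionary of Series" view described above; loading from a file would normally replace the hand-written Series.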
Why should you use a programming language like Python and a tool like Pandas to work with data? It boils down to automation and reproducibility. If a particular set of analyses needs to be performed on multiple datasets, a programming language can automate the analysis across those datasets. Although many spreadsheet programs have their own macro programming languages, many users do not use them. Furthermore, not all spreadsheet programs are available on all operating systems. Performing data tasks using a programming language forces the user to keep a running record of all steps performed on the data. I, like many people, have accidentally hit a key while viewing data in a spreadsheet program, only to find out that my results no longer make any sense due to bad data. This is not to say that spreadsheet programs are bad or have no place in the data workflow. They do, but there are better and more reliable tools out there.
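The automation point can be illustrated with a short sketch in which one analysis function is applied unchanged to several datasets. The function name `summarize`, the dataset names, and all values are hypothetical, chosen only for this example; in practice the DataFrames would more likely be read from files.

```python
import pandas as pd


def summarize(df: pd.DataFrame) -> pd.Series:
    """Return the mean of every numeric column in df."""
    return df.mean(numeric_only=True)


# Hypothetical datasets, invented for illustration only.
datasets = {
    "store_a": pd.DataFrame({"units": [10, 20, 30], "price": [1.0, 2.0, 3.0]}),
    "store_b": pd.DataFrame({"units": [5, 15], "price": [4.0, 6.0]}),
}

# Run the same analysis on every dataset; one row per dataset in the result.
summary = pd.DataFrame(
    {name: summarize(df) for name, df in datasets.items()}
).T

print(summary)
```

Because every step lives in the script, rerunning it reproduces the same summary, and the script itself serves as the running record of what was done to the data.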
