================= Using Pandas ================= Install ======= When working with data, often within a DataFrame, the library of choice is Pandas. Install Pandas:: pip install pandas Some OS already supply Pandas but before using the package check it is current:: pip install -U pandas One can use Anaconda, but there are over 100 packages, so one can control the installation better by using Minicondas and install in a similar fashion to pip:: conda install pandas There are three plugins that may require installing, **dash**, **anyio** and **hypothesis**. If working with large amounts of data it is recommended to install **numexpr** for accelerating some numerical operations, and **bottleneck** for certain **nan** evaluations. Other dependancies may be necessary but as a start ensure **matplotlib** and **numpy** are present. If spreadsheet files are used for loading or saving **xlrd**, **xlwt** for xls files or **openpyxl** for reading and writing xlsx files. When working with statistics **SciPy** may be required, although pandas has a useful collection of built-in functions. **Seaborn** can quickly visualise data. Some of its built in functions blend in very nicely with Pandas, many of its built-in tutorial examples are pandas' dataframes. At a pinch the plotting functions of pandas itself can be used instead of matplotlib. Using Pandas ============ One can work with scripts but it is eminantly suitable to use interactively. Some python IDEs (such as Idle and PyScripter) do not accept multiline inputs, check on your IDE of choice. If using Jupyter remember to start it within a directory that you have full control. Open an OS command window, change to a user owned directory, then start with:: jupyter lab This opens a user website in your browser. After the startup procedure is complete it starts in a console. Select the upper tab with **+**, then select the **notebook**. Pandas works flexibly with the dataframes, if the command has no name (handle) then nothing changes, but the results of that command is shown.