DataFrames - Part 1


  • Pandas package contains useful functions to work with DataFrames.
  • The iloc property is used to index and slice a DataFrame.
  • describe function is used to obtain a statistical summary of basic data features.
  • The simplest method for data visualisation, is to use Pandas’ in-built functionality.
  • Matplotlib is a comprehensive library for creating static, animated, and interactive visualizations, in Python.

Data Frames - Part 2


  • Quantities based on data from two variables are referred to as bivariate measures.
  • Bivariate properties can be studied and visualised using matplotlib and NumPy.
  • Multivariate data analyses can help to uncover relationships between recorded variables.
  • The functions corr and corrcoef can be used to calculate the \(PCC\).
  • A correlation matrix can be visualised as a heatmap.

Image Handling


  • The imread function can be used to read in and interpret multiple image formats.
  • Masking isolates pixels whose intensity value is below a certain threshold.
  • Colour images typically comprise three channels (corresponding to red, green and blue intensities).
  • Python Image Library (PIL) helps to set and raise default pixel limits for reading in and handling larger images.

Time Series


  • plot_series is a Python function we defined to display multiple time series plots.
  • Data filtering is applied to remove specific and irrelevant components.
  • The Fourier spectrum decomposes the time series into a sum of sine waves.
  • Cross-correlation matrices are used for multivariate analysis.