What is NumPy in Python

The Python programming language provides a rich set of high-level data structures such as lists for enumerating a collection of objects. However, these structures are not ideally suited for high-performance numerical computations. For example, lists are very flexible but also slow to process in numerical computations. In the mid-90s, an international team of volunteers started to develop a data-structure for efficient array computation. This structure evolved into what is now known as the N-dimensional NumPy array.

The NumPy package, which comprises the NumPy array as well as many other mathematical functions, has found wide-spread adoption in academia, and industry, including in data science. In the Python world, NumPy arrays are now the standard representation for numerical data.

Basic Usage

To start using Numpy, you must first import the package in your working environment. The code examples below will assume that NumPy is imported as follows:

import numpy as np

To see that it is imported properly, you can check its version as below:

>>> import numpy as np
>>> np.__version__
'1.12.0'
>>>

The NumPy ndarray

One of the key features of NumPy is its N-dimensional array object, or ndarray. Arrays are similar to lists in Python, except that every element of an array must be of the same type, typically a numeric type like float or int. Arrays make operations with large amounts of numeric data very fast and are generally much more efficient than lists. The array has some important characteristics:

  • All elements of an array have the same type.
  • Arrays can have multiple dimensions.
  • NumPy arrays have a fixed size at creation, unlike Python lists (which can grow dynamically). Changing the size of an ndarray will create a new array and delete the original.

Creating NumPy Arrays

Arrays can be created in several ways. For example, we can create an array from a list as shown below:

import numpy as np

#Create a one-dimensional array from a list
prices = [12.6, 13.5, 13.2, 14.1, 15.5]
priceArray = np.array(prices)

#print the array
print(priceArray)

#You can also check the datatype of the array
print(priceArray.dtype)

Executing the script will print the following to the console:

C:\pythondatascience>numpyarray.py
[ 12.6  13.5  13.2  14.1  15.5]
float64

Arrays can be multidimensional. Unlike lists, different axes are accessed using commas inside bracket notation. Here is an example with a two-dimensional array (e.g., a matrix) in Python interactive mode:

>>> import numpy as np
>>> a = np.array([[1, 2, 3], [4, 5, 6]], float)
>>> a
array([[ 1.,  2.,  3.],
       [ 4.,  5.,  6.]])
>>> a[0,0]
1.0
>>> a[1,1]
5.0
>>>

The above array is two-dimensional. We can access the items in the array just like you would with any other Python object using the square brackets and specifying the item index.

Exercise

Create a new list containing the returns you earned on your stock portfolio each day over the past 10 days. Use this list of create an ndarray. Print the array in the console and share your results in the forum.

Related Downloads

Data Science in Finance: 9-Book Bundle

Data Science in Finance Book Bundle

Master R and Python for financial data science with our comprehensive bundle of 9 ebooks.

What's Included:

  • Getting Started with R
  • R Programming for Data Science
  • Data Visualization with R
  • Financial Time Series Analysis with R
  • Quantitative Trading Strategies with R
  • Derivatives with R
  • Credit Risk Modelling With R
  • Python for Data Science
  • Machine Learning in Finance using Python

Each book includes PDFs, explanations, instructions, data files, and R code for all examples.

Get the Bundle for $29 (Regular $57)
JOIN 30,000 DATA PROFESSIONALS

Free Guides - Getting Started with R and Python

Enter your name and email address below and we will email you the guides for R programming and Python.

Data Science in Finance: 9-Book Bundle

Data Science in Finance Book Bundle

Master R and Python for financial data science with our comprehensive bundle of 9 ebooks.

What's Included:

  • Getting Started with R
  • R Programming for Data Science
  • Data Visualization with R
  • Financial Time Series Analysis with R
  • Quantitative Trading Strategies with R
  • Derivatives with R
  • Credit Risk Modelling With R
  • Python for Data Science
  • Machine Learning in Finance using Python

Each book comes with PDFs, detailed explanations, step-by-step instructions, data files, and complete downloadable R code for all examples.