Arrays

Introduction

While you may be familiar with arrays, chances are you have not considered what happens when our computer either manipulates an array by adding or removing elements, or retrieves information from an array. In this lesson, we'll consider what happens when we retrieve or manipulate data in an array.

Arrays - under the hood

When we initialize an array in a programming language, the language allocates space in memory for your array, and then points that starting variable to that address in memory. Then the program assigns a fixed amount of memory for each element.

So let's say my array say starts at memory address 100. And assume that the programming language allocates 8 bits for each element in the array. And that it allocates enough space for ten elements evenly spaced in memory.

Now, let's try to think through how a computer program retrieves an element at a specific index.

let my_arr = ['a']

my_arr[0] = 'a'

Well, we first initialize the array and assign the letter 'a' as the first element, the programming language associates the letter "a" to a specific space in memory. In our example, address 100. So then, when we call my_arr[0] all the program has to do is go to address 100, and retreive the element.

So now, what do you think happens if we call my_arr[3], to return what is in that slot. If the my_arr begins at address 100, and we allocate eight bits of space for each, what address does the program go to to retreive the element at index 3?

Is there a formula that we can come up with for retrieval? Yes, it's algebra.

my_arr[3] 

100 + 3*8 = 124

So our programming language knows that if eight bits are allocated to each element, and then to retrieve an element at a specific index, the program simply visits an address by using the following formula:

  • memoryLocationOfElement = arrayStartAddress + indexNumber * bit_allocation

Manipulating array elements

Now that we talked about retrieving elements from an array, let's talk about removing elements from an array.

let arr = [1, 24, 48, 9]
arr.pop  
// 9

arr 
// [1, 24, 48]

Performing an operation like pop is fairly simple. Again let's assume that our array begins at memory address 100.

memory address 100 108 116 124
arr 1 24 48 9
arr.pop 1 24 48 X

So removing from the end of the array, is not so bad. But removing an element from the beginning involves a lot more.

memory address 100 108 116 124
arr 1 24 48 9
arr.shift 24 48 9 X

Looking at the chart above, shifting involves moving every remaining element to a new space in memory. The cost is equal to the number of elements in the array. So the time complexity of shifting is big O(n). Note that to add elements to the beginning of the array also will cost big O(n) as every subsequent element would have to move to different spot in memory.

memory address 100 108 116 124 132
arr 1 24 48 9
arr.unshift(5) 5 1 24 48 9

So unshifting is big O(n) and shifting is big O(n). However, popping and finding elements take the same amount of time regardless the size of the array. That is, the time complexity is big O(1), meaning that the cost of the operation does not depend on the number of elements in the array.

A second problem: too many elements

Remember that to retrieve information from an array, we simply need to apply the formula startingAddress + index * bitAllocation and go to the corresponding address. Well, one problem that occurs with having all of these contiguous elements, is that we must allocate a specific amount of space, say enough space for 10 elements. And now we have to think about what occurs when we want to add eleven elements?

memory address 100 108 116 124 132 140
arr 1 24 48 9 32 song.mp3
arr.push(5) 1 24 48 9 32 song.mp3

Do you see our problem? We want to push another element, but something else is on those eight bits. If we move our new element to a different location, our formula for retrieving elements no longer works. Instead what we do, is copy our array into a new location in memory where there is enough space. However, notice that the cost of doing this is big O(n) as we must incur a cost for each element we copy over.

new memory address with wide open space 300 308 316 324 332 340
arr 1 24 48 9 32
arr.push(5) 1 24 48 9 32 5

Summary

We saw in this section that some of the strengths and weaknesses of using an array. Retrieving elements by index and adding elements to the end of the array has a time complexity of big O(1), while adding or removing elements at the beginning of an array is big O(n). We also saw that because operations in our array rely on using neighboring locations in memory, we can run out of space.

But do not despair, there is alternative data structure that does not rely on elements having contiguous memory addresses and is less costly for adding and removing elements from the beginning. That is a linked list. We will learn about it in the next section.