Data Operations – AICorr.com


Data operations

This is a data operations tutorial.

Within this tutorial, data operations refers to the process of element-wise operations. Element-wise operations in Pandas involve performing operations on individual elements of a DataFrame or Series. For instance, adding or multiplying two dataframes. Operators play an important part in element-wise operations.

We cover some of the most common arithmetic and comparison methods. Let’s look at some examples of element-wise operations with scalar values.

Arithmetic operations

Arithmetic operators deal with processes such as addition, subtraction, multiplication, and division.

import pandas as pd

# Sample DataFrame
df = pd.DataFrame({'A': [1, 2, 3],
                   'B': [4, 5, 6]})

# Addition
df_add = df + 7
print(df_add)

# Subtraction
df_subtract = df - 3
print(df_subtract)

# Multiplication
df_multiply = df * 3
print(df_multiply)

# Division
df_divide = df / 4
print(df_divide)
    A   B
0   8  11
1   9  12
2  10  13

   A  B
0 -2  1
1 -1  2
2  0  3

   A   B
0  3  12
1  6  15
2  9  18

      A     B
0  0.25  1.00
1  0.50  1.25
2  0.75  1.50

Comparison operations

Comparison operators deal with comparing values. This method returns Boolean output (i.e. True or False).

import pandas as pd

# Sample DataFrame
df = pd.DataFrame({'A': [1, 2, 3],
                   'B': [4, 5, 6]})

# Greater than
df_gt = df > 2
print(df_gt)

# Less than
df_lt = df < 3
print(df_lt)

# Equality
df_eq = df == 2
print(df_eq)
       A     B
0  False  True
1  False  True
2   True  True

       A      B
0   True  False
1   True  False
2  False  False

       A      B
0  False  False
1   True  False
2  False  False

Element-wise operations

In this section, we cover element-wise operations of two series or dataframes. This method follows the same logic as with a scalar value. All methods work on both elements within a dataframe as well as two separate dataframes.

Same dataframe

import pandas as pd

# Sample DataFrame
data = {'A': [1, 2, 3],
        'B': [4, 5, 6]}
df = pd.DataFrame(data)

# Element-wise operation (addition)
result = df['A'] + df['B']
print(result)
0    5
1    7
2    9
dtype: int64

Separate dataframes

import pandas as pd

# Sample DataFrame
data_1 = {'A': [1, 2, 3],
        'B': [4, 5, 6]}

df_1 = pd.DataFrame(data_1)

# Sample DataFrame
data_2 = {'A': [3, 5, 7],
        'B': [10, 15, 20]}

df_2 = pd.DataFrame(data_2)

# Element-wise operation (addition)
result = df_1 + df_2
print(result)
    A   B
0   4  14
1   7  20
2  10  26

Applying functions

Pandas provides efficient ways of applying functions to data. There are a two different methods of applying operations, “apply()” and “map()“.

Let’s explore both techniques separately.

Apply()

This method allows you to apply a function along the axis of a dataframe or series. This means you can apply functions row-wise or column-wise. We use the common “sum” function in this scenario.

import pandas as pd

# Sample DataFrame
data = {'A': [1, 2, 3],
        'B': [4, 5, 6]}
df = pd.DataFrame(data)

# Column-wise (sum function)
column_sum = df.apply(sum, axis=0)
print(column_sum)

# Row-wise (sum functions)
row_sum = df.apply(sum, axis=1)
print(row_sum)
A     6
B    15
dtype: int64

0    5
1    7
2    9
dtype: int64

Map()

This method applies a function to every item of a dataframe. The function accepts and returns a scalar. We use the combination of map and lambda in this example.

import pandas as pd

# Sample Series
s = pd.Series([1, 2, 3, 4])

# Map & lambda functions
s_mapped = s.map(lambda x: x ** 2)

print(s_mapped)
0     1
1     4
2     9
3    16
dtype: int64

This is an original data operations educational material created by aicorr.com.

Next: Grouping and Aggregation

We will be happy to hear your thoughts

Leave a reply

0
Your Cart is empty!

It looks like you haven't added any items to your cart yet.

Browse Products
Powered by Caddy
Shopping cart