Intermediate

26 min read

Pandas Series — Creation, Indexing, Operations & Methods

Master Pandas Series for CBSE Class 12 Informatics Practices. Covers creating Series from list dictionary ndarray and scalar value, Series attributes dtype size shape index values, head and tail functions, label-based loc indexing vs position-based iloc indexing with slicing difference, boolean indexing with single and multiple conditions, arithmetic operations element-wise alignment NaN for mismatched indices, statistical methods sum mean max min count with fully verified board-level output prediction questions.

New

Pandas is the most important Python library for data analysis — and it is the heart of the CBSE Class 12 Informatics Practices syllabus, carrying 25 marks in theory alone. Pandas provides two primary data structures: Series and DataFrame. A Series is the simpler of the two — a one-dimensional, labelled array that can hold any data type. Think of it as a single column of a spreadsheet with row labels. Understanding Series deeply — how to create, access, slice, filter and compute on it — builds the exact foundation needed for the more powerful DataFrame. Board exam questions on Series include output prediction, code completion, and identifying the correct method for a given task.

1. Introduction to Pandas

Pandas (Panel Data + Python) is an open-source data analysis library built on top of NumPy. It provides fast, flexible, and expressive data structures for working with structured (tabular) data.

import pandas as pd    # Standard import convention
import numpy  as np    # Often needed for array creation

Structure	Dimensions	Analogy	Example
Series	1-D (one column)	Single column of a spreadsheet	Student marks in one subject
DataFrame	2-D (rows + columns)	Full spreadsheet or SQL table	Student records with Name, Age, Marks

2. Pandas Series — Meaning and Structure

A Pandas Series is a one-dimensional labelled array capable of holding data of any type — integers, floats, strings, booleans, or Python objects. It has two components:

Index: Labels for each element. If not specified, defaults to 0, 1, 2, ...
Values: The actual data stored in the Series.

import pandas as pd

s = pd.Series([10, 20, 30, 40, 50])
print(s)
# Output:
# 0    10
# 1    20
# 2    30
# 3    40
# 4    50
# dtype: int64

The left column (0, 1, 2...) is the index; the right column is the values. The last line shows the dtype (data type).

3. Creating a Series — Four Methods

Method 1 — From a Python List

s1 = pd.Series([10, 20, 30, 40, 50])
# Default index: 0, 1, 2, 3, 4

# With custom index
s2 = pd.Series([10, 20, 30], index=['a', 'b', 'c'])
print(s2)
# a    10
# b    20
# c    30
# dtype: int64

Method 2 — From a Python Dictionary

Dictionary keys become the index; values become the data.

marks = pd.Series({'Maths': 95, 'Science': 88, 'English': 76})
print(marks)
# Maths      95
# Science    88
# English    76
# dtype: int64

Method 3 — From a NumPy Array (ndarray)

import numpy as np

arr = np.array([100, 200, 300])
s3 = pd.Series(arr, index=['a', 'b', 'c'])
print(s3)
# a    100
# b    200
# c    300
# dtype: int64

Method 4 — From a Scalar Value

A single value repeated for all index positions — index is mandatory when creating from scalar.

s4 = pd.Series(5, index=[0, 1, 2, 3])
print(s4)
# 0    5
# 1    5
# 2    5
# 3    5
# dtype: int64

4. Series Attributes

Attribute	Description	Example output (for s with 5 int elements)
`s.dtype`	Data type of elements	`int64`
`s.size`	Total number of elements	`5`
`s.shape`	Tuple showing dimensions	`(5,)`
`s.index`	The index labels of the Series	`Index(['a','b','c','d','e'])`
`s.values`	The data as a NumPy array	`[10 20 30 40 50]`
`s.name`	Name of the Series (if assigned)	`None` by default

5. head() and tail() Functions

s = pd.Series([10,20,30,40,50], index=['a','b','c','d','e'])

print(s.head(3))   # First 3 elements
# a    10
# b    20
# c    30
# dtype: int64

print(s.tail(2))   # Last 2 elements
# d    40
# e    50
# dtype: int64

print(s.head())    # Default: first 5 elements
print(s.tail())    # Default: last 5 elements

Key point: Default value of n in both head() and tail() is 5.

6. Indexing and Slicing a Series

There are three ways to access elements in a Series:

Method	Syntax	Based on	Slice includes end?
Label-based	`s['label']` or `s.loc['label']`	Index label name	Yes — end label included
Position-based	`s.iloc[position]`	Integer position (0-based)	No — end position excluded (like Python lists)
Direct position	`s[0]`, `s[1:4]`	Works only with default (integer) index	Slicing excludes end

s = pd.Series([10,20,30,40,50], index=['a','b','c','d','e'])

# Single element access
print(s['b'])          # 20  (label-based)
print(s.loc['b'])      # 20  (same)
print(s.iloc[1])       # 20  (position: index 1)

# Slicing — label-based (END INCLUDED)
print(s.loc['b':'d'])
# b    20
# c    30
# d    40   ← 'd' IS included

# Slicing — position-based (END EXCLUDED)
print(s.iloc[1:4])
# b    20
# c    30
# d    40   ← position 4 (e) is NOT included

⚠️ Critical Difference — Most Tested in Board Exams:

s.loc['b':'d'] → includes both 'b' and 'd' (label-based — end inclusive)
s.iloc[1:4] → includes positions 1, 2, 3 — position 4 is excluded (position-based — end exclusive)

7. Boolean Indexing

Boolean indexing filters a Series by applying a condition — only elements satisfying the condition are returned.

marks = pd.Series([45, 78, 90, 55, 88],
                  index=['A', 'B', 'C', 'D', 'E'])

# Get all marks greater than 60
print(marks[marks > 60])
# B    78
# C    90
# E    88
# dtype: int64

# Multiple conditions using & (and) | (or)
print(marks[(marks >= 50) & (marks <= 80)])
# B    78
# D    55
# dtype: int64

Important: Use & (not and) and | (not or) for element-wise operations on Series. Always wrap each condition in parentheses ().

8. Mathematical Operations on Series

Arithmetic Operations

Operations are applied element-wise — and aligned by index label, not position.

s1 = pd.Series([10, 20, 30], index=['a', 'b', 'c'])
s2 = pd.Series([1,  2,  3],  index=['a', 'b', 'c'])

print(s1 + s2)   # a=11, b=22, c=33
print(s1 * s2)   # a=10, b=40, c=90
print(s1 - s2)   # a=9,  b=18, c=27
print(s1 / s2)   # a=10.0, b=10.0, c=10.0

# Scalar operations
print(s1 + 5)    # a=15, b=25, c=35
print(s1 * 2)    # a=20, b=40, c=60

Index Alignment — NaN for Mismatched Labels

# When indices don't match, result is NaN
s3 = pd.Series([10, 20, 30], index=['a', 'b', 'c'])
s4 = pd.Series([1,  2,  3],  index=['b', 'c', 'd'])

print(s3 + s4)
# a     NaN    ← 'a' only in s3
# b    21.0   ← 20 + 1
# c    32.0   ← 30 + 2
# d     NaN   ← 'd' only in s4
# dtype: float64

Statistical Methods

Method	Description	Example (marks Series)
`s.sum()`	Sum of all values	`356`
`s.mean()`	Arithmetic mean	`71.2`
`s.max()`	Maximum value	`90`
`s.min()`	Minimum value	`45`
`s.count()`	Count of non-NaN values	`5`
`s.std()`	Standard deviation	`19.3...`
`s.median()`	Median value	`78.0`
`s.describe()`	Summary statistics (count, mean, std, min, quartiles, max)	Full stats table

Series — Output Prediction Quick Reference

Code	Output	Why
`s.head()`	First 5 elements	Default n=5
`s.loc['a':'c']`	Elements at 'a', 'b', 'c'	`loc` — end label 'c' IS included
`s.iloc[0:3]`	Elements at positions 0, 1, 2	`iloc` — end position 3 is NOT included
`pd.Series(5, index=[0,1,2])`	`0→5, 1→5, 2→5`	Scalar repeated for all indices
`s1 + s2` (mismatched index)	`NaN` for unmatched labels	Alignment by label — no match = NaN
`s.size`	Integer (total elements)	Attribute, NOT a method — no parentheses
`s[s > 60]`	Only elements where value > 60	Boolean indexing — condition as filter

Board Exam Power Tips — Avoid These Common Mistakes

loc is label-based; iloc is position-based. This is the most tested distinction. s.loc['b':'d'] includes 'd'. s.iloc[1:4] excludes position 4. Writing s.loc[1:3] when the index is letters (like 'a','b','c') will cause a KeyError — because 1, 2, 3 are not labels in that index.
Series created from a dictionary uses keys as index, values as data. The order of items in the dictionary determines the order in the Series (Python 3.7+ preserves insertion order).
When creating from a scalar, the index parameter is mandatory. pd.Series(5) creates a Series with one element. pd.Series(5, index=[0,1,2]) creates three elements all with value 5.
Operations between two Series align by index label, not position. If indices don't match, result is NaN (Not a Number) — a floating-point marker for missing data. This is a very common output-prediction question.
s.size is an attribute (no parentheses), s.count() is a method (with parentheses). Both give the number of elements BUT s.count() excludes NaN values while s.size counts everything. Writing s.size() causes a TypeError.
For multiple conditions in Boolean indexing, use & and |, NOT and/or. Also, every condition must be in parentheses: (s > 50) & (s < 80). Writing s > 50 & s < 80 without parentheses gives wrong results due to operator precedence.
Default value of n in head() and tail() is 5, not 10. This is tested as an MCQ — "What does s.head() return?" → First 5 elements.

Practice Questions (CBSE Board Level)

Q1 (2 marks): Write the output of the following code:

import pandas as pd
s = pd.Series([10, 20, 30], index=['A', 'B', 'C'])
print(s['B'])
print(s.iloc[0])

Output:

20
10

Explanation:
s['B'] performs label-based access and returns the value located at the index label 'B', which is 20.
s.iloc[0] performs position-based access and returns the value at the exact integer position 0 (the very first element), which is 10.

Q2 (2 marks): Write the output of the following code:

import pandas as pd
s = pd.Series({'Math': 85, 'Science': 90, 'English': 78})
print(s.head(2))
print(s.max())

Output:

Math       85
Science    90
dtype: int64
90

Explanation:
The head(2) method returns the first 2 elements of the Series.
The max() method evaluates all values in the Series and returns the highest numeric value, which is 90.

Q3 (2 marks): Write the output of the following code:

import pandas as pd
s = pd.Series([5, 10, 15, 20, 25])
print(s[s > 12])

Output:

2    15
3    20
4    25
dtype: int64

Explanation: This is an example of Boolean indexing. It filters the Series so that only elements strictly greater than 12 are returned. The elements at positions 2, 3, and 4 (values 15, 20, 25) satisfy this condition. Note that the original default integer index (2, 3, 4) is preserved in the final output.

Q4 (3 marks): Write a Python program to create a Series named temp from the dictionary {'Mon': 32, 'Tue': 35, 'Wed': 30, 'Thu': 33, 'Fri': 29}. Display: (i) the first 3 values, (ii) all temperatures above 31, and (iii) the average temperature.

Answer:

import pandas as pd
temp = pd.Series({'Mon': 32, 'Tue': 35, 'Wed': 30, 'Thu': 33, 'Fri': 29})

# (i) First 3 values
print(temp.head(3))

# (ii) Temperatures above 31
print(temp[temp > 31])

# (iii) Average temperature
print(temp.mean())

Expected Output:

Mon    32
Tue    35
Wed    30
dtype: int64

Mon    32
Tue    35
Thu    33
dtype: int64

31.8

Q5 (2 marks): What is the difference between loc[] and iloc[]? Give one example of each.

Explanation:

loc[] is strictly label-based indexing. You access elements using their explicit index label (name). When slicing with loc, both the start and end labels are included.
Example: s.loc['b':'d'] returns all elements starting from label 'b' up to and including label 'd'.

iloc[] is strictly position-based indexing. You access elements using their integer position (0-based indexing). When slicing with iloc, the end position is excluded.
Example: s.iloc[1:4] returns elements at integer positions 1, 2, and 3. The element at position 4 is not included.

Q6 (1 mark — MCQ): What will be the output of pd.Series(10, index=[0, 1, 2]).size?

A) 10
B) 1
C) 3
D) Error

Answer: C) 3.

Explanation: When you provide a single scalar value (10) alongside an index list, pandas automatically repeats that scalar to match the length of the index. pd.Series(10, index=[0, 1, 2]) creates a Series containing three 10s. The .size attribute simply returns the total number of elements, which is 3.

Q7 (2 marks): Write the output of the following code:

import pandas as pd
s1 = pd.Series([10, 20, 30], index=['a', 'b', 'c'])
s2 = pd.Series([1, 2, 3], index=['b', 'c', 'd'])
print(s1 + s2)

Output:

a     NaN
b    21.0
c    32.0
d     NaN
dtype: float64

Explanation (Index Alignment): Pandas automatically aligns data by index label during mathematical operations.
- Label 'a' exists only in s1 (no match in s2) → Results in NaN (Not a Number).
- Label 'b' matches: 20 + 1 = 21.0.
- Label 'c' matches: 30 + 2 = 32.0.
- Label 'd' exists only in s2 (no match in s1) → Results in NaN.
Because NaN is mathematically represented as a float in Python, the entire resulting Series is upcast to the float64 datatype.

Related Study Material

More Informatics practices Notes More Data Handling And Visualization Notes All Data Handling Using Pandas Topics

Practice Questions

Informatics practices Practice Data Handling Using Pandas Questions

Topic Information

DifficultyIntermediate

Est. Read Time26 minutes

CategoryData Handling Using Pandas

Quick Actions

Track Your Learning

Study Tips

Take notes of key points while reading

Practice related questions after studying

Review topics marked as "Learning" regularly

Test your knowledge with practice questions

Edvaya Target

Detailed Notes & Solved Examples

Master complex concepts with our rich, topic-wise study notes. Access hundreds of step-by-step solved examples and interactive flashcards right from your phone.

Edvaya Target

Edvaya Aspire

Take a Free Mock Test

Target Subjects

Aspire Subjects

Subject Mastery

Pandas Series — Creation, Indexing, Operations & Methods

1. Introduction to Pandas

2. Pandas Series — Meaning and Structure

3. Creating a Series — Four Methods

Method 1 — From a Python List

Method 2 — From a Python Dictionary

Method 3 — From a NumPy Array (ndarray)

Method 4 — From a Scalar Value

4. Series Attributes

5. head() and tail() Functions

6. Indexing and Slicing a Series

7. Boolean Indexing

8. Mathematical Operations on Series

Arithmetic Operations

Index Alignment — NaN for Mismatched Labels

Statistical Methods

Series — Output Prediction Quick Reference

Practice Questions (CBSE Board Level)

Related Study Material

Practice Questions

Topic Information

Quick Actions

Track Your Learning

Study Tips

Edvaya Target

We Value Your Privacy