Importing only every nth row

Jimmy_Neutron51 · January 17, 2023, 10:56pm

How to import only every nth row from a csv file to create a dataframe?

Hyder_Zaidi · January 19, 2023, 2:52pm

Hello Jimmy,

You can use the pandas library to import a CSV file and create a DataFrame. To import only every nth row, you can use the skiprows parameter in the read_csv function. This parameter takes an integer or a list of integers, representing the rows to skip.

For example, to import only every 3rd row:

import pandas as pd
df = pd.read_csv('file.csv', skiprows=lambda i: i % 3 != 0)

This will skip all rows where the index (i) divided by 3 has a remainder other than 0.

Alternatively, you can use the iloc method to select every nth row after loading the entire CSV file:

df = pd.read_csv('file.csv')
df = df.iloc[::n, :]

This will select every nth row, starting from the first row (0-indexed) and include all columns (:).