How to import only every nth row from a csv file to create a dataframe?
Hello Jimmy,
You can use the pandas
library to import a CSV file and create a DataFrame. To import only every nth row, you can use the skiprows
parameter in the read_csv
function. This parameter takes an integer or a list of integers, representing the rows to skip.
For example, to import only every 3rd row:
import pandas as pd
df = pd.read_csv('file.csv', skiprows=lambda i: i % 3 != 0)
This will skip all rows where the index (i) divided by 3 has a remainder other than 0.
Alternatively, you can use the iloc
method to select every nth row after loading the entire CSV file:
df = pd.read_csv('file.csv')
df = df.iloc[::n, :]
This will select every nth row, starting from the first row (0-indexed) and include all columns (:).