Loading, Querying, Joining, and Filtering Data Using Pandas
pandas
Loading data
import pandas as pd
xls = pd.ExcelFile('file.xlsx')
df = xls.parse('sheet_name') # creates a DataFrame for a specific sheet
Inspecting data
len(df) # number of rows
df.shape # tuple of (number of rows, number of columns)
df.count() # count of non-null values in each column
df.columns # access column headers
df.dtypes # view data types for each column
df.describe() # built-in summary statistics for numerical values
df.head() # first 5 rows
df.head(100) # first 100 rows
df = df.drop_duplicates() # removes duplicate rows (rows where ALL cells are identical), keeping the first occurrence
Querying data
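A minimal sketch of common ways to pull columns, rows, and single values out of a DataFrame. The sample data and the column names (`name`, `city`, `age`) are hypothetical and stand in for whatever was loaded from the spreadsheet.

```python
import pandas as pd

# Hypothetical sample data, for illustration only
df = pd.DataFrame({
    "name": ["Ana", "Ben", "Cal"],
    "city": ["Lima", "Oslo", "Lima"],
    "age": [34, 28, 45],
})

ages = df["age"]                # a single column, returned as a Series
subset = df[["name", "city"]]   # several columns, returned as a DataFrame
first_row = df.iloc[0]          # a row selected by integer position
ben_city = df.loc[1, "city"]    # one cell selected by index label and column name
cities = df["city"].unique()    # distinct values in a column
by_age = df.sort_values("age", ascending=False)  # rows sorted by a column
```

`iloc` selects by position and `loc` by label; with the default integer index the two often look the same, but they differ once rows are filtered or re-indexed.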
Casting data
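A minimal sketch of converting column types, assuming a hypothetical DataFrame whose `price` and `joined` columns were read in as text. The column names and values are illustrative only.

```python
import pandas as pd

# Hypothetical sample data, for illustration only
df = pd.DataFrame({
    "id": [1, 2, 3],
    "price": ["9.99", "12.50", "n/a"],
    "joined": ["2021-01-15", "2021-02-20", "2021-03-05"],
})

df["id"] = df["id"].astype(str)                            # integers to strings
df["price"] = pd.to_numeric(df["price"], errors="coerce")  # unparseable text becomes NaN
df["joined"] = pd.to_datetime(df["joined"])                # text to datetime values
```

`errors="coerce"` is one choice among several; leaving it out makes `pd.to_numeric` raise on values such as `"n/a"`, which may be preferable when bad input should stop the script.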
Cleaning data
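A minimal sketch of typical cleanup steps: trimming text, filling or dropping missing values, and renaming columns. The data and the column names (`name`, `score`, `test_score`) are hypothetical.

```python
import pandas as pd

# Hypothetical sample data, for illustration only
df = pd.DataFrame({
    "name": ["  Ana ", "ben", None],
    "score": [90.0, None, 75.0],
})

df["name"] = df["name"].str.strip().str.title()  # trim whitespace, normalize capitalization
df["score"] = df["score"].fillna(0)              # replace missing scores with 0
df = df.dropna(subset=["name"])                  # drop rows that have no name
df = df.rename(columns={"score": "test_score"})  # rename a column
```

Whether a missing value should be filled, dropped, or left alone depends on the data; filling with 0 here is only for illustration.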
Joining data
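A minimal sketch of combining two DataFrames on a shared key with `pd.merge`, and of stacking rows with `pd.concat`. The `customers` and `orders` tables and their columns are hypothetical.

```python
import pandas as pd

# Hypothetical sample data, for illustration only
customers = pd.DataFrame({
    "customer_id": [1, 2, 3],
    "name": ["Ana", "Ben", "Cal"],
})
orders = pd.DataFrame({
    "order_id": [10, 11, 12],
    "customer_id": [1, 1, 4],
    "total": [20.0, 35.5, 12.0],
})

# Inner join: keep only customer_id values present in both tables
inner = pd.merge(customers, orders, on="customer_id", how="inner")

# Left join: keep every customer; order columns are NaN where no order matches
left = pd.merge(customers, orders, on="customer_id", how="left")

# Stack rows from DataFrames that share the same columns
stacked = pd.concat([customers, customers], ignore_index=True)
```

In this sample the inner join returns two rows (both orders for customer 1), while the left join returns four, with missing order values for customers 2 and 3.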
Filtering data
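A minimal sketch of row filtering with boolean masks and with `DataFrame.query`. The sample data and column names are hypothetical.

```python
import pandas as pd

# Hypothetical sample data, for illustration only
df = pd.DataFrame({
    "name": ["Ana", "Ben", "Cal", "Dee"],
    "city": ["Lima", "Oslo", "Lima", "Rome"],
    "age": [34, 28, 45, 31],
})

over_30 = df[df["age"] > 30]                                  # single condition
lima_over_30 = df[(df["city"] == "Lima") & (df["age"] > 30)]  # AND: wrap each condition in parentheses
in_cities = df[df["city"].isin(["Oslo", "Rome"])]             # match against a list of values
not_lima = df[~(df["city"] == "Lima")]                        # negate a condition with ~
via_query = df.query("age > 30 and city == 'Lima'")           # same filter as lima_over_30, as a string
```

Conditions are combined with `&` and `|` rather than `and` and `or`, because the comparison produces a Series of booleans, one per row.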
Computing data
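A minimal sketch of column-level and grouped calculations. The sample data and the `city` and `sales` column names are hypothetical.

```python
import pandas as pd

# Hypothetical sample data, for illustration only
df = pd.DataFrame({
    "city": ["Lima", "Oslo", "Lima", "Oslo"],
    "sales": [100, 250, 300, 50],
})

total = df["sales"].sum()                     # sum of a column
average = df["sales"].mean()                  # mean of a column
per_city = df.groupby("city")["sales"].sum()  # sum of sales within each city
counts = df["city"].value_counts()            # number of rows per distinct city
```

`groupby` returns a Series indexed by the grouping column, so `per_city["Lima"]` looks up the total for one group.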
Updating/creating data
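A minimal sketch of changing existing values, deriving new columns, and appending a row. The sample data and column names (`item`, `price`, `qty`, `total`, `tier`) are hypothetical, and the 15 threshold is arbitrary.

```python
import pandas as pd

# Hypothetical sample data, for illustration only
df = pd.DataFrame({
    "item": ["pen", "book", "bag"],
    "price": [2.0, 15.0, 40.0],
    "qty": [10, 2, 1],
})

# Update values in rows that match a condition
df.loc[df["item"] == "pen", "price"] = 2.5

# Create a new column from existing columns
df["total"] = df["price"] * df["qty"]

# Create a new column by applying a function to each value
df["tier"] = df["price"].apply(lambda p: "high" if p >= 15 else "low")

# Append a row by concatenating a one-row DataFrame
new_row = pd.DataFrame({
    "item": ["cap"], "price": [8.0], "qty": [3], "total": [24.0], "tier": ["low"],
})
df = pd.concat([df, new_row], ignore_index=True)
```

Assigning through `df.loc[mask, column]` updates the DataFrame in place and avoids the chained-assignment pitfalls of `df[mask][column] = value`.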