Differences between revisions 1 and 6 (spanning 5 versions)
Revision 1 as of 2024-01-15 21:19:26
Size: 384
Comment: Initial commit
Revision 6 as of 2024-01-16 03:46:59
Size: 2844
Comment: Added attrs
Line 2: Line 2:

A '''`DataFrame`''' is a 2-dimensional array of columnar data.

The [[Python/Pandas/Types|type]] is fully specified as `pandas.core.frame.DataFrame`.
Line 17: Line 21:


=== DType ===

A column whose values do not share a single data type is initialized with a [[Python/NumPy/Types#ObjectDType|dtype]] of `object`. Alternatives include:

 * `int64`
 * `float64`
 * `datetime64`
 * `bool`
 * `category`
Line 22: Line 38:

||'''Attribute'''||'''Meaning''' ||
||`axes` ||[[Python/Builtins/Types#List|list]] containing the values of the `index` and `columns` attributes ||
||`columns` ||[[Python/Pandas/Types#Index|Index]] listing the column names ||
||`dtypes` ||[[Python/Pandas/Series|Series]] listing the [[Python/NumPy/Types#ObjectDType|dtype]] of each column ||
||`iloc` ||[[Python/Pandas/Types#A_ILocIndexer|indexable accessor of member values]] by integer position ||
||`index` ||[[Python/Pandas/Types#Index|Index]] containing the row labels (a `RangeIndex` by default) ||
||`loc` ||[[Python/Pandas/Types#A_LocIndexer|indexable accessor of member values]] by label ||
||`shape` ||[[Python/Builtins/Types#Tuple|tuple]] of 2 [[Python/Builtins/Types#Int|ints]] representing number of rows and columns||
||`size` ||`int` count of member values ||
||`values` ||2-dimensional [[Python/NumPy/Types#NDArray|numpy.ndarray]] containing the member values ||
Line 29: Line 56:
||'''Method'''||'''Meaning''' ||
||`squeeze` ||return a [[Python/Pandas/Series|Series]] representation of the frame||
These methods return [[Python/NumPy/Types#Float64|numpy.float64]] values unless otherwise specified.

||'''Method'''||'''Meaning''' ||'''Example''' ||
||`head` ||return a new `DataFrame` of the first N rows ||`df.head(5)` ||
||`info` ||print information including types and null values || ||
||`squeeze` ||return a [[Python/Pandas/Series|Series]] if the frame has a single row or column|| ||
||`sum` ||return a `Series` of sums for each row or column ||`df.sum(axis="columns"); df.sum(axis="index")`||
||`tail` ||return a new `DataFrame` of the last N rows ||`df.tail(5)` ||

Python Pandas DataFrame

A DataFrame is a 2-dimensional array of columnar data.

The type is fully specified as pandas.core.frame.DataFrame.


Example
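
A minimal sketch of building a frame, assuming pandas is installed and imported under the conventional alias pd. The column names and values are hypothetical and chosen only for illustration.

```python
import pandas as pd

# Hypothetical sample data: each dictionary key becomes a column.
df = pd.DataFrame(
    {
        "name": ["alice", "bob", "carol"],
        "age": [31, 45, 27],
        "score": [88.5, 92.0, 79.25],
    }
)

print(df)                            # tabular view: 3 rows, 3 columns
print(isinstance(df, pd.DataFrame))  # True
```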


Data Model

DType

A column whose values do not share a single data type is initialized with a dtype of object. Alternatives include:

  • int64

  • float64

  • datetime64

  • bool

  • category
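
The dtypes listed above can be observed with a small sketch like the following. The column names are hypothetical. The exact datetime64 unit (for example ns or us) may vary between pandas versions, so the example only prints it.

```python
import pandas as pd

# Hypothetical columns, one per dtype discussed above.
df = pd.DataFrame(
    {
        "count": [1, 2, 3],           # consistent integers
        "ratio": [0.5, 1.5, 2.5],     # consistent floats
        "flag": [True, False, True],  # consistent booleans
        "mixed": [1, "two", 3.0],     # no consistent type
    }
)

# Datetime and categorical columns usually need an explicit conversion.
df["when"] = pd.to_datetime(["2024-01-15", "2024-01-16", "2024-01-17"])
df["group"] = pd.Series(["a", "b", "a"], dtype="category")

print(df.dtypes)  # one dtype per column; "mixed" is reported as object
```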


Attributes

||Attribute||Meaning||
||axes||list containing the values of the index and columns attributes||
||columns||Index listing the column names||
||dtypes||Series listing the dtype of each column||
||iloc||indexable accessor of member values by integer position||
||index||Index containing the row labels (a RangeIndex by default)||
||loc||indexable accessor of member values by label||
||shape||tuple of 2 ints representing number of rows and columns||
||size||int count of member values||
||values||2-dimensional numpy.ndarray containing the member values||
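
As a rough illustration of these attributes, the sketch below inspects a small hypothetical frame. The column names a and b are assumptions made for the example.

```python
import pandas as pd

# Hypothetical 3-row, 2-column frame with the default integer index.
df = pd.DataFrame({"a": [1, 2, 3], "b": [4.0, 5.0, 6.0]})

print(df.axes)     # list of [row index, column index]
print(df.columns)  # Index of column names
print(df.dtypes)   # Series with one dtype per column
print(df.index)    # RangeIndex, since no index was given
print(df.shape)    # (3, 2)
print(df.size)     # 6
print(df.values)   # 2-dimensional numpy.ndarray

# iloc selects by integer position; loc selects by label.
by_position = df.iloc[0, 1]
by_label = df.loc[0, "b"]
```

With the default index, row position 0 and row label 0 refer to the same row, so both accessors return the same value here. They diverge once the index is relabeled or reordered.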


Methods

These methods return numpy.float64 values unless otherwise specified.

||Method||Meaning||Example||
||head||return a new DataFrame of the first N rows||df.head(5)||
||info||print information including types and null values|| ||
||squeeze||return a Series view of the frame|| ||
||sum||return a Series of sums for each row or column||df.sum(axis="columns"); df.sum(axis="index")||
||tail||return a new DataFrame of the last N rows||df.tail(5)||
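
The methods in the table can be exercised with a short sketch such as this one. The frame and its column names are hypothetical. The squeeze call is applied to a single-column selection, which is one case where it yields a Series.

```python
import pandas as pd

# Hypothetical 4-row frame with one integer and one float column.
df = pd.DataFrame({"a": [1, 2, 3, 4], "b": [10.0, 20.0, 30.0, 40.0]})

top = df.head(2)     # new DataFrame holding the first 2 rows
bottom = df.tail(2)  # new DataFrame holding the last 2 rows

df.info()            # prints column dtypes and non-null counts

col_sums = df.sum(axis="index")    # Series with one sum per column
row_sums = df.sum(axis="columns")  # Series with one sum per row

single = df[["a"]].squeeze()       # single-column frame becomes a Series
```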


CategoryRicottone

Python/Pandas/DataFrame (last edited 2024-01-16 03:46:59 by DominicRicottone)