Data dictionary

Titanic

The sinking of the Titanic is one of the most infamous shipwrecks in history. On April 15, 1912, during her maiden voyage, the widely considered “unsinkable” RMS Titanic sank after colliding with an iceberg. Unfortunately, there weren’t enough lifeboats for everyone onboard, resulting in the death of 1502 out of 2224 passengers and crew. While there was some element of luck involved in surviving, it seems some groups of people were more likely to survive than others.

| name | comment | pandas_dtype | nullable | allow_duplicates | checks |
| --- | --- | --- | --- | --- | --- |
| passengerid | Passenger ID | int64 | | | |
| survived | Survival, where 1 = survived and 0 = perished | int64 | | | {'isin': [0, 1]} |
| pclass | Passenger class. PClass can be a proxy for socio-economic status (SES): 1st ~ Upper; 2nd ~ Middle; 3rd ~ Lower | int64 | | | {'isin': [1, 2, 3]} |
| name | First and last name | str | | | {'str_length': {'min_value': 12, 'max_value': 82}} |
| sex | Passenger sex | Int64 | | | {'isin': [0, 1]} |
| age | Passenger age | float64 | | | {'in_range': {'min_value': 0, 'max_value': 80}} |
| sibsp | Number of siblings/spouses aboard. Sibling: brother, sister, stepbrother, or stepsister of passenger aboard Titanic. Spouse: husband or wife of passenger aboard Titanic (mistresses and fiancés ignored) | int64 | | | {'in_range': {'min_value': 0, 'max_value': 8}} |
| parch | Number of parents/children aboard. Parent: mother or father of passenger aboard Titanic. Child: son, daughter, stepson, or stepdaughter of passenger aboard Titanic | int64 | | | {'in_range': {'min_value': 0, 'max_value': 6}} |
| ticket | Ticket number | str | | | |
| fare | Passenger fare | float64 | | | {'in_range': {'min_value': 0, 'max_value': 513}} |
| cabin | Cabin number | str | | | |
| embarked | Port of embarkation, where C = Cherbourg; Q = Queenstown; S = Southampton | int64 | | | {'isin': ['S', 'C', 'Q']} |
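
The entries in the checks column can be read as simple per-value rules. The following is a minimal sketch of how a few of them might be applied to a single record in plain Python. It is an illustration, not the tool that produced this dictionary. The names `CHECKS`, `is_missing`, `check_value`, and `validate_row` are hypothetical, the range and length bounds are assumed to be inclusive, and missing values are skipped because the nullable column does not say how they should be treated.

```python
import math

# A subset of the checks column, copied from the dictionary above.
CHECKS = {
    "survived": {"isin": [0, 1]},
    "pclass": {"isin": [1, 2, 3]},
    "name": {"str_length": {"min_value": 12, "max_value": 82}},
    "age": {"in_range": {"min_value": 0, "max_value": 80}},
    "sibsp": {"in_range": {"min_value": 0, "max_value": 8}},
    "parch": {"in_range": {"min_value": 0, "max_value": 6}},
    "fare": {"in_range": {"min_value": 0, "max_value": 513}},
}


def is_missing(value):
    """Treat None and float NaN as missing (an assumption of this sketch)."""
    return value is None or (isinstance(value, float) and math.isnan(value))


def check_value(value, checks):
    """Return the names of the checks that `value` fails."""
    failed = []
    for check_name, arg in checks.items():
        if check_name == "isin":
            ok = value in arg
        elif check_name == "in_range":
            # Bounds are assumed inclusive on both ends.
            ok = arg["min_value"] <= value <= arg["max_value"]
        elif check_name == "str_length":
            ok = arg["min_value"] <= len(value) <= arg["max_value"]
        else:
            raise ValueError(f"unsupported check: {check_name}")
        if not ok:
            failed.append(check_name)
    return failed


def validate_row(row):
    """Map each failing column in `row` to the list of checks it failed."""
    errors = {}
    for column, checks in CHECKS.items():
        value = row.get(column)
        if is_missing(value):
            continue  # nullability is not specified, so missing values are skipped
        failed = check_value(value, checks)
        if failed:
            errors[column] = failed
    return errors


# A made-up record for illustration only, not a row from the dataset.
example = {"survived": 1, "pclass": 3, "name": "Jane Example Doe",
           "age": 29.0, "sibsp": 0, "parch": 0, "fare": 7.25}
print(validate_row(example))
```

A record that passes every listed check yields an empty dictionary; otherwise each offending column maps to the names of the checks it failed, which makes it easy to see which rule in the table a value violates.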